
Warclight: A Rails Engine for Web Archive Discovery

dc.contributor.author: Ruest, Nick
dc.contributor.author: Milligan, Ian
dc.contributor.author: Lin, Jimmy
dc.date.accessioned: 2019-04-19T23:02:55Z
dc.date.available: 2019-04-19T23:02:55Z
dc.date.issued: 2019
dc.description.abstract: This paper describes the development of Warclight, a portmanteau of the open-source Blacklight platform and the ISO-standard Web ARChive file format. Warclight allows users to explore web archives that have been indexed into Apache Solr using the UK Web Archive's Web Archive Discovery tool. Referencing previous work, we explain how the standard search engine results page is inadequate to support scholarly inquiries. Instead, Warclight provides full-text and faceted search, as well as faceted browsing, to enable exploration and discovery. Given the large sizes of many web archives, we share experiences with deploying our tool at scale using a federated architecture. [en_US]
dc.description.sponsorship: This work was primarily supported by the Andrew W. Mellon Foundation and Compute Canada's Research Platforms and Portals program. Additional funding for the project has come from Start Smart Labs and the Social Sciences and Humanities Research Council of Canada.
dc.identifier.citation: Nick Ruest, Ian Milligan, and Jimmy Lin. “Warclight: A Rails Engine for Web Archive Discovery.” Proceedings of the ACM/IEEE Joint Conference on Digital Libraries, Vol. 19 (2019).
dc.identifier.issn: 978-1-7281-1547-4/19
dc.identifier.uri: http://hdl.handle.net/10315/36159
dc.identifier.uri: https://doi.org/10.1109/JCDL.2019.00110
dc.language.iso: en
dc.rights.uri: https://doi.org/10.1109/JCDL.2019.00110
dc.subject: web archives [en]
dc.subject: information retrieval [en]
dc.subject: discovery [en]
dc.subject: Blacklight [en]
dc.subject: faceted search [en]
dc.title: Warclight: A Rails Engine for Web Archive Discovery [en]
dc.type: Article

Files

Original bundle

Name: warclight.pdf
Size: 706.95 KB
Format: Adobe Portable Document Format
Description: Main article

Name: JCDL-Warclight-Poster-Draft-3x4.pdf
Size: 8.95 MB
Format: Adobe Portable Document Format
Description: JCDL Poster

License bundle

Name: license.txt
Size: 1.83 KB
Format: Item-specific license agreed upon to submission