Warclight: A Rails Engine for Web Archive Discovery

Title: Warclight: A Rails Engine for Web Archive Discovery
Author: Ruest, Nick; Milligan, Ian; Lin, Jimmy
Abstract: This paper describes the development of Warclight, a portmanteau of the open-source Blacklight platform and the ISO-standard Web ARChive file format. Warclight allows users to explore web archives that have been indexed into Apache Solr using the UK Web Archive's Web Archive Discovery tool. Referencing previous work, we explain how the standard search engine results page is inadequate to support scholarly inquiries. Instead, Warclight provides full-text and faceted search, as well as faceted browsing, to enable exploration and discovery. Given the large sizes of many web archives, we share experiences with deploying our tool at scale using a federated architecture.
Sponsor: This work was primarily supported by the Andrew W. Mellon Foundation and Compute Canada's Research Platforms and Portals program. Additional funding for the project has come from Start Smart Labs and the Social Sciences and Humanities Research Council of Canada.
Subject: web archives; information retrieval; faceted search
Type: Article
URI: http://hdl.handle.net/10315/36159
Citation: Nick Ruest, Ian Milligan, and Jimmy Lin. “Warclight: A Rails Engine for Web Archive Discovery.” Proceedings of the ACM/IEEE Joint Conference on Digital Libraries, Vol. 19 (2019).
Date: 2019
