Now showing items 1-3 of 3
Hands on with warcbase
Warcbase is an open-source platform for managing web archives built on Hadoop and HBase. The platform provides a flexible data model for storing and managing raw content as well as metadata and extracted knowledge. Tight ...
Engaging the Public with Web Archives: Providing Access to 10 Years of Political History with WebArchives.ca
Introduction The growth of digital sources since the advent of the World Wide Web in 1990-91 presents profound opportunities for historians. Large web archives contain billions of webpages, and now make it possible ...
The Archives Unleashed Notebook: Madlibs for Jumpstarting Scholarly Exploration
This paper introduces the Archives Unleashed Notebook, which is designed to work with derivative datasets from the Archives Unleashed Cloud, a platform for analyzing web archives. These datasets contain common starting ...