DSpace Repository

The Great WARC Adventure: WARCs from creation to use

The Great WARC Adventure: WARCs from creation to use

Show full item record

Title: The Great WARC Adventure: WARCs from creation to use
Author: Ruest, Nick
Milligan, Ian
Abstract: We live in a reality where documents are born, revised and disseminated online. Every day, users record their thoughts, feelings, locations, ratings, votes, comments, reviews, jokes, and so forth; an assemblage of traces of the past that historians will be able to mold into historical narratives. Luckily, we have some established standards for preserving and disseminating web archives, and emerging processes for analysis.

This presentation will cover a historical overview of web archiving, how best to both capture and preserve websites, and make them discoverable and usable using open source tools that can be easily replicated by other organizations, the interplay of the archivist and historian with respect to web archives, and finally ways to access web archives beyond the Wayback Machine using open-source tools such as WARC Tools, Apache Solr, and Carrot2 Workbench. Two web archive examples are used for this practical hands-on component: a collection of websites concerning the Dale Askey legal case with Edwin Mellen Press (the #freedaleaskey collection) and a case study collection of archived websites from the .ca top-level domain (amounting to 4.7% in total).
Subject: web archiving
textual analysis
digital history
digital preservation
Type: Presentation
Rights: Attribution-NonCommercial-ShareAlike 2.5 Canada
URI: http://hdl.handle.net/10315/27544
Date: 2014-06-26

Files in this item

The following license files are associated with this item:

This item appears in the following Collection(s)

Attribution-NonCommercial-ShareAlike 2.5 Canada Except where otherwise noted, this item's license is described as Attribution-NonCommercial-ShareAlike 2.5 Canada