Web Archives Analysis at Scale with the Archives Unleashed Cloud

dc.contributor.author: Ruest, Nick
dc.contributor.author: Milligan, Ian
dc.date.accessioned: 2019-04-08T19:42:39Z
dc.date.available: 2019-04-08T19:42:39Z
dc.date.issued: 2019-04-08
dc.description: CNI 2019 Spring Membership Meeting
dc.description.abstract: Web archives, repositories of born-digital information dating back to the Internet Archive and national libraries in the mid-1990s, are fantastic resources of information covering topics of interest to humanities and social sciences scholars. Imagine a political historian studying elections, a historian studying youth culture in the late 1990s, or a scholar of the military or policy exploring how wars were reflected online. Yet while we have been collecting this information for over two decades, access has lagged: most scholars are limited to working with web archives one page at a time through portals such as the Wayback Machine. With the rise of the digital humanities, the computational social sciences, and web science more generally, scholars increasingly have the ability and desire to work with data at scale. In this presentation, we introduce the Archives Unleashed Cloud, currently supported through a grant from The Andrew W. Mellon Foundation. This service facilitates (a) the transfer of web archival data to the Cloud; (b) its analysis and transformation into standard scholarly derivatives; and (c) the building of a community around it via in-person events and learning guides. Our presentation introduces the Cloud and its motivation, discusses its technical underpinnings, and then explores our current sustainability plan to keep the Archives Unleashed Cloud running after our foundation funding ends in 2020.
dc.description.sponsorship: This work is primarily supported by the Andrew W. Mellon Foundation. Additional funding has come from the U.S. National Science Foundation, Columbia University Library's Mellon-funded Web Archiving Incentive Award, the Natural Sciences and Engineering Research Council of Canada, the Social Sciences and Humanities Research Council of Canada, and the Ontario Ministry of Research and Innovation's Early Researcher Award program.
dc.identifier.uri: http://hdl.handle.net/10315/36119
dc.language.iso: en
dc.rights: Attribution-ShareAlike 2.5 Canada
dc.rights.uri: http://creativecommons.org/licenses/by-sa/2.5/ca/
dc.subject: web archives
dc.subject: web archive analysis
dc.subject: sustainability
dc.subject: cloud computing
dc.title: Web Archives Analysis at Scale with the Archives Unleashed Cloud
dc.type: Presentation

Files

Original bundle
Name: CNI 2019 St-Louis.pdf
Size: 24.28 MB
Format: Adobe Portable Document Format
License bundle
Name: license.txt
Size: 1.83 KB
Format: Item-specific license agreed upon to submission