Walking the WALK; Facilitating Interdisciplinary Web Archive Collaboration

Date

2016-06-02

Authors

Ruest, Nick
Milligan, Ian

Journal Title

Journal ISSN

Volume Title

Publisher

Abstract

The growth of digital sources since the advent of the World Wide Web in 1991, and the commencement of widespread web archiving in 1996, presents profound new opportunities for social and cultural analysis. In simple terms, the 1990s cannot be studied without web archives: they are both primary sources that reflect how people consume and understand media, as well as repositories that document the thoughts, opinions, and activities of millions of everyday people. These are a dream for social historians.

However, with all this opportunity comes challenges: large data, the need for interdisciplinary collaboration between historians who might have the questions but not the technical resources or knowledge to work with these sources, and basic questions around what a web archive is and how to access them. Libraries and archives are perfectly positioned to work in this new emerging field that brings together historians, computer scientists, and information specialists.

In this talk, our speakers will discuss the fruits of one collaboration that has emerged at York University, the University of Alberta, and the University of Waterloo. Bringing together librarians, historians, and computer scientists, as well as an interdisciplinary team of undergraduate and graduate students, this distributed group is developing several web archival analytics projects.

They work using a combination of centralized and de-centralized infrastructure to run data analytics, store web archives, provide a publicly-facing portal, and collaborate. Ian and Nick will discuss the challenges of working in an interdisciplinary environment, and give insights into how the team has been working through in-detail case studies of their work with http://webarchives.ca, Twitter archiving and analysis, Compute Canada, and warcbase, a web analytics platform.

The combination of computer scientists and humanists is not always a simple one, but it has proven to be worthwhile.

Description

Keywords

shine, twitter, web archives, text analysis

Citation