Now showing items 1-20 of 46
Islandora: Creating and Sustaining an Open Source Community
Three years have passed since the formation of the Islandora Foundation was announced at Open Repositories 2013. Since that time, the project has welcomed more than two dozen supporting institutions, hosted Islandora Camps ...
Gettin Sh*t Done in the Digital Archives
We live in a reality where official documents are born, revised and disseminated online. Most post-secondary institutions have record retention schedules to facilitate the transfer of official records with lasting historical ...
Project Sustainability and Research Platforms: The Archives Unleashed Cloud Project
(IIPC Web Archiving Conference 2019, 2019-06-07)
The Archives Unleashed Project, founded in 2017 with funding from the Andrew W. Mellon Foundation, aims to make petabytes of historical internet content accessible to scholars and others interested in researching the recent ...
See a little Warclight: building an open-source web archive portal with project blacklight
(IIPC Web Archiving Conference 2019, 2019-06-06)
In 2014-15, due to close collaboration between UK-based researchers and the UK Web Archive, the open-source Shine project was launched. It allowed faceted search, trend diagram exploration, and other advanced methods of ...
Building Community and Tools for Analyzing Web Archives through Datathons
Starting in March 2016, the Archives Unleashed team and our collaborators have brought together social scientists, humanists, archivists, librarians, computer scientists, and other stakeholders to explore web archives as ...
The Cost of a WARC: Analyzing Web Archives in the Cloud
The value of web archives to support scholarship in the humanities and social sciences is slowly being realized by the increasing availability of scalable tools and platforms. The cost of providing scholarly access is a ...
Sustainability of Community-owned Repository Software: A Call to Action
Sustainability of open-source software is a continual challenge in the relatively small world of cultural heritage institutions. The challenge is amplified due to the critical preservation implications tied to institutional ...
Active Digital Preservation and Data/Metadata Migration
Digital preservation activities increasingly focus on the movement of data and metadata between systems. This panel will present case studies in moving content through preservation activities with APTrust, the Digital ...
Warclight: A Rails Engine for Web Archive Discovery
This paper describes the development of Warclight, a portmanteau of the open-source Blacklight platform and the ISO-standard Web ARChive file format. Warclight allows users to explore web archives that have been indexed ...
Scalable Content-Based Analysis of Images in Web Archives with TensorFlow and the Archives Unleashed Toolkit
We demonstrate the integration of the Archives Unleashed Toolkit, a scalable platform for exploring web archives, with Google's TensorFlow deep learning toolkit to provide scholars with content-based image analysis ...
The Archives Unleashed Notebook: Madlibs for Jumpstarting Scholarly Exploration
This paper introduces the Archives Unleashed Notebook, which is designed to work with derivative datasets from the Archives Unleashed Cloud, a platform for analyzing web archives. These datasets contain common starting ...
The Great WARC Adventure: WARCs from creation to use
We live in a reality where documents are born, revised and disseminated online. Every day, users record their thoughts, feelings, locations, ratings, votes, comments, reviews, jokes, and so forth; an assemblage of traces ...
Digital Preservation Tools, Practices, and Policies in Islandora
There exists many standards and best practices in the digital preservation community, but not many of these practices are implemented as easy to use tools in our digital repository platforms. This presentation will focus ...
Open Source Sustainability in Digital Curation/Preservation Software
Open source sustainability is hard. This talk will outline what the Islandora and Fedora communities have done to address sustainability in their projects, as well as touch in the critical need for sustainability around ...
Solr Integration in the Anserini Information Retrieval Toolkit
Anserini is an open-source information retrieval toolkit built around Lucene to facilitate replicable research. In this demonstration, we examine different architectures for Solr integration in order to address two current ...
Web Archives Analysis at Scale with the Archives Unleashed Cloud
Web archives, repositories of born-digital information dating back to the Internet Archive and national libraries in the mid-1990s, are fantastic resources of information covering topics of interest to humanities and social ...
Lowering the Barrier to Access: The Archives Unleashed Cloud Project
(The web that was: archives, traces, reflections RESAW 2019*, 2019-06-19)
The Archives Unleashed Project, aims to make petabytes of historical internet content accessible to scholars and others interested in researching the recent past. We respond to one of the major issues facing web archiving ...
CURATEcamp iPres 2012
Mark Jordan, Courtney Mumma, Nick Ruest and the participants of CURATEcamp iPres 2012 report on this unconference for digital curation practitioners and researchers, held on 2 October 2012 in Toronto.
OCUL Digital Curation Summit: Digital Curation Life-cycle & Blue Ribbon Task Force on Sustainable Digital Preservation & Access
Presentation slides for Digital Curation Life-cycle and Blue Ribbon Task Force on Sustainable Digital Preservation & Access, and corresponding worksheets + scenarios.
d3 Data Visualization Bootcamp
Brief introduction of data visualization concepts, brief introduction of d3, and a walkthrough of three exercises using library datasets.