Solr Integration in the Anserini Information Retrieval Toolkit
Anserini is an open-source information retrieval toolkit built around Lucene to facilitate replicable research. In this demonstration, we examine different architectures for Solr integration in order to address two current ...
Lowering the Barrier to Access: The Archives Unleashed Cloud Project
(The web that was: archives, traces, reflections RESAW 2019*, 2019-06-19)
The Archives Unleashed Project aims to make petabytes of historical internet content accessible to scholars and others interested in researching the recent past. We respond to one of the major issues facing web archiving ...
Building Successful, Open Repository Software Ecosystems: Technology and Community
Archivematica, AtoM (Access to Memory), Fedora, Hydra, and Islandora provide a set of functions which contribute to a diverse curation and repository ecosystem. They are also projects existing in a greater open source ...
Islandora and Fedora 4; The Atonement.
In the context of repository platforms, Islandora has a fair bit of age, and with that a fair bit of cruft. In the early winter of 2014/2015 the Islandora community began working on a project plan to outline what would be ...
Turn, Turn, Turn: Seasons in the life of a digital object - Through the lens of the digital curation lifecycle
More and more, some of the most important assets libraries have are digital. We need to curate these objects with the same rigor and expertise that we apply to our physical collections. Follow a digital object through the ...
Upgrading? Migrating? There’s a portmanteau for that!
Fedora 4, the new, revitalized version of Fedora, boasts a feature set that includes improvements in scalability, linked data capabilities, research data support, modularity, and more. Since the launch of Fedora 4.0, a ...
Sharing and preserving your research data
The Ontario Library Research Cloud: From Idea to Infrastructure in Three Easy Years
The members of the Ontario Council of University Libraries required a viable and affordable offsite storage solution for their preservation programs. Existing options may have solved the needs in the short term, while ...
Hands on with warcbase
Warcbase is an open-source platform for managing web archives built on Hadoop and HBase. The platform provides a flexible data model for storing and managing raw content as well as metadata and extracted knowledge. Tight ...
Warc? Warkwark wark? Wark Warrrk
"Libraries have always been at the heart of the research process, but new ways of searching, writing, and publishing are significantly altering this relationship. What will the research enterprise look like in five, ten, ...