Solr Integration in the Anserini Information Retrieval Toolkit

dc.contributor.author: Clancy, Ryan
dc.contributor.author: Eskildsen, Toke
dc.contributor.author: Ruest, Nick
dc.contributor.author: Lin, Jimmy
dc.date.accessioned: 2019-05-15T00:37:50Z
dc.date.available: 2019-05-15T00:37:50Z
dc.date.issued: 2019
dc.description.abstract: Anserini is an open-source information retrieval toolkit built around Lucene to facilitate replicable research. In this demonstration, we examine different architectures for Solr integration in order to address two current limitations of the system: the lack of an interactive search interface and support for distributed retrieval. Two architectures are explored: In the first approach, Anserini is used as a frontend to index directly into a running Solr instance. In the second approach, Lucene indexes built directly with Anserini can be copied into a Solr installation and placed under its management. We discuss the tradeoffs associated with each architecture and report the results of a performance evaluation comparing indexing throughput. To illustrate the additional capabilities enabled by Anserini/Solr integration, we present a search interface built using the open-source Blacklight discovery interface. [en_US]
dc.description.sponsorship: This work was supported in part by the Natural Sciences and Engineering Research Council (NSERC) of Canada, the Canada Foundation for Innovation Leaders Fund, and the Ontario Research Fund.
dc.identifier.citation: Ryan Clancy, Toke Eskildsen, Nick Ruest, and Jimmy Lin. “Solr Integration in the Anserini Information Retrieval Toolkit.” Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval (2019).
dc.identifier.isbn: 978-1-4503-6172-9/19/07
dc.identifier.uri: https://doi.org/10.1145/3331184.3331401 [en_US]
dc.identifier.uri: http://hdl.handle.net/10315/36205
dc.language.iso: en
dc.subject: Distributed retrieval [en]
dc.subject: SolrCloud [en]
dc.subject: Solr [en]
dc.subject: Information systems [en]
dc.subject: Search engine architectures and scalability; Information retrieval [en]
dc.subject: anserini [en]
dc.subject: Blacklight [en]
dc.subject: Lucene [en]
dc.title: Solr Integration in the Anserini Information Retrieval Toolkit [en]
dc.type: Article

Files

Original bundle
Name: anserini-solr.pdf
Size: 775.56 KB
Format: Adobe Portable Document Format
License bundle
Name: license.txt
Size: 1.83 KB
Format: Item-specific license agreed upon to submission