Rewarding the Location of Terms in Sentences to Enhance Probabilistic Information Retrieval

Liu, Baiyan

Rewarding the Location of Terms in Sentences to Enhance Probabilistic Information Retrieval

Files

Liu_Baiyan_2016_Masters.pdf (895.16 KB)

Date

2017-07-27

Authors

Liu, Baiyan

Abstract

In most traditional retrieval models, the weight (or probability) of a query term is estimated based on its own distribution or statistics. Intuitively, however, the nouns are more important in information retrieval and are more often found near the beginning and the end of sentences. In this thesis, we investigate the effect of rewarding the terms based on their location in sentences on information retrieval. Particularly, we propose a kernel-based method to capture the term placement pattern, in which a novel Term Location retrieval model is derived in combination with the BM25 model to enhance probabilistic information retrieval. Experiments on five TREC datasets of varied size and content indicates that the proposed model significantly outperforms the optimized BM25 and DirichletLM in MAP over all datasets with all kernel functions, and excels compared to the optimized BM25 and DirichletLM over most of the datasets in P@5 and P@20 with different kernel functions.

Keywords

Computer science

URI

http://hdl.handle.net/10315/33533

Collections

Information Systems and Technology

Full item page

Rewarding the Location of Terms in Sentences to Enhance Probabilistic Information Retrieval

Files

Date

Authors

Journal Title

Journal ISSN

Volume Title

Publisher

Abstract

Description

Keywords

Citation

URI

Collections