A Simple Enhancement for Ad-hoc Information Retrieval via Topic Modelling

2016 (modified: 04 Oct 2023) | SIGIR 2016 | Readers: Everyone
Abstract: Traditional information retrieval (IR) models, in which a document is normally represented as a bag of words and their frequencies, capture term-level and document-level information. Topic models, on the other hand, discover semantic topic-based information among words. In this paper, we consider term-based information and semantic information as two features of query terms and propose a simple enhancement for ad-hoc IR via topic modeling. In particular, three topic-based hybrid models, LDA-BM25, LDA-MATF and LDA-LM, are proposed. A series of experiments on eight standard datasets shows that the proposed models significantly outperform the corresponding strong baselines on all datasets in terms of MAP and on most datasets in terms of two precision-at-k (P@k) measures. A direct comparison on eight standard datasets also indicates that the proposed models are at least comparable to state-of-the-art approaches.
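
The idea of treating term-based and topic-based information as two features of a query term can be illustrated with a toy sketch. The abstract does not give the actual combination formula, so everything below is an assumption made for illustration: the linear interpolation, the weight `lam`, the function names, and the hand-written topic distributions (standing in for a trained LDA model) are hypothetical and should not be read as the authors' LDA-BM25 definition.

```python
import math

def bm25_term(tf, df, doc_len, avg_len, n_docs, k1=1.2, b=0.75):
    """Okapi BM25 weight of one query term in one document."""
    idf = math.log((n_docs - df + 0.5) / (df + 0.5) + 1.0)
    norm = tf + k1 * (1.0 - b + b * doc_len / avg_len)
    return idf * tf * (k1 + 1.0) / norm

def topic_term_prob(term, doc_topics, topic_words):
    """Topic-based P(term | doc) = sum over topics z of P(term | z) * P(z | doc)."""
    return sum(p_z * topic_words[z].get(term, 0.0)
               for z, p_z in enumerate(doc_topics))

def hybrid_score(query, doc, stats, doc_topics, topic_words, lam=0.3):
    """Assumed combination: per-term linear interpolation of the two features."""
    score = 0.0
    for term in query:
        term_part = bm25_term(doc.count(term), stats["df"].get(term, 0),
                              len(doc), stats["avg_len"], stats["n_docs"])
        topic_part = topic_term_prob(term, doc_topics, topic_words)
        score += (1.0 - lam) * term_part + lam * topic_part
    return score

# Toy data (hypothetical): two topics with hand-set word distributions.
topic_words = [
    {"topic": 0.4, "model": 0.4, "retrieval": 0.2},    # topic 0
    {"query": 0.3, "ranking": 0.3, "retrieval": 0.4},  # topic 1
]
doc = ["query", "ranking", "retrieval", "retrieval"]
doc_topics = [0.1, 0.9]  # assumed P(z | doc)
stats = {"n_docs": 2, "avg_len": 3.5,
         "df": {"topic": 1, "model": 1, "query": 1, "ranking": 1, "retrieval": 2}}

# "model" never occurs in doc, so the BM25 part is zero; only the topic
# feature can contribute to the score.
term_only = hybrid_score(["model"], doc, stats, doc_topics, topic_words, lam=0.0)
mixed = hybrid_score(["model"], doc, stats, doc_topics, topic_words, lam=0.5)
```

In this toy setup a query term that is absent from the document contributes nothing under the term-based feature alone, but still receives a small score through the topic feature. The two features are on different scales here, so a practical system would need to normalise them or tune the weight.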