Kullback-Leibler Divergence Revisited

Published: 2017, Last Modified: 27 Apr 2025ICTIR 2017EveryoneRevisionsBibTeXCC BY-SA 4.0
Abstract: Thee KL divergence is the most commonly used measure for comparing query and document language models in the language modeling framework to ad hoc retrieval. Since KL is rank equivalent to a specific weighted geometric mean, we examine alternative weighted means for language-model comparison, as well as alternative divergence measures. The study includes analysis of the inverse document frequency (IDF) effect of the language-model comparison methods. Empirical evaluation, performed with different types of queries (short and verbose) and query-model induction approaches, shows that there are methods that often outperform the KL divergence in some settings.
Loading