Recent Advances in Generative Information Retrieval
Abstract: Generative retrieval (GR) has witnessed significant growth recently in the area of information retrieval.
Compared to the traditional ``index-retrieve-then-rank'' pipeline, the GR paradigm aims to consolidate all information within a corpus into a single model.
Typically, a sequence-to-sequence model is trained to directly map a query to its relevant document identifiers (i.e., docids).
This tutorial offers an introduction to the core concepts of the GR paradigm and a comprehensive overview of recent advances in its foundations and applications.
We start by providing preliminary information covering foundational aspects and problem formulations of GR.
Then, our focus shifts towards recent progress in docid design, training approaches, inference strategies, and applications of GR.
We end by outlining challenges and issuing a call for future GR research.
Throughout the tutorial we highlight the availability of relevant resources so as to enable a broad audience to contribute to this topic.
This tutorial is intended to be beneficial to both researchers and industry practitioners interested in developing novel GR solutions or applying them in real-world scenarios.
Loading