A Survey of Model Architectures in Information Retrieval

A Survey of Model Architectures in Information Retrieval

ACL ARR 2025 May Submission3962 Authors

19 May 2025 (modified: 03 Jul 2025)ACL ARR 2025 May SubmissionEveryoneRevisionsBibTeXCC BY 4.0

Abstract: The period from 2022 to the present has represented one of the biggest paradigm shifts in information retrieval (IR) and natural language processing (NLP). This work surveys the evolution of model architectures in IR, focusing on two key aspects: backbone models for feature extraction and end-to-end system architectures for relevance estimation. The review intentionally separates architectural considerations from training methodologies to provide a focused analysis of structural innovations in IR systems. We trace the development from traditional term-based methods to modern neural approaches, particularly highlighting the impact of transformer-based models and subsequent large language models (LLMs). We conclude with a forward-looking discussion of emerging challenges and future directions, including architectural optimizations for performance and scalability, handling of multimodal, multilingual data, and adaptation to novel application domains such as autonomous search agents that is beyond traditional search paradigms.

Paper Type: Long

Research Area: Information Retrieval and Text Mining

Research Area Keywords: passage retrieval, dense retrieval, document representation, re-ranking

Contribution Types: Surveys

Languages Studied: English

Submission Number: 3962

Loading