Optimizing LLM Based Retrieval Augmented Generation Pipelines in the Financial Domain

Hanoz Bhathena, Yiyun Zhao, Prateek Singh, Bernardo Ramos, Aviral Joshi, Swaroop Gadiyaram, Saket Sharma

Published: 16 Jun 2024, Last Modified: 24 Mar 2026NAACL 2024 Industry TrackEveryoneRevisionsCC BY 4.0

Abstract: Retrieval Augmented Generation (RAG) is a prominent approach in real-word applications for grounding large language model (LLM) generations in up to date and domain-specific knowledge. However, there is a lack of systematic investigations of the impact of each component (retrieval quality, prompts, generation models) on the generation quality of a RAG pipeline in real world scenarios. In this study, we benchmark 6 LLMs in 15 retrieval scenarios exploring 9 prompts over 2 real world financial domain datasets. We thoroughly discuss the impact of each component in RAG pipeline on answer generation quality and formulate specific recommendations for the design of RAG systems.