Hallucination Detection for Generative Large Language Models by Bayesian Sequential Estimation

Published: 07 Oct 2023, Last Modified: 01 Dec 2023
Venue: EMNLP 2023 Main
Submission Type: Regular Long Paper
Submission Track: Theme Track: Large Language Models and the Future of NLP
Submission Track 2: Interpretability, Interactivity, and Analysis of Models for NLP
Keywords: Hallucination detection, fact checking, Bayesian sequential estimation, generative large language models
TL;DR: A Bayesian-based method for hallucination detection in LLMs.
Abstract: Large Language Models (LLMs) have made remarkable advancements in the field of natural language generation. However, the propensity of LLMs to generate inaccurate or non-factual content, termed "hallucinations", remains a significant challenge. Current hallucination detection methods often require retrieving a large amount of relevant evidence, thereby increasing response times. We introduce a unique framework that leverages statistical decision theory and Bayesian sequential analysis to optimize the trade-off between costs and benefits during the hallucination detection process. This approach does not require a predetermined number of observations. Instead, the analysis proceeds sequentially, enabling an expeditious decision toward "belief" or "disbelief" through a stop-or-continue strategy. Extensive experiments show that this framework surpasses existing methods in both the efficiency and precision of hallucination detection. Furthermore, it requires fewer retrieval steps on average, thus decreasing response times.
Submission Number: 3015
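
The abstract's stop-or-continue strategy can be illustrated with a minimal sketch. Assuming a simple Beta-Bernoulli model in which each retrieved piece of evidence either supports or contradicts the generated statement, retrieval stops as soon as the posterior belief that the statement is factual crosses a decision threshold. The function name, thresholds, and evidence model below are illustrative assumptions, not the paper's exact formulation.

```python
def sequential_hallucination_check(
    evidence_stream,            # iterable of bool: True = supports, False = contradicts
    prior_alpha=1.0,            # Beta prior pseudo-count for "factual"
    prior_beta=1.0,             # Beta prior pseudo-count for "hallucinated"
    believe_threshold=0.9,      # stop with "belief" once the posterior mean exceeds this
    disbelieve_threshold=0.1,   # stop with "disbelief" once it falls below this
    max_steps=10,               # retrieval budget if neither threshold is crossed
):
    """Stop-or-continue check of one generated statement (illustrative sketch).

    Each retrieved piece of evidence is treated as a Bernoulli observation of
    whether the statement is factual; a conjugate Beta posterior is updated
    after every retrieval, and the loop stops as soon as the posterior mean
    crosses either decision threshold, so the number of observations is not
    fixed in advance.
    """
    alpha, beta = prior_alpha, prior_beta
    step = 0
    for step, supports in enumerate(evidence_stream, start=1):
        if supports:
            alpha += 1.0   # evidence supports the statement
        else:
            beta += 1.0    # evidence contradicts the statement
        p_factual = alpha / (alpha + beta)  # posterior mean of Beta(alpha, beta)
        if p_factual >= believe_threshold:
            return "belief", step
        if p_factual <= disbelieve_threshold:
            return "disbelief", step
        if step >= max_steps:
            break
    return "undecided", step


# Example: eight consecutive supporting passages push the posterior mean to
# 9/10 >= 0.9, so retrieval stops after 8 steps instead of a fixed budget.
decision, steps = sequential_hallucination_check([True] * 10)
print(decision, steps)  # -> belief 8
```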