Stimulating In-Depth Confidence Estimation for LLMs through Reasoning over the Answer Space

Stimulating In-Depth Confidence Estimation for LLMs through Reasoning over the Answer Space

ACL ARR 2026 January Submission5681 Authors

05 Jan 2026 (modified: 20 Mar 2026)ACL ARR 2026 January SubmissionEveryoneRevisionsBibTeXCC BY 4.0

Keywords: confidence estimation, verbalized probability distribution, large language model, reinforcement learning

Abstract: Knowing the reliability of a model's response is essential in application. With the strong generation capabilities of LLMs, research has focused on generating verbalized confidence. This is further enhanced by combining chain-of-thought reasoning, which provides logical and transparent estimation. However, how reasoning strategies affect the estimated confidence is still under-explored. In this work, we demonstrate that predicting a verbalized probability distribution can effectively encourage in-depth reasoning for confidence estimation. Intuitively, it requires an LLM to consider all candidates within the answer space instead of basing on a single guess, and to carefully assign confidence scores to meet the requirements of a distribution. This method shows an advantage across different models and various tasks, regardless of whether the answer space is known. Its advantage is maintained even after reinforcement learning, and further analysis shows it promotes richer reasoning patterns, leading to better estimation.

Paper Type: Long

Research Area: Interpretability and Analysis of Models for NLP

Research Area Keywords: calibration/uncertainty, free-text/natural language explanations

Contribution Types: Model analysis & interpretability

Languages Studied: English

Submission Number: 5681

Loading