Conformal Reasoning: Uncertainty Estimation in Interactive Environments

Eric Frankel; Shuyue Stella Li; Lillian J. Ratliff; Yulia Tsvetkov; Sewoong Oh; Pang Wei Koh

Conformal Reasoning: Uncertainty Estimation in Interactive Environments

Eric Frankel, Shuyue Stella Li, Lillian J. Ratliff, Yulia Tsvetkov, Sewoong Oh, Pang Wei Koh

28 Sept 2024 (modified: 05 Feb 2025)Submitted to ICLR 2025EveryoneRevisionsBibTeXCC BY 4.0

Keywords: Conformal Prediction, Uncertainty Quantification, LM Abstention, Interactive Reasoning

TL;DR: We propose conformal reasoning, a method to provide confidence guarantees on LLM output in multi-turn interactive settings by inferring when to abstain from answering.

Abstract: We introduce conformal reasoning, a principled method for models in interactive environments to reason about their uncertainty and decide whether to seek out more information or to return a prediction. The challenge with standard conformal prediction---a popular statistical framework for uncertainty estimation that constructs prediction sets with formal coverage guarantees---is that it relies on a fixed set of calibration data points. In interactive environments, however, the calibration trajectories require certain termination criteria determined a priori, introducing heuristic bias and/or circular dependency that break the assumptions needed for coverage guarantees. We address this issue by building on adaptive conformal inference techniques. On two real-world tasks on medical diagnosis and embodied question answering, we show that conformal reasoning empirically achieves its theoretical coverage guarantees---in contrast with standard conformal prediction approaches that can significantly over- or under-cover---while improving exploration efficiency by approximately 20% on both tasks.

Primary Area: foundation or frontier models, including LLMs

Code Of Ethics: I acknowledge that I and all co-authors of this work have read and commit to adhering to the ICLR Code of Ethics.

Submission Guidelines: I certify that this submission complies with the submission instructions as described on https://iclr.cc/Conferences/2025/AuthorGuide.

Anonymous Url: I certify that there is no URL (e.g., github page) that could be used to find authors’ identity.

No Acknowledgement Section: I certify that there is no acknowledgement section in this submission for double blind review.

Submission Number: 13330

Loading