How Reliable are Confidence Estimators for Large Reasoning Models? A Systematic Benchmark on High-Stakes Domains | OpenReview

How Reliable are Confidence Estimators for Large Reasoning Models? A Systematic Benchmark on High-Stakes Domains

Open Webpage

Reza Khanmohammadi, Erfan Miahi, Simerjot Kaur, Ivan Brugere, Charese H. Smiley, Kundan Thind, Mohammad M. Ghassemi

Published: 2026, Last Modified: 07 May 2026CoRR 2026EveryoneRevisionsBibTeXCC BY-SA 4.0

External IDs:dblp:journals/corr/abs-2601-08134

Loading