How Reliable are Confidence Estimators for Large Reasoning Models? A Systematic Benchmark on High-Stakes Domains

Reza Khanmohammadi, Erfan Miahi, Simerjot Kaur, Ivan Brugere, Charese H. Smiley, Kundan Thind, Mohammad M. Ghassemi

Published: 2026, Last Modified: 07 May 2026, CoRR 2026, CC BY-SA 4.0