Agentic Uncertainty Reveals Agentic Overconfidence

Jean Kaddour; Srijan Patel; Gbetondji Jean-Sebastien Dovonon; Leo Richter; Pasquale Minervini; Matt J. Kusner

Agentic Uncertainty Reveals Agentic Overconfidence

Jean Kaddour, Srijan Patel, Gbetondji Jean-Sebastien Dovonon, Leo Richter, Pasquale Minervini, Matt J. Kusner

Published: 01 Mar 2026, Last Modified: 15 Apr 2026ICLR 2026 AIWILDEveryoneRevisionsCC BY 4.0

Keywords: agents, uncertainty, llms

TL;DR: Can AI agents predict whether they will succeed at a task? We study agentic uncertainty by eliciting success probability estimates before, during, and after task execution. All agents exhibit agentic overconfidence.

Abstract: Can AI agents predict whether they will succeed at a task? We study agentic uncertainty by eliciting success probability estimates before, during, and after task execution. All results exhibit agentic overconfidence: some agents that succeed only 22% of the time predict 77% success. Counterintuitively, pre-execution assessment with strictly less information achieves better discrimination than standard post-execution review. Adversarial prompting reframing assessment as bug-finding achieves the best calibration. Code is available at https://github.com/sevn-ai/agentic-uncertainty

Email Sharing: We authorize the sharing of all author emails with Program Chairs.

Data Release: We authorize the release of our submission and author names to the public in the event of acceptance.

Submission Number: 138

Loading