Conformal Agent Error Attribution

Published: 31 May 2026, Last Modified: 26 Jun 2026ICML 2026 Workshop AgenticUQ PosterEveryoneRevisionsBibTeXCC BY 4.0
Keywords: Multi-agent systems, conformal prediction, filtering
TL;DR: We propose novel conformal prediction algorithms that provide precise uncertainty guarantees for error attribution in multi-agent systems.
Abstract: When multi-agent systems (MAS) fail, identifying where the decisive error occurred is the first step for automated recovery to an earlier state. Error attribution remains a fundamental challenge due to the long and intertwined interaction traces that large language model-based MAS generate. This paper presents a framework for error attribution based on conformal prediction (CP) which provides finite-sample, distribution-free coverage guarantees. We introduce new algorithms for filtration-based CP designed for sequential data such as agent trajectories. Unlike existing CP algorithms, our approach predicts sets that are contiguous sequences, which is a crucial property to enable efficient recovery and debugging. We verify our theoretical guarantees on a variety of agents and datasets, show that errors can be precisely isolated, then use prediction sets to rollback MAS to correct their own errors. Our overall approach is model-agnostic, and offers a principled uncertainty layer for MAS error attribution.
Email Sharing: We authorize the sharing of all author emails with Program Chairs.
Data Release: We authorize the release of our submission and author names to the public in the event of acceptance.
Submission Number: 5
Loading