ExVerus: Verus Proof Repair via Counterexample Reasoning

Published: 16 Jun 2026, Last Modified: 24 Jun 2026ICML 2026 Workshop DL4C PosterEveryoneRevisionsBibTeXCC BY-NC 4.0
Keywords: Software Verification, Proof Generation, Large Language Models, Software Engineering
TL;DR: Counterexample-guided proof repair driven by LLM
Abstract: Large Language Models (LLMs) have shown promising results in automating formal verification. However, existing approaches treat proof generation as a static, end-to-end prediction over source code, relying on limited verifier feedback and lacking access to concrete program behaviors. We present EXVERUS, a counterexample-guided framework that enables LLMs to reason about proofs using behavioral feedback via counterexamples. When a proof fails, EXVERUS automatically generates and validates counterexamples, and then guides the LLM to generalize them into inductive invariants to block these failures. Our evaluation shows that EXVERUS significantly improves proof accuracy, robustness, and token efficiency over the state-of-the-art prompting-based Verus proof generator.
Submission Number: 140
Loading