BrokenMath: A Benchmark for Sycophancy in Theorem Proving with LLMs

Ivo Petrov, Jasper Dekoninck, Martin T. Vechev

Published: 2025, Last Modified: 29 Mar 2026CoRR 2025EveryoneRevisionsBibTeXCC BY-SA 4.0
Loading