Proof-RM: A Scalable and Generalizable Reward Model for Math Proof

Haotong Yang, Zitong Wang, Shijia Kang, Siqi Yang, Wenkai Yu, Xu Niu, Yike Sun, Yi Hu, Zhouchen Lin, Muhan Zhang

Published: 2026, Last Modified: 30 May 2026CoRR 2026EveryoneRevisionsBibTeXCC BY-SA 4.0
Loading