Toggle navigation
OpenReview
.net
Login
×
Back to
ICML
ICML 2025 Workshop AI4MATH Submissions
Inferring Loop Invariants for Program Verification: an Abductive Learning Perspective
Daiyang Luan
,
Ming Li
Published: 09 Jul 2025, Last Modified: 25 Jul 2025
AI4Math@ICML25 Poster
Readers:
Everyone
Optimizing Anytime Reasoning via Budget Relative Policy Optimization
Penghui Qi
,
Zichen Liu
,
Tianyu Pang
,
Chao Du
,
Wee Sun Lee
,
Min Lin
Published: 09 Jul 2025, Last Modified: 25 Jul 2025
AI4Math@ICML25 Poster
Readers:
Everyone
Do Not Let Low-Probability Tokens Over-Dominate in RL for LLMs
Zhihe Yang
,
Xufang Luo
,
Zilong Wang
,
Dongqi Han
,
Zhiyuan He
,
Dongsheng Li
,
Yunjian Xu
Published: 09 Jul 2025, Last Modified: 25 Jul 2025
AI4Math@ICML25 Poster
Readers:
Everyone
NoisyRollout: Reinforcing Visual Reasoning with Data Augmentation
Xiangyan Liu
,
Jinjie Ni
,
Zijian Wu
,
Chao Du
,
Longxu Dou
,
Haonan Wang
,
Tianyu Pang
,
Michael Qizhe Shieh
Published: 09 Jul 2025, Last Modified: 25 Jul 2025
AI4Math@ICML25 Poster
Readers:
Everyone
MM-Agent: LLM as Agents for Real-world Mathematical Modeling Problem
Fan Liu
,
Zhe-Rui Yang
,
Cancheng Liu
,
Tianrui Song
,
Xiaofeng Gao
,
Hao Liu
Published: 09 Jul 2025, Last Modified: 16 Jul 2025
AI4Math@ICML25 Poster
Readers:
Everyone
Training Language Models to Reason Efficiently
Daman Arora
,
Andrea Zanette
Published: 09 Jul 2025, Last Modified: 16 Jul 2025
AI4Math@ICML25 Poster
Readers:
Everyone
Spectral Policy Optimization: Coloring your Incorrect Reasoning in GRPO
Peter Chen
,
Xiaopeng Li
,
Ziniu Li
,
Xi Chen
,
Tianyi Lin
Published: 09 Jul 2025, Last Modified: 25 Jul 2025
AI4Math@ICML25 Poster
Readers:
Everyone
How Is LLM Reasoning Distracted by Irrelevant Context? An Analysis Using a Controlled Benchmark
Minglai Yang
,
Ethan Huang
,
Liang Zhang
,
Mihai Surdeanu
,
William Yang Wang
,
Liangming Pan
Published: 09 Jul 2025, Last Modified: 25 Jul 2025
AI4Math@ICML25 Poster
Readers:
Everyone
VerifierQ: Enhancing LLM Test Time Compute with Q-Learning-based Verifiers
Jianing Qi
,
Hao Tang
,
Zhigang Zhu
Published: 09 Jul 2025, Last Modified: 25 Jul 2025
AI4Math@ICML25 Poster
Readers:
Everyone
A Survey on Large Language Model Reasoning Failures
Peiyang Song
,
Pengrui Han
,
Noah Goodman
Published: 09 Jul 2025, Last Modified: 25 Jul 2025
AI4Math@ICML25 Poster
Readers:
Everyone
Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?
Yang Yue
,
Zhiqi Chen
,
Rui Lu
,
Andrew Zhao
,
Zhaokai Wang
,
Yang Yue
,
Shiji Song
,
Gao Huang
Published: 09 Jul 2025, Last Modified: 25 Jul 2025
AI4Math@ICML25 Oral
Readers:
Everyone
Verifying Prompt-Induced Search-Space Shifts in LLM-Generated Mathematical Functions
Shervin Ardeshir
Published: 09 Jul 2025, Last Modified: 25 Jul 2025
AI4Math@ICML25 Poster
Readers:
Everyone
«
‹
1
2
3
4
5
›
»