Toggle navigation
OpenReview
.net
Login
×
Go to
DBLP
homepage
Building Math Agents with Multi-Turn Iterative Preference Learning
Wei Xiong
,
Chengshuai Shi
,
Jiaming Shen
,
Aviv Rosenberg
,
Zhen Qin
,
Daniele Calandriello
,
Misha Khalman
,
Rishabh Joshi
,
Bilal Piot
,
Mohammad Saleh
,
Chi Jin
,
Tong Zhang
,
Tianqi Liu
Published: 01 Jan 2025, Last Modified: 18 May 2025
ICLR 2025
Everyone
Revisions
BibTeX
CC BY-SA 4.0
Loading