Toggle navigation
OpenReview
.net
Login
×
Go to
DBLP
homepage
OpenPRM: Building Open-domain Process-based Reward Models with Preference Trees
Kaiyan Zhang
,
Jiayuan Zhang
,
Haoxin Li
,
Xuekai Zhu
,
Ermo Hua
,
Xingtai Lv
,
Ning Ding
,
Biqing Qi
,
Bowen Zhou
Published: 01 Jan 2025, Last Modified: 13 May 2025
ICLR 2025
Everyone
Revisions
BibTeX
CC BY-SA 4.0
Loading