OpenReview
.net
OpenReview
.net
Login
OpenReview
.net
Login
Go to
DBLP
homepage
CREAM: Consistency Regularized Self-Rewarding Language Models
Zhaoyang Wang
,
Weilei He
,
Zhiyuan Liang
,
Xuchao Zhang
,
Chetan Bansal
,
Ying Wei
,
Weitong Zhang
,
Huaxiu Yao
Published: 2025, Last Modified: 03 Oct 2025
ICLR 2025
Everyone
Revisions
BibTeX
CC BY-SA 4.0
External IDs:
dblp:conf/iclr/WangHLZBWZY25
Loading