SPaR: Self-Play with Tree-Search Refinement to Improve Instruction-Following in Large Language Models
Jiale Cheng, Xiao Liu, Cunxiang Wang, Xiaotao Gu, Yida Lu, Dan Zhang, Yuxiao Dong, Jie Tang, Hongning Wang, Minlie Huang
Published: 01 Jan 2025, Last Modified: 16 May 2025
ICLR 2025
CC BY-SA 4.0