Inference-Aware Fine-Tuning for Best-of-N Sampling in Large Language Models
Yinlam Chow, Guy Tennenholtz, Izzeddin Gur, Vincent Zhuang, Bo Dai, Aviral Kumar, Rishabh Agarwal, Sridhar Thiagarajan, Craig Boutilier, Aleksandra Faust
Published: 01 Jan 2025, Last Modified: 13 May 2025
ICLR 2025
CC BY-SA 4.0