Correcting the Mythos of KL-Regularization: Direct Alignment without Overoptimization via Chi-Squared Preference Optimization | OpenReview

Correcting the Mythos of KL-Regularization: Direct Alignment without Overoptimization via Chi-Squared Preference Optimization

Open Webpage

Audrey Huang, Wenhao Zhan, Tengyang Xie, Jason D. Lee, Wen Sun, Akshay Krishnamurthy, Dylan J. Foster

Published: 2025, Last Modified: 01 Oct 2025ICLR 2025EveryoneRevisionsBibTeXCC BY-SA 4.0

External IDs:dblp:conf/iclr/HuangZXL0KF25

Loading