Unified Preference Optimization: Language Model Alignment Beyond the Preference Frontier
Anirudhan Badrinath, Prabhat Agarwal, Jiajing Xu
Published: 01 Jan 2025, Last Modified: 27 Jun 2025
Trans. Mach. Learn. Res. 2025
CC BY-SA 4.0