Toggle navigation
OpenReview
.net
Login
×
Back to
ICML
ICML 2025 Workshop MoFA Submissions
Entropy Controllable Direct Preference Optimization
Motoki Omura
,
Yasuhiro Fujita
,
Toshiki Kataoka
Published: 10 Jun 2025, Last Modified: 30 Jun 2025
MoFA Poster
Readers:
Everyone
Implicit User Feedback in Human-LLM Dialogues: Informative to Understand Users yet Noisy as a Learning Signal
Yuhan Liu
,
Michael JQ Zhang
,
Eunsol Choi
Published: 10 Jun 2025, Last Modified: 30 Jun 2025
MoFA Poster
Readers:
Everyone
Selective Preference Aggregation
Shreyas Kadekodi
,
Hayden McTavish
,
Berk Ustun
Published: 10 Jun 2025, Last Modified: 30 Jun 2025
MoFA Poster
Readers:
Everyone
Tracing Human-like Traits in LLMs: Origins, Real-World Manifestation, and Controllability
Pengrui Han
,
Rafal Dariusz Kocielnik
,
Peiyang Song
,
Ramit Debnath
,
Dean Mobbs
,
Anima Anandkumar
,
R. Michael Alvarez
Published: 10 Jun 2025, Last Modified: 30 Jun 2025
MoFA Poster
Readers:
Everyone
Understanding Likelihood Over-optimisation in Direct Alignment Algorithms
Zhengyan Shi
,
Sander Land
,
Acyr Locatelli
,
Matthieu Geist
,
Max Bartolo
Published: 10 Jun 2025, Last Modified: 30 Jun 2025
MoFA Poster
Readers:
Everyone
Inference-Time Reward Hacking in Large Language Models
Hadi Khalaf
,
Claudio Mayrink Verdun
,
Alex Oesterling
,
Himabindu Lakkaraju
,
Flavio Calmon
Published: 10 Jun 2025, Last Modified: 30 Jun 2025
MoFA Poster
Readers:
Everyone
Alignment of Large Language Models with Constrained Learning
Botong Zhang
,
Shuo Li
,
Ignacio Hounie
,
Osbert Bastani
,
Dongsheng Ding
,
Alejandro Ribeiro
Published: 10 Jun 2025, Last Modified: 30 Jun 2025
MoFA Poster
Readers:
Everyone
Aligned Textual Scoring Rule
Yuxuan Lu
,
Yifan Wu
,
Jason Hartline
,
Michael Curry
Published: 10 Jun 2025, Last Modified: 30 Jun 2025
MoFA Oral
Readers:
Everyone
Do Language Models Understand Discrimination? Testing Alignment with Human Legal Reasoning under the ECHR
Tatiana Botskina
Published: 10 Jun 2025, Last Modified: 30 Jun 2025
MoFA Poster
Readers:
Everyone
Towards a Sharp Analysis of Offline Policy Learning for $f$-Divergence-Regularized Contextual Bandits
Qingyue Zhao
,
Kaixuan Ji
,
Heyang Zhao
,
Tong Zhang
,
Quanquan Gu
Published: 10 Jun 2025, Last Modified: 07 Jul 2025
MoFA Poster
Readers:
Everyone
What Matters when Modeling Human Behavior using Imitation Learning?
Aneri Muni
,
Esther Derman
,
Vincent Taboga
,
Pierre-Luc Bacon
,
Erick Delage
Published: 10 Jun 2025, Last Modified: 30 Jun 2025
MoFA Poster
Readers:
Everyone
Dropping Just a Handful of Preferences Can Change Top Large Language Model Rankings
Jenny Y. Huang
,
Yunyi Shen
,
Dennis Wei
,
Tamara Broderick
Published: 10 Jun 2025, Last Modified: 14 Aug 2025
MoFA Oral
Readers:
Everyone
Doubly Robust Alignment for Large Language Models
Erhan Xu
,
Kai Ye
,
Hongyi Zhou
,
Luhan Zhu
,
Francesco Quinzan
,
Chengchun Shi
Published: 10 Jun 2025, Last Modified: 30 Jun 2025
MoFA Poster
Readers:
Everyone
Theoretical Analysis of KL-regularized RLHF with Multiple Reference Models
Gholamali Aminian
,
Amir R. Asadi
,
Idan Shenfeld
,
Youssef Mroueh
Published: 10 Jun 2025, Last Modified: 30 Jun 2025
MoFA Poster
Readers:
Everyone
Deep Context-Dependent Choice Model
Shuhan Zhang
,
Zhi Wang
,
Rui Gao
,
Shuang Li
Published: 10 Jun 2025, Last Modified: 30 Jun 2025
MoFA Oral
Readers:
Everyone
Aligning Neural Style Representations for Style-based Clustering
Abhishek Dangeti
,
Pavan Gajula
,
Vikram Jamwal
,
Vivek Srivastava
Published: 10 Jun 2025, Last Modified: 30 Jun 2025
MoFA Poster
Readers:
Everyone
ActiveDPO: Active Direct Preference Optimization for Sample-Efficient Alignment
Xiaoqiang Lin
,
Arun Verma
,
Zhongxiang Dai
,
Daniela Rus
,
See-Kiong Ng
,
Bryan Kian Hsiang Low
Published: 10 Jun 2025, Last Modified: 30 Jun 2025
MoFA Poster
Readers:
Everyone
Online Learning and Equilibrium Computation with Ranking Feedback
Mingyang Liu
,
Yongshan Chen
,
Zhiyuan Fan
,
Gabriele Farina
,
Asuman E. Ozdaglar
,
Kaiqing Zhang
Published: 10 Jun 2025, Last Modified: 30 Jun 2025
MoFA Poster
Readers:
Everyone
Improvement-Guided Iterative DPO for Diffusion Models
Ying Fan
,
Fei Deng
,
Yang Zhao
,
Sahil Singla
,
Rahul Jain
,
Tingbo Hou
,
Kangwook Lee
,
Feng Yang
,
Deepak Ramachandran
,
Qifei Wang
Published: 10 Jun 2025, Last Modified: 30 Jun 2025
MoFA Poster
Readers:
Everyone
Multi-Task Reward Learning from Human Ratings
Mingkang Wu
,
Devin White
,
Evelyn Rose
,
Vernon Lawhern
,
Nicholas R Waytowich
,
Yongcan Cao
Published: 10 Jun 2025, Last Modified: 30 Jun 2025
MoFA Poster
Readers:
Everyone
Copilot Arena: A Platform for Code LLM Evaluation in the Wild
Wayne Chi
,
Valerie Chen
,
Anastasios Nikolas Angelopoulos
,
Wei-Lin Chiang
,
Aditya Mittal
,
Naman Jain
,
Tianjun Zhang
,
Ion Stoica
,
Chris Donahue
,
Ameet Talwalkar
Published: 10 Jun 2025, Last Modified: 30 Jun 2025
MoFA Oral
Readers:
Everyone
Robust Reward Modeling via Causal Rubrics
Pragya Srivastava
,
Harman Singh
,
Rahul Madhavan
,
Gandharv Patil
,
Sravanti Addepalli
,
Arun Suggala
,
Rengarajan Aravamudhan
,
Soumya Sharma
,
Anirban Laha
,
Aravindan Raghuveer
,
Karthikeyan Shanmugam
,
Doina Precup
Published: 10 Jun 2025, Last Modified: 30 Jun 2025
MoFA Poster
Readers:
Everyone
Geometry-Aware Preference Learning for 3D Texture Generation
AmirHossein Zamani
,
Tianhao Xie
,
Amir Aghdam
,
Tiberiu Popa
,
Eugene Belilovsky
Published: 10 Jun 2025, Last Modified: 11 Jul 2025
MoFA Poster
Readers:
Everyone
Aggregated Individual Reporting for Post-Deployment Evaluation
Jessica Dai
,
Inioluwa Deborah Raji
,
Benjamin Recht
,
Irene Y. Chen
Published: 10 Jun 2025, Last Modified: 30 Jun 2025
MoFA Poster
Readers:
Everyone
Doctor Approved: Generating Medically Accurate Skin Disease Images through AI–Expert Feedback
Janet Wang
,
Yunbei Zhang
,
Zhengming Ding
,
Jihun Hamm
Published: 10 Jun 2025, Last Modified: 30 Jun 2025
MoFA Oral
Readers:
Everyone
«
‹
1
2
3
›
»