Toggle navigation
OpenReview
.net
Login
×
Back to
ICML
ICML 2024 Workshop MFHAIA Submissions
Query Design for Crowdsourced Clustering: Effect of Cognitive Overload and Contextual Bias
Yi Chen
,
Ramya Korlakai Vinayak
Published: 17 Jun 2024, Last Modified: 02 Jul 2024
ICML 2024 Workshop MHFAIA Poster
Readers:
Everyone
"You just can’t go around killing people'' Explaining Agent Behavior to a Human Terminator
Uri Menkes
,
Ofra Amir
,
Assaf Hallak
Published: 17 Jun 2024, Last Modified: 02 Jul 2024
ICML 2024 Workshop MHFAIA Poster
Readers:
Everyone
Filtered Direct Preference Optimization
Tetsuro Morimura
,
Mitsuki Sakamoto
,
Yuu Jinnai
,
Kenshi Abe
,
Kaito Ariu
Published: 17 Jun 2024, Last Modified: 02 Jul 2024
ICML 2024 Workshop MHFAIA Poster
Readers:
Everyone
Scaling Laws for Reward Model Overoptimization in Direct Alignment Algorithms
Rafael Rafailov
,
Yaswanth Chittepu
,
Ryan Park
,
Harshit Sikchi
,
Joey Hejna
,
W. Bradley Knox
,
Chelsea Finn
,
Scott Niekum
Published: 17 Jun 2024, Last Modified: 02 Jul 2024
ICML 2024 Workshop MHFAIA Poster
Readers:
Everyone
Prompt Optimization with Human Feedback
Xiaoqiang Lin
,
Zhongxiang Dai
,
Arun Verma
,
See-Kiong Ng
,
Patrick Jaillet
,
Bryan Kian Hsiang Low
Published: 17 Jun 2024, Last Modified: 02 Jul 2024
ICML 2024 Workshop MHFAIA Oral
Readers:
Everyone
Reuse Your Rewards: Reward Model Transfer for Zero-Shot Cross-Lingual Alignment
Zhaofeng Wu
,
Ananth Balashankar
,
Yoon Kim
,
Jacob Eisenstein
,
Ahmad Beirami
Published: 17 Jun 2024, Last Modified: 02 Jul 2024
ICML 2024 Workshop MHFAIA Poster
Readers:
Everyone
Models That Prove Their Own Correctness
Noga Amit
,
Shafi Goldwasser
,
Orr Paradise
,
Guy N. Rothblum
Published: 17 Jun 2024, Last Modified: 02 Jul 2024
ICML 2024 Workshop MHFAIA Poster
Readers:
Everyone
Distributional Preference Alignment of LLMs via Optimal Transport
Igor Melnyk
,
Youssef Mroueh
,
Brian Belgodere
,
Mattia Rigotti
,
Apoorva Nitsure
,
Mikhail Yurochkin
,
Kristjan Greenewald
,
Jiri Navratil
,
Jarret Ross
Published: 17 Jun 2024, Last Modified: 02 Jul 2024
ICML 2024 Workshop MHFAIA Poster
Readers:
Everyone
Uncertainty-aware Preference Alignment in Reinforcement Learning from Human Feedback
Sheng Xu
,
Bo Yue
,
Hongyuan Zha
,
Guiliang Liu
Published: 17 Jun 2024, Last Modified: 02 Jul 2024
ICML 2024 Workshop MHFAIA Poster
Readers:
Everyone
Language Alignment via Nash-learning and Adaptive feedback
Ari Azarafrooz
,
Farshid Faal
Published: 17 Jun 2024, Last Modified: 02 Jul 2024
ICML 2024 Workshop MHFAIA Poster
Readers:
Everyone
Modeling the Plurality of Human Preferences via Ideal Points
Daiwei Chen
,
Yi Chen
,
Aniket Rege
,
Ramya Korlakai Vinayak
Published: 17 Jun 2024, Last Modified: 02 Jul 2024
ICML 2024 Workshop MHFAIA Oral
Readers:
Everyone
Adversarial Multi-dueling Bandits
Pratik Gajane
Published: 17 Jun 2024, Last Modified: 02 Jul 2024
ICML 2024 Workshop MHFAIA Poster
Readers:
Everyone
Preference Learning Algorithms Do Not Learn Preference Rankings
Angelica Chen
,
Sadhika Malladi
,
Lily H Zhang
,
Xinyi Chen
,
Qiuyi Zhang
,
Rajesh Ranganath
,
Kyunghyun Cho
Published: 17 Jun 2024, Last Modified: 02 Jul 2024
ICML 2024 Workshop MHFAIA Oral
Readers:
Everyone
Bootstrapping Language Models with DPO Implicit Rewards
Changyu Chen
,
Zichen Liu
,
Chao Du
,
Tianyu Pang
,
Qian Liu
,
Arunesh Sinha
,
Pradeep Varakantham
,
Min Lin
Published: 17 Jun 2024, Last Modified: 02 Jul 2024
ICML 2024 Workshop MHFAIA Poster
Readers:
Everyone
REBEL: Reinforcement Learning via Regressing Relative Rewards
Zhaolin Gao
,
Jonathan Daniel Chang
,
Wenhao Zhan
,
Owen Oertell
,
Gokul Swamy
,
Kianté Brantley
,
Thorsten Joachims
,
J. Andrew Bagnell
,
Jason D. Lee
,
Wen Sun
Published: 17 Jun 2024, Last Modified: 02 Jul 2024
ICML 2024 Workshop MHFAIA Poster
Readers:
Everyone
Stochastic Concept Bottleneck Models
Moritz Vandenhirtz
,
Sonia Laguna
,
Ričards Marcinkevičs
,
Julia E Vogt
Published: 17 Jun 2024, Last Modified: 02 Jul 2024
ICML 2024 Workshop MHFAIA Poster
Readers:
Everyone
Multi-Agent Imitation Learning: Value is Easy, Regret is Hard
Jingwu Tang
,
Gokul Swamy
,
Fei Fang
,
Steven Wu
Published: 17 Jun 2024, Last Modified: 02 Jul 2024
ICML 2024 Workshop MHFAIA Poster
Readers:
Everyone
Concept-Based Interpretable Reinforcement Learning with Limited to No Human Labels
Zhuorui Ye
,
Stephanie Milani
,
Fei Fang
,
Geoffrey J. Gordon
Published: 17 Jun 2024, Last Modified: 02 Jul 2024
ICML 2024 Workshop MHFAIA Poster
Readers:
Everyone
Preference Elicitation for Offline Reinforcement Learning
Alizée Pace
,
Bernhard Schölkopf
,
Gunnar Ratsch
,
Giorgia Ramponi
Published: 17 Jun 2024, Last Modified: 02 Jul 2024
ICML 2024 Workshop MHFAIA Poster
Readers:
Everyone
New Desiderata for Direct Preference Optimization
Xiangkun Hu
,
Tong He
,
David Wipf
Published: 17 Jun 2024, Last Modified: 02 Jul 2024
ICML 2024 Workshop MHFAIA Poster
Readers:
Everyone
Cross-Domain Knowledge Transfer for RL via Preference Consistency
Ting-Hsuan Huang
,
Ping-Chun Hsieh
Published: 17 Jun 2024, Last Modified: 02 Jul 2024
ICML 2024 Workshop MHFAIA Poster
Readers:
Everyone
Generalizing Offline Alignment Theoretical Paradigm with Diverse Divergence Constraints
Haoyuan Sun
,
Yuxin Zheng
,
Yifei Zhao
,
Yongzhe Chang
,
Xueqian Wang
Published: 17 Jun 2024, Last Modified: 02 Jul 2024
ICML 2024 Workshop MHFAIA Poster
Readers:
Everyone
AI Alignment with Changing and Influenceable Reward Functions
Micah Carroll
,
Davis Foote
,
Anand Siththaranjan
,
Stuart Russell
,
Anca Dragan
Published: 17 Jun 2024, Last Modified: 02 Jul 2024
ICML 2024 Workshop MHFAIA Oral
Readers:
Everyone
Towards Safe Large Language Models for Medicine
Tessa Han
,
Aounon Kumar
,
Chirag Agarwal
,
Himabindu Lakkaraju
Published: 17 Jun 2024, Last Modified: 02 Jul 2024
ICML 2024 Workshop MHFAIA Poster
Readers:
Everyone
Informed Meta-Learning
Kasia Kobalczyk
,
Mihaela van der Schaar
Published: 17 Jun 2024, Last Modified: 02 Jul 2024
ICML 2024 Workshop MHFAIA Poster
Readers:
Everyone
«
‹
1
2
3
›
»