Toggle navigation
OpenReview
.net
Login
×
Back to
ICLR
ICLR 2025 Workshop Bi-Align Submissions
Envision Human-AI Perceptual Alignment from a Multimodal Interaction Perspective
Shu Zhong
,
Marianna Obrist
Published: 06 Mar 2025, Last Modified: 05 May 2025
ICLR 2025 Bi-Align Workshop Poster
Readers:
Everyone
Negotiative Alignment: An interactive approach to human-AI co-adaptation for clinical applications
Florence Xini Doo
,
Nikhil Shah
,
Pranav Kulkarni
,
Vishwa Sanjay Parekh
,
Heng Huang
Published: 06 Mar 2025, Last Modified: 05 May 2025
ICLR 2025 Bi-Align Workshop Poster
Readers:
Everyone
A Roadmap for Human-Agent Moral Alignment: Integrating Pre-defined Intrinsic Rewards and Learned Reward Models
Elizaveta Tennant
,
Stephen Hailes
,
Mirco Musolesi
Published: 06 Mar 2025, Last Modified: 05 May 2025
ICLR 2025 Bi-Align Workshop Poster
Readers:
Everyone
A Benchmark for Scalable Oversight Mechanisms
Abhimanyu Pallavi Sudhir
,
Jackson Kaunismaa
,
Arjun Panickssery
Published: 06 Mar 2025, Last Modified: 05 May 2025
ICLR 2025 Bi-Align Workshop Poster
Readers:
Everyone
Rethinking Anti-Misinformation AI
Vidya Sujaya
,
Kellin Pelrine
,
Andreea Musulan
,
Reihaneh Rabbany
Published: 06 Mar 2025, Last Modified: 05 May 2025
ICLR 2025 Bi-Align Workshop Poster
Readers:
Everyone
Exploring Persona-dependent LLM Alignment for the Moral Machine Experiment
Jiseon Kim
,
Jea Kwon
,
Luiz Felipe Vecchietti
,
Alice Oh
,
Meeyoung Cha
Published: 06 Mar 2025, Last Modified: 05 May 2025
ICLR 2025 Bi-Align Workshop Poster
Readers:
Everyone
Cooperative Agency-Centered LLMs
Iyadunni J. Adenuga
Published: 06 Mar 2025, Last Modified: 05 May 2025
ICLR 2025 Bi-Align Workshop Poster
Readers:
Everyone
Multi-Objective Probabilistic Preference Learning with Soft and Hard Bounds
Edward Chen
,
Sang T. Truong
,
Natalie Dullerud
,
Sanmi Koyejo
,
Carlos Guestrin
Published: 06 Mar 2025, Last Modified: 05 May 2025
ICLR 2025 Bi-Align Workshop Poster
Readers:
Everyone
ValueMap: Mapping Crowdsourced Human Values to Computational Scores for Bi-directional Alignment
Priya Ronald DCosta
,
Rupkatha Hira
Published: 06 Mar 2025, Last Modified: 05 May 2025
ICLR 2025 Bi-Align Workshop Poster
Readers:
Everyone
We Shape AI, and Thereafter AI Shape Us: Humans Align with AI through Social Influences
Jingshu Li
,
Tianqi Song
,
Beichen Xue
,
Yi-Chieh Lee
Published: 06 Mar 2025, Last Modified: 05 May 2025
ICLR 2025 Bi-Align Workshop Poster
Readers:
Everyone
Towards LVLM-Aided Alignment of Task-Specific Vision Models
Alexander Koebler
,
Christian Greisinger
,
Jan Paulus
,
Ingo Thon
,
Florian Buettner
Published: 06 Mar 2025, Last Modified: 05 May 2025
ICLR 2025 Bi-Align Workshop Poster
Readers:
Everyone
SafeChain: Safety of Language Models with Long Chain-of-Thought Reasoning Capabilities
Fengqing Jiang
,
Zhangchen Xu
,
Yuetai Li
,
Luyao Niu
,
Zhen Xiang
,
Bo Li
,
Bill Yuchen Lin
,
Radha Poovendran
Published: 06 Mar 2025, Last Modified: 05 May 2025
ICLR 2025 Bi-Align Workshop ICLROral
Readers:
Everyone
Patterns and Mechanisms of Contrastive Activation Engineering
Yixiong Hao
,
Ayush Panda
,
Stepan Shabalin
,
Sheikh Abdur Raheem Ali
Published: 06 Mar 2025, Last Modified: 05 May 2025
ICLR 2025 Bi-Align Workshop Poster
Readers:
Everyone
CTRL-Rec: Controlling Recommender Systems With Natural Language
Micah Carroll
,
Adeline Foote
,
Marcus Williams
,
Anca Dragan
,
W. Bradley Knox
,
Smitha Milli
Published: 06 Mar 2025, Last Modified: 05 May 2025
ICLR 2025 Bi-Align Workshop Poster
Readers:
Everyone
Position: Interpretability is a Bidirectional Communication Problem
Kola Ayonrinde
Published: 06 Mar 2025, Last Modified: 05 May 2025
ICLR 2025 Bi-Align Workshop Poster
Readers:
Everyone
Outlier-Aware Preference Optimization for Large Language Models
Pragya Srivastava
,
Sai Soumya Nalli
,
Amit Deshpande
,
Amit Sharma
Published: 06 Mar 2025, Last Modified: 05 May 2025
ICLR 2025 Bi-Align Workshop Poster
Readers:
Everyone
TraCeS: Trajectory Based Credit Assignment From Sparse Safety Feedback
Siow Meng Low
,
Akshat Kumar
Published: 06 Mar 2025, Last Modified: 05 May 2025
ICLR 2025 Bi-Align Workshop Poster
Readers:
Everyone
Probing Mechanical Reasoning in Large Vision Language Models
Haoran Sun
,
Yijiang Li
,
Qingying Gao
,
Haiyun Lyu
,
Dezhi Luo
,
Hokin Deng
Published: 06 Mar 2025, Last Modified: 05 May 2025
ICLR 2025 Bi-Align Workshop Poster
Readers:
Everyone
Vision Language Models See What You Want but not What You See
Qingying Gao
,
Yijiang Li
,
Haiyun Lyu
,
Haoran Sun
,
Dezhi Luo
,
Hokin Deng
Published: 06 Mar 2025, Last Modified: 05 May 2025
ICLR 2025 Bi-Align Workshop Poster
Readers:
Everyone
Vision Language Models Know Law of Conservation without Understanding More-or-Less
Dezhi Luo
,
Haiyun Lyu
,
Qingying Gao
,
Haoran Sun
,
Yijiang Li
,
Hokin Deng
Published: 06 Mar 2025, Last Modified: 05 May 2025
ICLR 2025 Bi-Align Workshop Poster
Readers:
Everyone
Rethinking AI Cultural Alignment
Michal Bravansky
,
Filip Trhlík
,
Fazl Barez
Published: 06 Mar 2025, Last Modified: 05 May 2025
ICLR 2025 Bi-Align Workshop Poster
Readers:
Everyone
Human Alignment: How Much We Adapt to LLMs?
Cazalet Tanguy
,
Ruben Janssens
,
Tony Belpaeme
,
Joni Dambre
Published: 06 Mar 2025, Last Modified: 21 Apr 2025
ICLR 2025 Bi-Align Workshop Poster
Readers:
Everyone
AI-enhanced semantic feature norms for 786 concepts
Siddharth Suresh
,
Kushin Mukherjee
,
Tyler Giallanza
,
Xizheng Yu
,
Mia Patil
,
Jonathan D. Cohen
,
Timothy T. Rogers
Published: 06 Mar 2025, Last Modified: 05 May 2025
ICLR 2025 Bi-Align Workshop ICLROral
Readers:
Everyone
Moral Alignment for LLM Agents
Elizaveta Tennant
,
Stephen Hailes
,
Mirco Musolesi
Published: 06 Mar 2025, Last Modified: 05 May 2025
ICLR 2025 Bi-Align Workshop Poster
Readers:
Everyone
A Sociotechnical Perspective on Aligning AI with Pluralistic Human Values
Dalia Ali
,
Aysenur Kocak
,
Dora Zhao
,
Allison Koenecke
,
Orestis Papakyriakopoulos
Published: 06 Mar 2025, Last Modified: 05 May 2025
ICLR 2025 Bi-Align Workshop Poster
Readers:
Everyone
«
‹
1
2
3
›
»