OpenReview.net
Who is In Charge? Dissecting Role Conflicts in LLM Instruction Following
Siqi Zeng
Published: 22 Sept 2025, Last Modified: 03 Jan 2026
WiML @ NeurIPS 2025
CC BY 4.0
Keywords: Probing, Steering, AI Safety, instruction hierarchies, role conflicts
Submission Number: 106