Toggle navigation
OpenReview
.net
Login
×
Go to
DBLP
homepage
On the Role of Attention Heads in Large Language Model Safety
Zhenhong Zhou
,
Haiyang Yu
,
Xinghua Zhang
,
Rongwu Xu
,
Fei Huang
,
Kun Wang
,
Yang Liu
,
Junfeng Fang
,
Yongbin Li
Published: 01 Jan 2025, Last Modified: 21 May 2025
ICLR 2025
Everyone
Revisions
BibTeX
CC BY-SA 4.0
Loading