OpenReview
.net
OpenReview
.net
Login
OpenReview
.net
Login
Go to
OpenReview Public Article DBLP
homepage
Friend or Foe: How LLMs' Safety Mind Gets Fooled by Intent Shift Attack
Peng Ding
,
Jun Kuang
,
Wen Sun
,
Zongyu Wang
,
Xuezhi Cao
,
Xunliang Cai
,
Jiajun Chen
,
Shujian Huang
Published: 2025, Last Modified: 01 Jun 2026
CoRR 2025
Everyone
Revisions
BibTeX
CC BY-SA 4.0
External IDs:
dblp:journals/corr/abs-2511-00556
Loading