JailGuard: A Universal Detection Framework for Prompt-based Attacks on LLM Systems

Xiaoyu Zhang, Cen Zhang, Tianlin Li, Yihao Huang, Xiaojun Jia, Ming Hu, Jie Zhang, Yang Liu, Shiqing Ma, Chao Shen

Published: 2026, Last Modified: 04 Mar 2026 | ACM Trans. Softw. Eng. Methodol. 2026 | CC BY-SA 4.0