SwordEcho: A LLM Jailbreaking Optimization Strategy Driven by Reinforcement Learning

Xuehai Tang, Wenjie Xiao, Zhongjiang Yao, Jizhong Han

Published: 06 Dec 2024, Last Modified: 07 Jan 2026, License: CC BY-SA 4.0