Abstract: As large language models are deployed ever more widely, understanding the complexity and evolution of jailbreaking strategies is critical for AI safety.
External IDs: dblp:conf/sgai/CreoFC25