Breach By A Thousand Leaks: Unsafe Information Leakage in 'Safe' AI Responses

David Glukhov, Ziwen Han, Ilia Shumailov, Vardan Papyan, Nicolas Papernot

Published: 2025, Last Modified: 20 Apr 2026, ICLR 2025, CC BY-SA 4.0