OpenReview
.net
OpenReview
.net
Login
OpenReview
.net
Login
Back to
the profile of David Glukhov
Breach By A Thousand Leaks: Unsafe Information Leakage in 'Safe' AI Responses
David Glukhov
,
Ziwen Han
,
Ilia Shumailov
,
Vardan Papyan
,
Nicolas Papernot
Published: 2025, Last Modified: 20 Apr 2026
ICLR 2025
Everyone
Revisions
BibTeX
CC BY-SA 4.0
Loading