Adversarial attacks and defenses on text-to-image diffusion models: A survey

Published: 01 Jan 2025, Last Modified: 13 May 2025Inf. Fusion 2025EveryoneRevisionsBibTeXCC BY-SA 4.0
Abstract: Highlights•We highlight that the text-to-image diffusion model has vulnerabilities in both robustness and safety.•We provide an in-depth analysis of adversarial attacks on the text-to-image diffusion model.•We present a detailed analysis of current defense methods that improve model robustness and safety.•We provide a introduction of metrics and datasets commonly used in existing attacks and defenses.•We discuss ongoing challenges of existing methods and explore promising future research directions.
Loading

OpenReview is a long-term project to advance science through improved peer review with legal nonprofit status. We gratefully acknowledge the support of the OpenReview Sponsors. © 2025 OpenReview