Abstract: Highlights
• A novel attribution algorithm to locate adversarial patches, by leveraging the image-independence of adversarial patches.
• A purification method based on the diffusion model to purify adversarial patches, aligning attacked images with their natural distribution.
• The attribution-guided purification method can effectively remove adversarial patches, and improve the robustness of the model.