Meta Adversarial Perturbations

Chia-Hung Yuan; Pin-Yu Chen; Chia-Mu Yu

Meta Adversarial Perturbations

Chia-Hung Yuan, Pin-Yu Chen, Chia-Mu Yu

Published: 02 Dec 2021, Last Modified: 05 May 2023AAAI-22 AdvML Workshop ShortPaperReaders: Everyone

Keywords: meta perturbation, adversarial perturbation, adversarial example, meta-learning, efficient adversarial attack

TL;DR: We propose a meta adversarial perturbation (MAP), a better initialization that causes data to be misclassified with high probability after being updated through only a one-step gradient ascent update.

Abstract: A plethora of attack methods have been proposed to generate adversarial examples, among which the iterative methods have been demonstrated the ability to find a strong attack. However, the computation of an adversarial perturbation for a new data point requires solving a time-consuming optimization problem from scratch. To generate a stronger attack, it normally requires updating a data point with more iterations. In this paper, we show the existence of a \textit{meta adversarial perturbation} (MAP), a better initialization that causes natural images to be misclassified with high probability after being updated through only a one-step gradient ascent update, and propose an algorithm for computing such perturbations. We conduct extensive experiments, and the empirical results demonstrate that state-of-the-art deep neural networks are vulnerable to meta perturbations. We further show that these perturbations are not only image-agnostic, but also model-agnostic, as a single perturbation generalizes well across unseen data points and different neural network architectures.

2 Replies

Loading