A Non-Targeted Attack Approach for the Coarse Misclassification Problem

Ismail R. Alkhouri, Alvaro Velasquez, George K. Atia

Published: 2023, Last Modified: 13 Nov 2024IJCNN 2023EveryoneRevisionsBibTeXCC BY-SA 4.0

Abstract: The evaluation of classifiers' robustness against adversarial attacks is typically performed through metrics based on the minimal perturbation required for misclassification. The conventional method of generating these perturbations relies on setting a limit on the maximum allowed perturbation (restricted attack method) and a non-targeted attack formulation. This approach, however, disregards any relationships between classes. Our paper introduces a novel, non-targeted, bound-restricted method for achieving coarse misclassification, so that the perturbed feature is classified outside its true coarse class. We present an efficient, single-step solution to the coarse misclassification problem and analyze its computational requirements. Our experiments showcase the superiority of our method, surpassing state-of-the-art in terms of both the perceptibility of adversarial examples and runtime.