NetSat: Network Saturation Adversarial Attack

Katarzyna Filus; Joanna Domanska

NetSat: Network Saturation Adversarial Attack

Katarzyna Filus, Joanna Domanska

Published: 01 Jan 2023, Last Modified: 28 Aug 2024IEEE Big Data 2023EveryoneRevisionsBibTeXCC BY-SA 4.0

Abstract: Deep Convolutional Neural Networks can be fooled with imperceptible image perturbations. This negatively affects the security of intelligent systems. It is especially important for applications, in which security is crucial, such as Automated Guided Vehicles (AGVs) and production systems. A great majority of adversarial attacks target the classification layer of the network. These attacks focus only on changing the prediction, but it is also beneficial to corrupt the deep data representations. It can result in higher confusion of neural networks, which is important for more in-depth security testing. Classifier-independent attacks could be used in the future to test other types of networks frequently used in industry, production and AGVs, e.g. object detectors. In this paper, we present Network Saturation Attack (NetSat) - a novel classifier-independent, white-box, non-targeted adversarial attack. It aims to ‘saturate’ the final convolutional feature maps with introduced patterns and thus - make the network unable to provide reasonable predictions. Classifier-independence results in samples with predictions further away from the true label and high flexibility of the method. To assess its harmfulness, we propose Dissimilarity Metric (DM). It describes how far is the predicted label from the true one in the class space. The proposed methods, due to their simplicity, flexibility and practicality, are suitable for testing of real-life systems. We release a GitHub repository with code and examples: https://github.com/iitis/NetSat-NetworkSaturationAttack.

Loading