Clustering Effect of Adversarial Robust Models

Yang Bai; Xin Yan; Yong Jiang; Shu-Tao Xia; Yisen Wang

Published: 09 Nov 2021, Last Modified: 26 May 2025 | NeurIPS 2021 Spotlight | Readers: Everyone

Keywords: Adversarial robust models, hierarchical clustering effect, domain adaptation tasks

Abstract: Adversarial robustness has received increasing attention along with the study of adversarial examples. Existing works show that robust models not only withstand various adversarial attacks but also improve performance on some downstream tasks. However, the underlying mechanism of adversarial robustness is still not clear. In this paper, we interpret adversarial robustness from the perspective of linear components and find that comprehensively robust models share certain statistical properties. Specifically, robust models show an obvious hierarchical clustering effect on their linearized sub-networks, which are obtained by removing or replacing all non-linear components (e.g., batch normalization, maximum pooling, or activation layers). Based on these observations, we propose a novel understanding of adversarial robustness and apply it to further tasks, including domain adaptation and robustness boosting. Experimental evaluations demonstrate the rationality and superiority of our proposed clustering strategy. Our code is available at https://github.com/bymavis/Adv_Weight_NeurIPS2021.
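
To make the idea of a "linearized sub-network" concrete, the toy sketch below collapses a stack of purely linear layers into a single class-by-input weight matrix and then groups the classes by how similar their rows are. It is only an illustration and not the authors' implementation (see the linked repository for that). The weights `W1` and `W2`, the helper names, the use of cosine similarity, and the average-linkage merging rule are all assumptions chosen for brevity.

```python
import math


def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def collapse(layers):
    """Collapse linear layers [W1, W2, ...] (applied in that order) into one matrix."""
    total = layers[0]
    for w in layers[1:]:
        total = matmul(w, total)  # later layers multiply from the left
    return total


def cosine(u, v):
    """Cosine similarity between two weight vectors."""
    dot = sum(x * y for x, y in zip(u, v))
    norm_u = math.sqrt(sum(x * x for x in u))
    norm_v = math.sqrt(sum(y * y for y in v))
    return dot / (norm_u * norm_v)


def class_similarity(w):
    """Pairwise cosine similarity between the per-class rows of w."""
    return [[cosine(r, s) for s in w] for r in w]


def agglomerate(sim, n_clusters):
    """Average-linkage agglomerative clustering on a similarity matrix."""
    clusters = [[i] for i in range(len(sim))]

    def linkage(a, b):
        return sum(sim[i][j] for i in a for j in b) / (len(a) * len(b))

    while len(clusters) > n_clusters:
        # Merge the pair of clusters with the highest average similarity.
        pairs = [(p, q) for p in range(len(clusters)) for q in range(p + 1, len(clusters))]
        p, q = max(pairs, key=lambda pq: linkage(clusters[pq[0]], clusters[pq[1]]))
        merged = sorted(clusters[p] + clusters[q])
        clusters = [c for k, c in enumerate(clusters) if k not in (p, q)] + [merged]
    return sorted(clusters)


# Hypothetical toy weights: 4 inputs -> 2 hidden units -> 4 classes,
# with every non-linear component assumed to be already removed.
W1 = [[1.0, 1.0, 0.0, 0.0],
      [0.0, 0.0, 1.0, 1.0]]
W2 = [[1.0, 0.1],
      [0.9, 0.2],
      [0.1, 1.0],
      [0.2, 0.9]]

W = collapse([W1, W2])                           # 4 classes x 4 inputs
CLUSTERS = agglomerate(class_similarity(W), 2)   # group classes into 2 clusters
print(CLUSTERS)
```

In this toy setup, classes 0 and 1 load mostly on the first hidden unit and classes 2 and 3 on the second, so the two pairs end up in separate clusters. Whether a real network shows such structure is the empirical question the paper studies.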

Supplementary Material: pdf

TL;DR: Clustering Effect of (Linearized) Adversarial Robust Models

Code: https://github.com/bymavis/Adv_Weight_NeurIPS2021

Community Implementations: [1 code implementation](https://www.catalyzex.com/paper/clustering-effect-of-adversarial-robust/code)
