Training for Faster Adversarial Robustness Verification via Inducing ReLU Stability

Kai Y. Xiao; Vincent Tjeng; Nur Muhammad (Mahi) Shafiullah; Aleksander Madry

Training for Faster Adversarial Robustness Verification via Inducing ReLU Stability

Kai Y. Xiao, Vincent Tjeng, Nur Muhammad (Mahi) Shafiullah, Aleksander Madry

Published: 21 Dec 2018, Last Modified: 22 Jun 2025ICLR 2019 Conference Blind SubmissionReaders: Everyone

Abstract: We explore the concept of co-design in the context of neural network verification. Specifically, we aim to train deep neural networks that not only are robust to adversarial perturbations but also whose robustness can be verified more easily. To this end, we identify two properties of network models - weight sparsity and so-called ReLU stability - that turn out to significantly impact the complexity of the corresponding verification task. We demonstrate that improving weight sparsity alone already enables us to turn computationally intractable verification problems into tractable ones. Then, improving ReLU stability leads to an additional 4-13x speedup in verification times. An important feature of our methodology is its "universality," in the sense that it can be used with a broad range of training procedures and verification approaches.

Keywords: verification, adversarial robustness, adversarial examples, stability, deep learning, regularization

TL;DR: We develop methods to train deep neural models that are both robust to adversarial perturbations and whose robustness is significantly easier to verify.

Code: [![github](/images/github_icon.svg) MadryLab/relu_stable](https://github.com/MadryLab/relu_stable)

Community Implementations: [![CatalyzeX](/images/catalyzex_icon.svg) 1 code implementation](https://www.catalyzex.com/paper/training-for-faster-adversarial-robustness/code)

18 Replies

Loading