An SDE Framework for Adversarial Training, with Convergence and Robustness AnalysisDownload PDFOpen Website

Published: 01 Jan 2021, Last Modified: 17 May 2023CoRR 2021Readers: Everyone
Abstract: Adversarial training has gained great popularity as one of the most effective defenses for deep neural network and more generally for gradient-based machine learning models against adversarial perturbations on data points. This paper establishes a continuous-time approximation for the mini-max game of adversarial training. This approximation approach allows for precise and analytical comparisons between stochastic gradient descent and its adversarial training counterpart; and confirms theoretically the robustness of adversarial training from a new gradient-flow viewpoint. The analysis is then corroborated through various analytical and numerical examples.
0 Replies

Loading