On the Paradox of Certified Training

Nikola Jovanović; Mislav Balunovic; Maximilian Baader; Martin Vechev

Published: 28 Oct 2022, Last Modified: 17 Sept 2024, Accepted by TMLR

Abstract: Certified defenses based on convex relaxations are an established technique for training provably robust models. The key component is the choice of relaxation, varying from simple intervals to tight polyhedra. Counterintuitively, loose interval-based training often leads to higher certified robustness than what can be achieved with tighter relaxations, which is a well-known but poorly understood paradox. While recent works introduced various improvements aiming to circumvent this issue in practice, the fundamental problem of training models with high certified robustness remains unsolved. In this work, we investigate the underlying reasons behind the paradox and identify two key properties of relaxations, beyond tightness, that impact certified training dynamics: continuity and sensitivity. Our extensive experimental evaluation with a number of popular convex relaxations provides strong evidence that these factors can explain the drop in certified robustness observed for tighter relaxations. We also systematically explore modifications of existing relaxations and discover that improving unfavorable properties is challenging, as such attempts often harm other properties, revealing a complex tradeoff. Our findings represent an important first step towards understanding the intricate optimization challenges involved in certified training.
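For readers unfamiliar with the "simple intervals" end of the relaxation spectrum mentioned in the abstract, the following is a minimal sketch of interval bound propagation through a small ReLU network. It is illustrative only and is not taken from the paper or its code repository: the function names (`affine_bounds`, `relu_bounds`, `ibp_forward`, `is_certified`), the toy weights in `LAYERS`, and the perturbation radius are all assumptions made for this example.

```python
from typing import List, Sequence, Tuple

Vector = List[float]
Matrix = List[List[float]]
Layer = Tuple[Matrix, Vector]  # (weights, bias); hypothetical layout for this sketch


def affine_bounds(W: Matrix, b: Vector, lo: Vector, hi: Vector) -> Tuple[Vector, Vector]:
    """Propagate the box [lo, hi] through x -> W x + b with interval arithmetic."""
    out_lo, out_hi = [], []
    for row, bias in zip(W, b):
        l = u = bias
        for w, x_lo, x_hi in zip(row, lo, hi):
            if w >= 0:
                # Non-negative weight: lower input bound gives the lower output bound.
                l += w * x_lo
                u += w * x_hi
            else:
                # Negative weight: the roles of the input bounds swap.
                l += w * x_hi
                u += w * x_lo
        out_lo.append(l)
        out_hi.append(u)
    return out_lo, out_hi


def relu_bounds(lo: Vector, hi: Vector) -> Tuple[Vector, Vector]:
    """ReLU is monotone, so it can be applied to each bound separately."""
    return [max(0.0, v) for v in lo], [max(0.0, v) for v in hi]


def ibp_forward(layers: Sequence[Layer], lo: Vector, hi: Vector) -> Tuple[Vector, Vector]:
    """Bound the logits of an affine/ReLU network over the input box [lo, hi]."""
    for i, (W, b) in enumerate(layers):
        lo, hi = affine_bounds(W, b, lo, hi)
        if i < len(layers) - 1:  # no ReLU after the final (logit) layer
            lo, hi = relu_bounds(lo, hi)
    return lo, hi


def is_certified(logit_lo: Vector, logit_hi: Vector, target: int) -> bool:
    """Sound but loose check: the target's lower bound beats every other upper bound."""
    return all(logit_lo[target] > logit_hi[j]
               for j in range(len(logit_lo)) if j != target)


# Toy 2-2-2 network and an L-infinity ball of radius eps around x (made-up values).
LAYERS: List[Layer] = [
    ([[1.0, -1.0], [0.5, 0.5]], [0.0, 0.0]),
    ([[1.0, 1.0], [-1.0, 0.0]], [0.0, 0.0]),
]
x, eps = [1.0, 0.5], 0.1
logit_lo, logit_hi = ibp_forward(LAYERS, [v - eps for v in x], [v + eps for v in x])
print(is_certified(logit_lo, logit_hi, target=0))
```

Interval bounds like these are cheap to compute but generally loose; tighter polyhedral relaxations replace the per-neuron boxes with linear constraints that relate each neuron to its inputs. The sketch covers only the bound computation and does not attempt to illustrate the continuity or sensitivity properties studied in the paper.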

Submission Length: Regular submission (no more than 12 pages of main content)

Changes Since Last Submission: No prior submissions to TMLR.

Code: https://github.com/eth-sri/paradox

Assigned Action Editor: ~Andrej_Risteski2

License: Creative Commons Attribution 4.0 International (CC BY 4.0)

Submission Number: 169
