Exploiting Verified Neural Networks via Floating Point Numerical Error

28 Sept 2020 (modified: 22 Oct 2023) · ICLR 2021 Conference Blind Submission
Abstract: Motivated by the need to reliably characterize the robustness of deep neural networks, researchers have developed verification algorithms for deep neural networks. Given a neural network, a verifier aims to answer whether certain properties are guaranteed to hold for all inputs in a given space. However, little attention has been paid to floating point numerical error in neural network verification. We exploit floating point errors in the inference and verification implementations to construct adversarial examples for neural networks that a verifier claims to be robust with respect to certain inputs. We argue that, to produce sound verification results, any verification system must accurately (or conservatively) model the effects of any floating point computations in the network inference or verification system.
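The mechanism is easy to reproduce in miniature. Below is a minimal, hypothetical sketch (not the paper's code or networks): a toy linear classifier whose true logit sits a sliver above the decision boundary, as a verifier reasoning in exact arithmetic would confirm, while the deployed float32 inference incurs rounding error orders of magnitude larger than that margin, so its predicted class is decided by rounding noise rather than by the verified property.

```python
# Minimal sketch (not the paper's implementation): a verifier reasoning in
# (near-)exact arithmetic would certify sign(w . x) = +1 for this input, but
# the deployed float32 dot product carries rounding error far larger than the
# true margin. The linear "network" and all constants are hypothetical.
import numpy as np

rng = np.random.default_rng(0)
n = 10_000
w = rng.standard_normal(n)
x = rng.standard_normal(n)

# Place x a sliver (1e-9) to the +1 side of the decision boundary, using
# float64 as a stand-in for the exact arithmetic a verifier assumes.
x -= w * (w @ x) / (w @ w)   # project x onto the boundary w . x = 0
x += w * (1e-9 / (w @ w))    # step so the "true" logit is +1e-9

logit_exact = w @ x                                             # ~ +1e-9
logit_f32 = float(w.astype(np.float32) @ x.astype(np.float32))  # noisy

print(f"exact-arithmetic logit: {logit_exact:+.3e}")  # verified class: +1
print(f"float32 logit:          {logit_f32:+.3e}")    # rounding noise >> 1e-9
# Whenever |float32 rounding error| exceeds the true margin, the deployed
# class is decided by rounding noise and can contradict the verified result.
print("deployed class matches verified class:",
      np.sign(logit_f32) == np.sign(logit_exact))
```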
Code Of Ethics: I acknowledge that I and all co-authors of this work have read and commit to adhering to the ICLR Code of Ethics
One-sentence Summary: We show that floating point error in neural network verifiers and neural network inference implementations can be systematically exploited to invalidate robustness claims.
Community Implementations: [1 code implementation](https://www.catalyzex.com/paper/arxiv:2003.03021/code)
Reviewed Version (pdf): https://openreview.net/references/pdf?id=-QWnIIr_gx