VeriX: Towards Verified Explainability of Deep Neural Networks

Published: 21 Sept 2023, Last Modified: 02 Nov 2023NeurIPS 2023 posterEveryoneRevisionsBibTeX
Keywords: trustworthy machine learning, deep neural networks, explainability, interpretability, formal methods, automated verification
TL;DR: We present VeriX (Verified Explainability), a system for producing optimal robust explanations and generating counterfactuals along decision boundaries of machine learning models.
Abstract: We present **VeriX** (**Veri**fied e**X**plainability), a system for producing *optimal robust explanations* and generating *counterfactuals* along decision boundaries of machine learning models. We build such explanations and counterfactuals iteratively using constraint solving techniques and a heuristic based on feature-level sensitivity ranking. We evaluate our method on image recognition benchmarks and a real-world scenario of autonomous aircraft taxiing.
Supplementary Material: pdf
Submission Number: 9228