VeriX: Towards Verified Explainability of Deep Neural Networks

Published: 21 Sept 2023, Last Modified: 02 Nov 2023, NeurIPS 2023 poster
Keywords: trustworthy machine learning, deep neural networks, explainability, interpretability, formal methods, automated verification
TL;DR: We present VeriX (Verified eXplainability), a system for producing optimal robust explanations and generating counterfactuals along decision boundaries of machine learning models.
Abstract:

We present VeriX (Verified eXplainability), a system for producing optimal robust explanations and generating counterfactuals along decision boundaries of machine learning models. We build such explanations and counterfactuals iteratively using constraint solving techniques and a heuristic based on feature-level sensitivity ranking. We evaluate our method on image recognition benchmarks and a real-world scenario of autonomous aircraft taxiing.
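As a rough illustration of the iterative construction described above, here is a minimal sketch. It is not the authors' implementation. It assumes a toy linear classifier, so that an exact worst-case bound can stand in for the constraint solver, and it assumes a perturbation bound `eps` that the abstract does not specify. All function names (`is_robust`, `sensitivity_order`, `explain`) are hypothetical. Features are visited from least to most sensitive. A feature is treated as irrelevant if the prediction provably stays the same when it, together with all previously freed features, may vary by up to `eps`. Otherwise it joins the explanation.

```python
def scores(W, b, x):
    # Class scores of a linear model: W @ x + b, written with plain lists.
    return [sum(w * xi for w, xi in zip(row, x)) + bi for row, bi in zip(W, b)]


def predict(W, b, x):
    s = scores(W, b, x)
    return max(range(len(s)), key=lambda j: s[j])


def is_robust(W, b, x, free, eps):
    # Stand-in for the constraint solver (an assumption of this sketch).
    # For a linear model the worst case over the eps-box is exact: the gap
    # (score_j - score_c) can grow by at most eps * sum_i |W[j][i] - W[c][i]|
    # over the features i that are allowed to vary.
    s = scores(W, b, x)
    c = predict(W, b, x)
    for j in range(len(W)):
        if j == c:
            continue
        slack = sum(abs(W[j][i] - W[c][i]) for i in free)
        if (s[j] - s[c]) + eps * slack >= 0:
            return False  # some perturbation could change the prediction
    return True


def sensitivity_order(W, b, x, eps):
    # Heuristic ranking (assumed form): perturb one feature at a time and
    # measure the change in the predicted class score; least sensitive first.
    c = predict(W, b, x)
    base = scores(W, b, x)[c]

    def sensitivity(i):
        x2 = list(x)
        x2[i] += eps
        return abs(scores(W, b, x2)[c] - base)

    return sorted(range(len(x)), key=sensitivity)


def explain(W, b, x, eps):
    irrelevant, explanation = set(), []
    for i in sensitivity_order(W, b, x, eps):
        if is_robust(W, b, x, irrelevant | {i}, eps):
            irrelevant.add(i)      # prediction is invariant: feature can be freed
        else:
            explanation.append(i)  # freeing it may flip the prediction
    return sorted(explanation), sorted(irrelevant)


# Hypothetical two-class model over three features.
W = [[2.0, 0.0, 0.1], [0.0, 0.0, 0.0]]
b = [0.0, 1.0]
x = [1.0, 1.0, 1.0]
explanation, irrelevant = explain(W, b, x, eps=0.6)
```

In this toy setting only feature 0 ends up in the explanation at `eps=0.6`. For a real deep network the `is_robust` check would be delegated to a neural network verifier rather than computed in closed form.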

Supplementary Material: pdf
Submission Number: 9228