DeDUCE: Generating Counterfactual Explanations At Scale

Benedikt Höltgen; Lisa Schut; Jan M. Brauner; Yarin Gal

DeDUCE: Generating Counterfactual Explanations At Scale

Benedikt Höltgen, Lisa Schut, Jan M. Brauner, Yarin Gal

Published: 17 Oct 2021, Last Modified: 04 Aug 2025XAI 4 Debugging Workshop @ NEURIPS 2021 PosterReaders: Everyone

Keywords: Counterfactual explanations, XAI, Debugging

TL;DR: We introduce a novel, efficient algorithm providing counterfactual explanations for residual networks and compare its performance against baselines.

Abstract: When an image classifier outputs a wrong class label, it can be helpful to see what changes in the image would lead to a correct classification. This is the aim of algorithms generating counterfactual explanations. However, there is no easily scalable method to generate such counterfactuals. We develop a new algorithm providing counterfactual explanations for large image classifiers trained with spectral normalisation at low computational cost. We empirically compare this algorithm against baselines from the literature; our novel algorithm consistently finds counterfactuals that are much closer to the original inputs. At the same time, the realism of these counterfactuals is comparable to the baselines.

Community Implementations: [![CatalyzeX](/images/catalyzex_icon.svg) 1 code implementation](https://www.catalyzex.com/paper/deduce-generating-counterfactual-explanations/code)

0 Replies

Loading