Diffeomorphic Explanations with Normalizing Flows

Ann-Kathrin Dombrowski; Jan E Gerken; Pan Kessel

Diffeomorphic Explanations with Normalizing Flows

Ann-Kathrin Dombrowski, Jan E Gerken, Pan Kessel

Published: 15 Jun 2021, Last Modified: 05 May 2023INNF+ 2021 contributedtalkReaders: Everyone

Keywords: normalizing flows, explanations, data manifold

TL;DR: We use normalizing flows to find interpretable explanations for neural network decisions.

Abstract: Normalizing flows are diffeomorphisms which are parameterized by neural networks. As a result, they can induce coordinate transformations in the tangent space of the data manifold. In this work, we demonstrate that such transformations can be used to generate interpretable explanations for decisions of neural networks. More specifically, we perform gradient ascent in the base space of the flow to generate counterfactuals which are classified with great confidence as a specified target class. We analyze this generation process theoretically using Riemannian differential geometry and establish a rigorous theoretical connection between gradient ascent on the data manifold and in the base space of the flow.

4 Replies

Loading