Implications of Model Indeterminacy for Explanations of Automated Decisions

Marc-Etienne Brunet; Ashton Anderson; Richard Zemel

Published: 31 Oct 2022, Last Modified: 15 Jan 2023, Venue: NeurIPS 2022 (Accept), Readers: Everyone

Keywords: underspecification, Rashomon effect, explainability, robustness, epistemic uncertainty

TL;DR: An empirically motivated analysis of the consequences of underspecification and the Rashomon effect on post-hoc explainability.

Abstract: There has been a significant research effort focused on explaining predictive models, for example through post-hoc explainability and recourse methods. Most of the proposed techniques operate upon a single, fixed, predictive model. However, it is well-known that given a dataset and a predictive task, there may be a multiplicity of models that solve the problem (nearly) equally well. In this work, we investigate the implications of this kind of model indeterminacy on the post-hoc explanations of predictive models. We show how it can lead to explanatory multiplicity, and we explore the underlying drivers. We show how predictive multiplicity, and the related concept of epistemic uncertainty, are not reliable indicators of explanatory multiplicity. We further illustrate how a set of models showing very similar aggregate performance on a test dataset may show large variations in their local explanations, i.e., for a specific input. We explore these effects for Shapley value based explanations on three risk assessment datasets. Our results indicate that model indeterminacy may have a substantial impact on explanations in practice, leading to inconsistent and even contradicting explanations.

Supplementary Material: zip
