Reproducibility Study of "Label-Free Explainability for Unsupervised Models"

Valentinos Pariza; Avik Pal; Madhura Pawar; Quim Serra Faber

Reproducibility Study of "Label-Free Explainability for Unsupervised Models"

Valentinos Pariza, Avik Pal, Madhura Pawar, Quim Serra Faber

Published: 02 Aug 2023, Last Modified: 02 Aug 2023MLRC 2022Readers: Everyone

Keywords: Reproducibility, Feature Importance, Example Importance, Disentangled VAEs, Label-Free, Unsupervised, Post-Hoc Explainability

TL;DR: Reproducibility Study of the paper "Label-Free Explainability for Unsupervised Models"

Abstract: Scope of Reproducibility In this work, we evaluate the reproducibility of the paper Label-Free Explainability for Unsupervised Models by Crabbe and van der Schaar. Our goal is to reproduce the paper's four main claims in a label-free setting:(1) feature importance scores determine salient features of a model's input, (2) example importance scores determine salient training examples to explain a test example, (3) interpretability of saliency maps is hard for disentangled VAEs, (4) distinct pretext tasks don’t have interchangeable representations. Methodology The authors of the paper provide an implementation in PyTorch for their proposed techniques and experiments. We reuse and extend their code for our additional experiments. Our reproducibility study comes at a total computational cost of 110 GPU hours, using an NVIDIA Titan RTX. Results We reproduced the original paper's work through our experiments. We find that the main claims of the paper largely hold. We assess the robustness and generalizability of some of the claims, through our additional experiments. In that case, we find that one claim is not generalizable and another is not reproducible for the graph dataset. What was easy The original paper is well-structured. The code implementation is well-organized and with clear instructions on how to get started. This was helpful to understand the paper's work and begin experimenting with their proposed methods. What was difficult We found it difficult to extrapolate some of the authors' proposed techniques to datasets other than those used by them. Also, we were not able to reproduce the results for one of the experiments. We couldn't find the exact reason for it by running explorative experiments due to time and resource constraints. Communication with original authors We reached out to the authors once about our queries regarding one experimental setup and to understand the assumptions and contexts of some sub-claims in the paper. We received a prompt response which satisfied most of our questions.

Paper Url: https://proceedings.mlr.press/v162/crabbe22a/crabbe22a.pdf

Paper Venue: ICML 2022

Supplementary Material: zip

Confirmation: The report pdf is generated from the provided camera ready Google Colab script, The report metadata is verified from the camera ready Google Colab script, The report contains correct author information., The report contains link to code and SWH metadata., The report follows the ReScience latex style guides as in the Reproducibility Report Template (https://paperswithcode.com/rc2022/registration)., The report contains the Reproducibility Summary in the first page., The latex .zip file is verified from the camera ready Google Colab script

Latex: zip

Journal: ReScience Volume 9 Issue 2 Article 11

Doi: https://www.doi.org/10.5281/zenodo.8173674

Code: https://archive.softwareheritage.org/swh:1:dir:8aa22d2a6b71c52b0863a06ab40f40ada1ec5355

0 Replies

Loading