Keywords: additive models, GAM, SHAP, Shapley
Abstract: In recent years, the Shapley value and SHAP explanations have emerged as one
of the most dominant paradigms for providing post-hoc explanations of blackbox models. Despite their well-founded theoretical properties, many recent works
have focused on the limitations in both their computational efficiency and their
representation power. The underlying connection with additive models, however,
is left critically under-emphasized in the current literature. In this work, we find
that a variational perspective linking GAM models and SHAP explanations is able
to provide deep insights into nearly all recent developments. In light of this connection, we borrow in the other direction to develop a new method to train interpretable GAM models which are automatically purified to compute the Shapley
value in a single forward pass. Finally, we provide theoretical results showing the
limited representation power of GAM models is the same Achilles’ heel existing
in SHAP and discuss the implications for SHAP’s modern usage in CV and NLP.
Primary Area: interpretability and explainable AI
Code Of Ethics: I acknowledge that I and all co-authors of this work have read and commit to adhering to the ICLR Code of Ethics.
Submission Guidelines: I certify that this submission complies with the submission instructions as described on https://iclr.cc/Conferences/2025/AuthorGuide.
Reciprocal Reviewing: I understand the reciprocal reviewing requirement as described on https://iclr.cc/Conferences/2025/CallForPapers. If none of the authors are registered as a reviewer, it may result in a desk rejection at the discretion of the program chairs. To request an exception, please complete this form at https://forms.gle/Huojr6VjkFxiQsUp6.
Anonymous Url: I certify that there is no URL (e.g., github page) that could be used to find authors’ identity.
No Acknowledgement Section: I certify that there is no acknowledgement section in this submission for double blind review.
Submission Number: 1649
Loading