Explaining What Machine Learning Learns through Explainable AI

Published: 04 Jun 2026, Last Modified: 04 Jun 2026PhilML@ICML 2026 PosterEveryoneRevisionsBibTeXCC BY 4.0
Keywords: Explainable Artificial Intelligence; Opacity; Explanatory Relevance; Trust; Mechanistic Interpretability
Abstract: Recent discussions of explainable artificial intelligence (XAI) often treat explanation as a homogeneous epistemic category. This paper argues that different XAI methods provide different forms of explanatory relevance and therefore reduce opacity in importantly different ways. Drawing on Boge’s (2022) distinction between h-opacity and w-opacity, I argue that many existing XAI methods primarily reduce computational visibility while leaving learned representational structure substantially unresolved. Consequently, strong forms of explanation-based trust require explanatorily relevant access to learned output-producing structures rather than predictive reliability alone.
Email Sharing: We authorize the sharing of all author emails with Program Chairs.
Data Release: We authorize the release of our submission and author names to the public in the event of acceptance.
Submission Number: 54
Loading