P-TAME: Explain Any Image Classifier with Trained Perturbations

Mariano V. Ntrougkas; Vasileios Mezaris; Ioannis Patras

P-TAME: Explain Any Image Classifier with Trained Perturbations

Mariano V. Ntrougkas, Vasileios Mezaris, Ioannis Patras

Published: 23 Jun 2025, Last Modified: 23 Jun 2025Greeks in AI 2025 PosterEveryoneRevisionsBibTeXCC BY 4.0

Keywords: 1. Vision and Learning

TL;DR: P-TAME (Perturbation-based Trainable Attention Mechanism for Explanations) is a model-architecture-agnostic method for explaining DNN-based image classifiers. IEEE Open Journal of Signal Processing, 2025, https://doi.org/10.1109/OJSP.2025.3568756.

Abstract: The adoption of Deep Neural Networks (DNNs) in critical fields where predictions need to be accompanied by justifications is hindered by their inherent black-box nature. This paper introduces P-TAME (Perturbation-based Trainable Attention Mechanism for Explanations), a model-agnostic method for explaining DNN-based image classifiers. P-TAME employs an auxiliary image classifier to extract features from the input image, bypassing the need to tailor the explanation method to the internal architecture of the backbone classifier being explained. Unlike traditional perturbation-based methods, which have high computational requirements, P-TAME offers an efficient alternative by generating high-resolution explanations in a single forward pass during inference. We apply P-TAME to explain the decisions of VGG-16, ResNet-50, and ViT-B-16, three distinct and widely used image classifiers. Quantitative and qualitative results show that P-TAME matches or outperforms previous explainability methods, including model-specific ones. Code and trained models are available at https://github.com/IDT-ITI/P-TAME. Keyword: 1. Vision and Learning. IEEE Open Journal of Signal Processing, 2025, Early Access: https://doi.org/10.1109/OJSP.2025.3568756.

Submission Number: 75

Loading