CXAD: Contrastive Explanations for Anomaly Detection: Algorithms, Complexity Results and Experiments

Ian Davidson; Nicolás Kennedy; S. S. Ravi

Published: 09 Jun 2025, Last Modified: 09 Jun 2025. Accepted by TMLR. License: CC BY 4.0

Abstract: Anomaly/outlier detection (AD/OD) is often used in controversial applications to detect unusual behavior, which is then further investigated or policed. An explanation of why something was predicted to be an anomaly is therefore desirable not only for individuals but also for the general population and policy-makers. However, existing explainable AI (XAI) methods are not well suited for explainable anomaly detection (XAD). In particular, most XAI methods provide instance-level explanations, whereas a model/global-level explanation is desirable for a complete understanding of the definition of normality or abnormality used by an AD algorithm. Further, existing XAI methods try to explain an algorithm’s behavior by finding an explanation of why an instance belongs to a category. However, by definition, anomalies/outliers are chosen because they are different from the normal instances. We propose a new style of model-agnostic explanation, called contrastive explanation, that is designed specifically for AD algorithms. It addresses the novel challenge of providing a model-agnostic and global-level explanation by finding contrasts between the outlier group of instances and the normal group. We propose three formulations: (i) Contrastive Explanation, (ii) Strongly Contrastive Explanation, and (iii) Multiple Strong Contrastive Explanations. The last formulation is specifically for the case where a given dataset is believed to have many types of anomalies. For the first two formulations, we show that the underlying problem is in the complexity class P by presenting linear and polynomial time exact algorithms. We show that the last formulation is computationally intractable, and we use an integer linear program for that version to generate experimental results. We demonstrate our work on several datasets, such as the CelebA image dataset, the HateXplain language dataset, and the COMPAS dataset on fairness. These datasets are chosen because their ground truth explanations are clear or well-known.

Submission Length: Long submission (more than 12 pages of main content)

Changes Since Last Submission: As requested by the area chair and reviewers, we have made the following three changes: 1) We have introduced a new sub-section ("Tag Generation, Assumptions and Limitations") in Section 2, including Table 2, which shows various sources of tags. 2) We have tempered our claims where appropriate. 3) We have made the notation consistent. Finally, as suggested, we have included a link to our GitHub repository of the code.

Video: https://www.dropbox.com/scl/fi/uztvxmany1ahmialms4pv/FinalCXADClip.mp4?rlkey=zg9x03n7nc3kkusxvno4e1kab&st=ugepyqqf&dl=0

Code: https://github.com/nicbk/tmlr-cxad?tab=readme-ov-file

Assigned Action Editor: ~Samira_Ebrahimi_Kahou1

Submission Number: 3990
