\section{Background on Actual Causality}\label{sec:cause}
%\section{Actual Causes, Explanations and Partial Explanations}\label{sec:cause}

While the need for explanations is recognized almost universally, there is no definition of explanation even close to universal acceptance~\citep{Mil19}. We use a definition provided by the theory of actual causality. This definition has a number of useful properties which we use in our method.
Actual causality was first
introduced by~\citet{HP05a}. We refer the reader to that paper
and to~\citet{Hal19} for an updated overview and more
information on actual causality
(see also the supplementary material
  for the formal definition of explanation in the general case).
Below we give an informal introduction to the theory and
simplified definitions suitable for the case of image classification.
%See supplementary materials for the formal definitions.
The definition of an \emph{actual cause} is based on the concept of \emph{causal models}, which consist of
a set of variables, a range of each variable, and structural equations describing the
dependencies between the variables. Actual causes are defined with respect to a given causal model, 
a given assignment to the variables of the model (a context), and a propositional formula that
holds in the model in this context.

\emph{Actual causality} extends simple counterfactual reasoning~\cite{Hume39}
by considering the effect of \emph{interventions}, which are changes of the current setting.
Roughly speaking, a subset of variables~$\vec{X}$ and their values in a given context
is an actual cause of a Boolean formula~$\varphi$ being True if there exists a change
in the values of other variables that creates a counterfactual dependency
between the values of $\vec{X}$ and $\varphi$ (that is, if we change the values
of variables in~$\vec{X}$, $\varphi$~would be falsified). The formal definition by~\citet{HP05a}
and its modifications, the latest of which is by~\citet{Hal15}, are far more
complex, as they account for potential dependencies between the variables and for
causes consisting of more than one element. In our setup, where we are only interested in singleton causes and in
interventions only on the input variables, all versions of the definition of (a part of) an actual cause
are equivalent under the assumption of independence between the input variables. This assumption is
far from trivial, and we discuss its implications in \cref{sec:limit}.

In the context of image classification, following~\cite{CH24},
we take endogenous variables to be the set $\vec{V}$ of pixels
that the image classifier gets as input, together with an output
variable that we call $O$.  The variable $V_i \in \vec{V}$ describes
the color and intensity of pixel $i$; its value is determined by the exogenous variables.  
The equation for $O$ determines the output of the
neural network as a function of the pixel values.
As mentioned above, we assume that there are no dependencies between the feature variables; thus, the causal network has depth $2$. 
Although this assumption does not hold in general, in the context of MRI and CT scans it is reasonably accurate, 
as tumors can appear in most parts of an affected organ. Assuming independence also makes the algorithms much simpler.

\cite{CH24} proved that for a causal model corresponding to an image
  classifier $\cN$, the following definition is equivalent to the definition of explanation in actual causality.
\begin{definition}[Explanation]\label{def:exp}  
  $\vec{X} = \vec{x}$ is an \emph{explanation} of $O=o$ iff the following conditions hold:
  \begin{description}
  \item[EX1] Setting $\vec{X}$ to $\vec{x}$ results in the classification $O=o$ for all images in the dataset;
  \item[EX2] For all images $\I$ in which $\vec{X} = \vec{x}$ and $O=o$, at least one conjunct $X=x$ in $\vec{X} = \vec{x}$ 
  is (a part of) an actual cause of $O=o$; in other words, 
  there exists a (possibly empty) set of variables $\vec{Y}$, a value $x'$, and a set of values $\vec{y}'$ such that setting 
  $X$ to $x'$ together with setting $\vec{Y}$ to $\vec{y}'$ results in $O \neq o$;
  \item[EX3] $\vec{X}$ is minimal, that is, no subset of $\vec{X}$ satisfies the conditions above. 
  \end{description}
\end{definition}  

``Folded'' in \cref{def:exp} is the definition of an \emph{actual cause} of $O=o$, which, using the notation in EX2,
would be $(\{X\} \cup \vec{Y} = \{x\}\cup \vec{y})$. 
The notion of \emph{responsibility} quantifies actual causality and is
defined for $X=x$ as above as $1/(|\vec{Y}|+1)$, where $\vec{Y}$ is the smallest set satisfying EX2.
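
As a hedged illustration (the scenario is hypothetical and chosen only to show the arithmetic), suppose that changing the value of a pixel $X$ alone leaves the classification $O=o$ unchanged, but changing $X$ together with two other pixels $\vec{Y} = \{Y_1, Y_2\}$ results in $O \neq o$, and no smaller set $\vec{Y}$ has this effect. Then the responsibility of $X=x$ for $O=o$ is
\[
  \frac{1}{|\vec{Y}|+1} = \frac{1}{2+1} = \frac{1}{3}.
\]
If changing $X$ alone were enough to change the classification, then $\vec{Y} = \emptyset$ and the responsibility of $X=x$ would be $1/(0+1) = 1$.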

To facilitate a dialog between the clinician and the AI system, we also use the definition of a \emph{partial explanation}
for image classifiers by~\cite{CH24}.
\dfn\label{def:PEX}[Partial Explanation]
 $\vec{X} = \vec{x}$ is a \emph{partial explanation} of $O=o$  
  with goodness $(\alpha,\beta)$, where $\alpha, \beta > 0$, relative to a set of images $\K$ if the following
  conditions hold:
  \begin{description}
  \item[PEX1] setting $\vec{X}$ to $\vec{x}$ results in the classification $O=o$ for all images in the dataset with probability at least $\beta$;
  \item[PEX2] the probability that $\vec{X} = \vec{x}$ is (a part of) an actual cause of $O=o$ in an image $\I$ in the dataset is at least $\alpha$;
  \item[PEX3] $\vec{X}$ is minimal.
  \end{description}
\edfn 
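
One possible reading of the two probabilities, sketched here under the simplifying assumption (made only for illustration) that every image in a finite set $\K$ is equally likely, is as empirical frequencies. Writing $\hat{\alpha}$ and $\hat{\beta}$ for these hypothetical estimates,
\[
  \hat{\beta} = \frac{|\{\I \in \K : \mbox{setting } \vec{X} \mbox{ to } \vec{x} \mbox{ in } \I \mbox{ yields } O=o\}|}{|\K|},
\]
\[
  \hat{\alpha} = \frac{|\{\I \in \K : \vec{X}=\vec{x} \mbox{ is (a part of) an actual cause of } O=o \mbox{ in } \I\}|}{|\K|}.
\]
For instance, if $|\K| = 200$, the classification is $O=o$ for $190$ of the $200$ images after setting $\vec{X}$ to $\vec{x}$, and $\vec{X}=\vec{x}$ is (a part of) an actual cause of $O=o$ in $150$ of the $200$ images, then $\hat{\beta} = 190/200 = 0.95$ and $\hat{\alpha} = 150/200 = 0.75$. These numbers are made up and do not come from any experiment.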


\section{Explanations of Absence}\label{sec:abs}

Consider an AI model $\cN$ that classifies medical images as having or not having solid tumors. We start with a theoretical analysis with simplifying assumptions
and then discuss whether these assumptions hold for real AI models and the implications of relaxing them.

\subsection{Theoretical foundations}
Recall that we assume independence between the pixels of the image. 
We now add the assumption that tumors are equally likely in all areas of the scan. 

\begin{lemma}\label{lem:size}
Under the assumptions above, $\cN$ can only detect tumors based on the number of pixels with values (color and intensity) matching those of tumors, that is,
the size of a potential tumor on an image.
\end{lemma}
\prf
    The proof is based on the observation that due to the assumptions, the effect of changing each pixel in an input image is the same. Hence, $\cN$'s
    decisions rely only on the number of the pixels with values matching those of tumors, that is, the size of a potential tumor.
\eprf

The following lemma explains why responsibility maps are useless for explanations of absence, as illustrated in \cref{fig:empty}. As only \rex uses
a formal definition of \emph{responsibility} for its pixel ranking map, we use this definition in the lemma.

\begin{lemma}\label{lem:flat}
If none of the pixels in an input image $\I$ have values consistent with a tumor, then every pixel of $\I$ has the same responsibility for the negative classification of $\I$ by $\cN$, equal to $1/k$, where $k$ is the size of
a smallest tumor recognized by $\cN$.
\end{lemma}
\prf
The proof is based on the observation above that under our simplifying assumptions, $\cN$ can only use the size of a candidate tumor to decide whether
to classify $\I$ as having a tumor. Hence, by EX2 of \cref{def:exp}, the responsibility of each pixel $(X=x)$ for the negative classification of
$\I$ by $\cN$ is the same and is $1/k$, where $k$ is the size of a smallest set of pixels required to change the negative classification to a positive one
(that is, ``there is a tumor'').
\eprf
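
The step from EX2 to the value $1/k$ can be spelled out as follows, again under the simplifying assumptions of this subsection. To flip the negative classification, at least $k$ pixels must be set to tumor-compatible values. For a given pixel $X$, the smallest set $\vec{Y}$ of additional pixels that must change together with $X$ therefore has $|\vec{Y}| = k-1$, and the responsibility of $X=x$ is
\[
  \frac{1}{|\vec{Y}|+1} = \frac{1}{(k-1)+1} = \frac{1}{k}.
\]
For example, for a hypothetical model that recognizes tumors of at least $k=9$ pixels, every pixel of a tumor-free image has responsibility $1/9$, so the resulting responsibility map is constant.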

\begin{crl}\label{crl:inf}
    For an input image $\I$ classified as having no tumors, the responsibility of its pixels for this classification is not an informative measure for explaining
    the classification.
\end{crl}

Based on \cref{lem:size}, the following construct is an explanation of absence of tumors in an input image $\I$ 
according to \cref{def:exp}.
\dfn\label{def:EoA}[Absence grid]
For an image $\I$ classified as not having tumors by an AI model $\cN$, a subset of pixels $\vec{G} \subseteq \I$ and their values $\vec{G} = \vec{g}$, 
is an \emph{absence grid} for $\I$ and $\cN$ if: 
\begin{description}
    \item[AG1] $\vec{G} = \vec{g}$ is a grid of clusters of pixels;
    \item[AG2] The distance between any two clusters in $\vec{G}$ is smaller than the size of a smallest tumor recognized by $\cN$; 
    \item[AG3] There exists a cluster of pixels $C \subseteq \I$ in the explanation that is (a part of) an actual cause of classifying $\I$
    as not having tumors; that is, all pixels in $C$
    have values that are incompatible with being a part of a tumor, and there exists another set of pixels $T \subseteq \I$ 
    such that changing the values of $C \cup T$ changes the classification of $\I$ to having a tumor, but changing the values of $C$ alone does not
    change the classification (of ``no tumor'');
    \item[AG4] $\vec{G} = \vec{g}$ is minimal.
\end{description}
\edfn
\noindent

It is easy to see that an absence grid is an explanation for the negative classification of $\I$ by $\cN$, according to \cref{def:exp}.
We also note that the location of the pixels on an absence grid as defined in \cref{def:EoA} depends only on $\cN$ and 
is independent of $\I$ and of the dataset. The only thing that depends on $\I$ is the values of these pixels. Therefore, an absence grid
can be constructed \emph{in advance} and projected on a given image $\I$ to get an explanation of absence of tumors.  
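
To give a sense of scale, consider a minimal sketch with made-up numbers. We assume, for illustration only, that the grid is a regular square lattice of single-pixel clusters with spacing $s$ pixels, and that the size of a smallest tumor in AG2 is read as a linear extent of $d$ pixels, so that AG2 requires $s < d$. On a $W \times H$ image, such a lattice (with points at $0, s, 2s, \ldots$ along each axis) contains
\[
  \left\lceil \frac{W}{s} \right\rceil \cdot \left\lceil \frac{H}{s} \right\rceil
\]
clusters. With $W = H = 256$, $d = 9$, and $s = 8$, this gives $32 \cdot 32 = 1024$ grid pixels out of $256 \cdot 256 = 65536$, and the same lattice positions can be reused for every image classified by $\cN$.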

\subsection{Applying the theory to practice}

\commentout{
In the context of medical images and classifications of absence of abnormalities, \Cref{def:exp} means that a set of pixels $\vec{X}$ and their values $\vec{x}$ in an image $\I$ is an explanation of the classification of $\I$
as a healthy slice if (1) superimposing $\vec{X}=\vec{x}$ on any other image in the dataset results in a ``healthy'' classification,
(2) all images in the dataset that agree with $\I$ on $\vec{X}$ and are classified as healthy will be classified as having tumors for some other values of $\vec{X}$, and (3) $\vec{X}$ is a minimal such set. For example, given the minimal size of a tumor, explaining the absence of a tumor can be done by covering the image with a ``net'' of pixels, 
none of which can be part of a tumor, such that the distance between neighboring pixels in the net is smaller than the size of a minimal tumor.
Such a net maybe an uninformative output for a clinician:
a grid is generic and can be superimposed on \emph{any} scan, and there is nothing to suggest that it explains the absence of tumors in \emph{this} particular scan. 
To provide the user with more information, we use the definition of a \emph{partial explanation} to address the partiality of absence grids.
} %end of commented out text

In practice, the assumption that solid tumors are equally likely in all areas on the scan does not quite hold, as tumors are more likely to appear in some areas than in others. Moreover, tumors might be non-homogeneous, which makes it harder to measure their size. 
Furthermore, while the assumption of independence of pixels is a good approximation in this domain, an AI model $\cN$ might also take into account an 
outline of a suspected tumor, rather than just its size, to decide whether there is a tumor on the scan. 
 
An absence grid defined in \cref{def:EoA} is, thus, impossible to construct precisely; in particular, the size of a smallest tumor may depend on the location on the scan and its shape. We therefore construct an approximation of this grid instead, as defined below.

\dfn\label{def:PEoA}[Partial Absence Grid]
For an image $\I$ classified as not having tumors by an AI model $\cN$ and a dataset $\K$, a subset of pixels $\vec{G} \subseteq \I$ and 
their values, $\vec{G} = \vec{g}$ is a \emph{partial absence grid} with goodness $(\alpha, \beta)$, where $\alpha, \beta > 0$ for $\I$ and $\cN$ in context $\K$ if: 
\begin{description}
     \item[PAG1] $\vec{G} = \vec{g}$ is a grid of clusters of pixels, such that all pixels have values incompatible with tumors;
     \item[PAG2] The distance between any two clusters in $\vec{G}$ is smaller than 
     the size of the smallest explanation, $\vec{X} = \vec{x}$, of a tumor in the set $\K$ recognized by $\cN$;
     \item[PAG3] There exists a cluster of pixels $C \subseteq \I$ in the explanation that is (a part of) an actual cause of classifying $\I$
     as not having tumors; that is, all pixels in $C$
     have values that are incompatible with being a part of a tumor, and there exists another set of pixels $T \subseteq \I$ 
     such that changing the values of $C \cup T$ changes the classification of $\I$ to having a tumor, but changing the values of $C$ alone does not
     change the classification (of ``no tumor'');
     \item[PAG4] $\vec{G} = \vec{g}$ is minimal.
\end{description}
\edfn
Note that, in particular, all pixels in $\vec{G}$ must have values inconsistent with a tumor. This may take the form of healthy value interpolation, or a neutral interpolation value. 
We discuss this choice in more detail in \Cref{sec:eval} and compare the results against taking an out-of-distribution neutral value.

For a partial absence grid $\vec{G} = \vec{g}$ as defined in \cref{def:PEoA}, let
$0 \leq \alpha, \beta \leq 1$ be such that:
\begin{itemize}
\item the probability that $\vec{G} = \vec{g}$ is (a part of) an actual cause of $O=o$ in an image $\I$ in the dataset is at least $\alpha$;
\item setting $\vec{G}$ to $\vec{g}$ results in the classification $O=o$ for all images in the dataset with probability at least $\beta$.
\end{itemize}
We are now ready to state our main result. 
\begin{fthm}\label{thm:pex}
    A partial absence grid is a partial explanation. That is, for an image $\I$ classified as not having tumors by an AI model $\cN$ and a dataset $\K$,
    the partial absence grid $\vec{G}=\vec{g}$ in \cref{def:PEoA} is a \emph{partial explanation of absence of tumors} in $\I$ with respect to $\cN$ and
    a set of contexts $\K$ with $(\alpha,\beta)$-goodness, where $\alpha$ and $\beta$ are as defined above. 
\end{fthm}
The proof follows from \cref{def:PEX}. 

\cref{thm:pex} allows us to quantitatively assess the quality of explanations of absence, constructed as in \cref{def:PEoA}. 
It is important to note that a partial absence grid is always defined with respect to a particular model and dataset: a bad model and/or a poor dataset would produce low-quality explanations. However, with the definitions provided above, we emphasize that there is no sense in which an explanation can be ``wrong'': a human might disagree with the output, but a causal explanation is sufficient to reproduce the original class. In \cref{sec:limit} we discuss how these explanations can support a dialog between an AI model and a clinician.