\section{Formal Definitions of Causes and Explanations}\label{app:cause}

The material in this section is largely taken from~\cite{CH24}, and the reader is referred to that paper for more context.

Causal models capture the way some variables causally influence others. This
influence is modeled by a set of {\em structural equations}.
The variables are typically split into two sets: {\em exogenous\/} variables, whose values are
determined by factors outside the model, and the
{\em endogenous\/} variables, whose values are determined by
the exogenous variables. The structural equations
describe how these values are determined. We also assume acyclicity. In other words,
given the values of exogenous variables, we can propagate these values according to the structural equations
and get a complete valuation of all variables in the model.

Formally, a \emph{causal model} $M$
is a pair $(\Scal, \cF)$, where $\Scal$ is a \emph{signature}, which explicitly
lists the endogenous and exogenous variables  and characterizes
their possible values, and $\cF$ defines a set of \emph{(modifiable)
structural equations}, relating the values of the variables.  
A signature $\Scal$ is a tuple $(\U,\V,\R)$, where $\U$ is a set of
exogenous variables, $\V$ is a set 
of endogenous variables, and $\R$ associates with every variable $Y \in 
\U \cup \V$ a nonempty set $\R(Y)$ of possible values for 
$Y$ (\ie, the set of values over which $Y$ {\em ranges}).  
For simplicity, we assume here that $\V$ is finite, as is $\R(Y)$ for
every endogenous variable $Y \in \V$.
$\cF$ associates with each endogenous variable $X \in \V$ a
function denoted $F_X$
(\ie, $F_X = \cF(X)$)
such that $F_X: (\times_{U \in \U} \R(U))
\times (\times_{Y \in \V - \{X\}} \R(Y)) \rightarrow \R(X)$.
This mathematical notation just makes precise the fact that 
$F_X$ determines the value of $X$,
given the values of all the other variables in $\U \cup \V$.

The structural equations define what happens in the presence of external
interventions. 
Setting the value of some variable $X$ to $x$ in a causal
model $M = (\Scal,\cF)$ results in a new causal model, denoted
$M_{X\gets x}$, which is identical to $M$, except that the
equation for $X$ in $\cF$ is replaced by $X = x$.

We can also consider \emph{probabilistic causal models};
these are pairs $(M,\Pr)$, 
where $M$ is a causal model and $\Pr$ is a
probability on the contexts in $M$.

The dependencies between variables in a causal model $M = ((\U,\V,\R),\cF)$
can be described using a {\em causal network}(or \emph{causal graph}),
whose nodes are labeled by the endogenous and exogenous variables in
$M$, with one node for each variable in $\U \cup
\V$.  The roots of the graph are (labeled by)
the exogenous variables.  There is a directed edge from  variable $X$
to $Y$ if $Y$ \emph{depends on} $X$; this is the case
if there is some setting of all the variables in 
$\U \cup \V$ other than $X$ and $Y$ such that varying the value of
$X$ in that setting results in a variation in the value of $Y$; that
is, there is 
a setting $\vec{z}$ of the variables other than $X$ and $Y$ and values
$x$ and $x'$ of $X$ such that
$F_Y(x,\vec{z}) \ne F_Y(x',\vec{z})$.

We call a pair $(M,\vec{u})$ consisting of a causal model $M$ and a
context $\vec{u}$ a \emph{(causal) setting}.
A causal formula $\psi$ is true or false in a setting.
We write $(M,\vec{u}) \satt \psi$  if
the causal formula $\psi$ is true in
the setting $(M,\vec{u})$.
Finally, 
$(M,\vec{u}) \satt [\vec{Y} \gets \vec{y}]\varphi$ if 
$(M_{\vec{Y} = \vec{y}},\vec{u}) \satt \varphi$,
where $M_{\vec{Y}\gets \vec{y}}$ is the causal model that is identical
to $M$, except that the 
equations for variables in $\vec{Y}$ in $\cF$ are replaced by $Y = y$
for each $Y \in \vec{Y}$ and its corresponding 
value $y \in \vec{y}$.

A standard use of causal models is to define \emph{actual causation}: that is, 
what it means for some particular event that occurred to cause 
 another particular event. 
We briefly review the relevant definitions below.

The events that can be causes are arbitrary conjunctions of primitive
events (formulas of the form $X=x$); 
the events that can be caused are arbitrary Boolean combinations of primitive events.  
an arbitrary formula $\phi$. 

\dfn\label{def:AC}[Actual cause]
$\vec{X} = \vec{x}$ is 
an \emph{actual cause} of $\varphi$ in $(M,\vec{u})$ if the
following three conditions hold: 
\begin{description}
\item[{\rm AC1.}]\label{ac1} $(M,\vec{u}) \models (\vec{X} = \vec{x})$ and $(M,\vec{u}) \models \varphi$. 
\item[{\rm AC2.}] There is a
  a setting $\vec{x}'$ of the variables in $\vec{X}$, a 
(possibly empty)  set $\vec{W}$ of variables in $\V - \vec{X}'$,
and a setting $\vec{w}$ of the variables in $\vec{W}$
such that $(M,\vec{u}) \models \vec{W} = \vec{w}$ and
$(M,\vec{u}) \models [\vec{X} \gets \vec{x}', \vec{W} \gets
    \vec{w}]\neg{\varphi}$, and moreover
\item[{\rm AC3.}] \label{ac3}\index{AC3}  
  $\vec{X}$ is minimal; there is no strict subset $\vec{X}'$ of
  $\vec{X}$ such that $\vec{X}' = \vec{x}''$ can replace $\vec{X} =
  \vec{x}'$ in 
  AC2, where $\vec{x}''$ is the restriction of
$\vec{x}'$ to the variables in $\vec{X}'$.
\end{description}
\edfn

To define explanation, we need the notion of \emph{sufficient cause}
in addition to that of actual cause.

\dfn\label{def:SC}[Sufficient cause]
$\vec{X} = \vec{x}$ is 
a \emph{sufficient cause} of $\varphi$ in $(M,\vec{u})$
if the following four conditions hold: 
\begin{description}
\item[{\rm SC1.}] \label{sc1} $(M,\vec{u}) \models (\vec{X} = \vec{x})$ and $(M,\vec{u}) \models \varphi$. 
\item[{\rm SC2.}] \label{sc2} Some conjunct of $\vec{X} = \vec{x}$ is part of 
an actual cause
of $\varphi$ in $(M,\vec{u})$. 
More precisely, there exists a conjunct $X=x$ of $\vec{X} = \vec{x}$ and another (possibly empty) conjunction $\vec{Y} = \vec{y}$ 
such that $X=x \wedge \vec{Y} = \vec{y}$ is an actual cause of $\varphi$ in $(M,\vec{u})$.  
\item[{\rm SC3.}] \label{sc3} $(M,\vec{u}') \models [\vec{X} =
    \vec{x}]\varphi$  for all contexts $\vec{u}' \in 
\R(\U)$.  
\item[{\rm SC4.}] \label{sc4} $\vec{X}$ is minimal; there is no strict subset $\vec{X}'$ of $\vec{X}$ such that $\vec{X}' = \vec{x}'$ satisfies conditions
SC1, SC2, and SC3, where $\vec{x}'$ is the restriction of $\vec{x}$ to the variables in $\vec{X}'$.
\end{description}
\edfn

The notion of explanation builds on the notion of sufficient
causality, and is relative to a set of contexts.
\dfn\label{def:EX}[Explanation]
$\vec{X} = \vec{x}$ is 
an \emph{explanation} of $\varphi$ relative to a set $\K$ of contexts 
in a causal model $M$ if the following conditions hold:  
\begin{description}
\item[{\rm EX1.}]  $\vec{X} = \vec{x}$ is a sufficient cause of $\varphi$ in all contexts in $\K$ satisfying 
$(\vec{X}=\vec{x}) \wedge \varphi$. 
More precisely,
\begin{itemize}
\item If $\vec{u} \in \K$ and $(M,\vec{u}) \models (\vec{X} = \vec{x})
  \wedge \varphi$, then there exists a conjunct $X=x$ of $\vec{X} =
  \vec{x}$ and a (possibly empty) conjunction $\vec{Y} = \vec{y}$ such
  that $X=x \wedge \vec{Y} = \vec{y}$ is an actual cause of $\varphi$
  in $(M,\vec{u})$. (This is SC2 applied to all contexts  
    $\vec{u} \in \K$ where 
    $(\vec{X} = \vec{x}) \wedge \varphi$ holds.)
\item $(M,\vec{u}') \models [\vec{X} = \vec{x}]\varphi$  for all
  contexts $\vec{
    u}' \in \K$. (This is SC3 restricted to the contexts in $\K$.)
\end{itemize}
\item[{\rm EX2.}] $\vec{X}$ is minimal; there is no
  strict subset $\vec{X}'$ of $\vec{X}$ such that $\vec{X}' =
  \vec{x}'$ satisfies EX1,  
where $\vec{x}'$ is the restriction of $\vec{x}$ to the variables in $\vec{X}'$. (This is SC4).
\item[{\rm EX3.}] \label{ex3} $(M,u) \sat \vec{X} = \vec{x} \wedge
  \varphi$ for some $u \in \K$.
\end{description}
\edfn

The requirement that the first part of condition EX1
as given here
holds in all
contexts in $\K$ that satisfy 
$\vec{X} = \vec{x} \land \phi$ and that
the second part holds in all contexts in $\K$ is quite strong, and
often does not hold in practice.   We are often willing to accept
$\vec{X} = \vec{x}$ as an explanation if these requirements hold with
high probability.  
Given a set $\K$ of contexts in a causal model $M$, 
let $K_\psi$ consist of all contexts $\vec{u}$ in $\K$ such that
$(M,\vec{u}) \sat \psi$, and 
let $\K(\vec{X} =
\vec{x}, \varphi, \mbox{SC2})$ consist of all contexts $\vec{u} \in
\K$ that satisfy $\vec{X} = \vec{x}\wedge \varphi$ and the first
  condition in EX1 (i.e., the analogue of SC2).
\dfn\label{def:PEXProper}[Partial Explanation]
$\vec{X} = \vec{x}$ is 
a \emph{partial explanation} of $\varphi$ with goodness
$(\alpha,\beta)$ relative to $\K$ in a probabilistic causal model $(M,\Pr)$ if
\begin{description}

\item[{\rm EX1$'$.}]  
  $\alpha \le Pr(\K(\vec{X} = \vec{x}, \varphi, \mbox{SC2}) \mid
 \K_{\vec{X} = \vec{x} \land \phi})$ and
 $\beta \le Pr(\K_{[\vec{X} = \vec{x}]\phi})$.
\item[{\rm EX$2'$.}] $\vec{X}$ is minimal; there is no
  strict subset $\vec{X}'$ of $\vec{X}$ such that
  $\alpha \le Pr(\K(\vec{X}' = \vec{x}', \varphi, \mbox{SC2}) \mid
  \K_{\vec{X}' = \vec{x}' \land \phi})$
  and
  $\beta \le Pr(\K_{[\vec{X}' = \vec{x}']\varphi})$,
  where $x'$ is the restriction of $\vec{x}$ to the variables in $X$.
\item[{\rm EX$3'$.}]   $(M,u) \sat \vec{X} = \vec{x} \wedge
  \varphi$ for some $u \in \K$.
\end{description}  
\edfn

\begin{figure*}[t]
    \centering
    \begin{subfigure}{0.2\textwidth}
    \centering
    \includegraphics[scale=0.2]{images/pancreas_iters/pancreas.png}
    \caption{A pancreas CT slice}
    \label{fig:pancreas_ct}
\end{subfigure}
\hfill
\begin{subfigure}{0.2\textwidth}
    \centering
    \includegraphics[scale=0.2]{images/pancreas_iters/iter1.png}
    \caption{1 iteration}
    \label{fig:pancreas_1}
\end{subfigure}
\hfill
\begin{subfigure}{0.2\textwidth}
    \centering
    \includegraphics[scale=0.2]{images/pancreas_iters/iters10.png}
    \caption{10 iterations}
    \label{fig:pancreas_10}
\end{subfigure}
\hfill
\begin{subfigure}{0.2\textwidth}
    \centering
    \includegraphics[scale=0.2]{images/pancreas_iters/iters30.png}
    \caption{30 iterations}
    \label{fig:pancreas_30}
\end{subfigure}
\caption{The smoothing of the responsibility map over multiple iterations, here shown on a slice from an CT image of a pancreas (\Cref{fig:pancreas_ct}). \rex extracts explanations using the responsibility pixel ranking.}
\label{fig:pancreas}
\end{figure*}

\begin{figure*}
    \centering
     \begin{subfigure}{0.4\textwidth}
    \centering
    \includegraphics[scale=0.36]{images/lung/lung.png}
    \caption{A lung CT slice with a tumor}
    \label{fig:lung_1}
\end{subfigure}
\hfill
\begin{subfigure}{0.4\textwidth}
    \centering
    \includegraphics[scale=0.3]{images/lung/heatmap.png}
    \caption{Heatmap of responsibility}
    \label{fig:lung_2}
\end{subfigure}
\caption{\rex also produces heatmaps of the responsibility map. We have manually marked the location of the lung tumor in~\Cref{fig:lung_2}. The heatmap includes the tumor, but seems to be localizing slightly to the left of the main lump.}
\label{fig:lung}
\end{figure*}


\section{Explanations in \rex}\label{app:rex}

\rex is a causal explainability tool that produces a \emph{responsibility landscape}. From this landscape, it extracts causal explanations: sets of pixels, possibly disjoint, that 
are sufficient to reproduce the original model classification. We show a typical example in~\Cref{fig:typical} and another for lung data in~\Cref{fig:lung}. The tool itself is available at \url{https://github.com/ReX-XAI/ReX}.
While the full algorithm is rather complex, broadly speaking, \rex creates mutants of an initial input image by subdividing it into $4$ \emph{superpixels}. These superpixels are created
 by random partitioning. The model is queried on all combinations of these superpixels, with ``non-active'' superpixels
set to a masking value (by default $0$). The causal responsibility is calculated for these combinations. Combinations with non-zero responsibility are further broken down into more (smaller) superpixels and the process
repeated. Once superpixels reach a predefined size limit, the algorithm quits. This procedure is repeated many times to avoid the issue of a poor initial partitioning. 
The effect of multiple iterations is to smooth the final responsibility map. The map then provides a pixel ranking from which \rex greedily extracts an explanation. 
Pixels are added into an initially blank image, from highest responsibility to lowest,
until the pixels are sufficient to obtain the same class as the initial class prediction. 


\section{Neutrality of Neutral Grid}\label{neutrality}

\begin{figure*}[t]
\centering
\begin{subfigure}{0.2\textwidth}
    \centering
    \includegraphics[width=\textwidth]{images/TCGA_FG_A4MT_20020212_26.png}
    \caption{A healthy brain with $0.54$ confidence}
    \label{fig:lowest}
\end{subfigure}
\hfill
\begin{subfigure}{0.2\textwidth}
    \includegraphics[width=\textwidth]{images/AFMT_26_1.png}
    \caption{The grid changes the classification to $1$}
    \label{fig:lowest_grid}
\end{subfigure}
\hfill
\begin{subfigure}{0.2\textwidth}
    \includegraphics[width=\textwidth]{images/TCGA_HT_7608_19940304_10.png}
    \caption{A healthy brain with $0.99$ confidence}
    \label{fig:third}
\end{subfigure}
\hfill
\begin{subfigure}{0.2\textwidth}
    \includegraphics[width=\textwidth]{images/7608_10_1.png}
    \caption{An explanation for a healthy brain}
    \label{fig:fourth}
\end{subfigure}       
\caption{A selection of images and their actual causal explanations where the grid changed the classification from no-tumor to tumor. To the human eye, at least, there is no obvious reason why the grid has had the effect of a counterfactual. The explanations are unusually large.}
\label{fig:badgrids}
\end{figure*}

The effect of the user-colored grid is not entirely neutral on the confidence of the model on a given image. On the brain MRI data, for example, in a small number of cases ($15$) in the brain data, the calculated grid actually changes the classification from negative to positive. In general, the model confidence on these $15$ images is low, with a mean value of $0.83$, with the lowest confidence for the no-tumor classification being just $0.54$. It is, of course, possible to set a \betagood as a target rather than as a byproduct of the grid. We envisage this being the actual use case for clinicians. If a user required an explanation of absence to have a $\beta$ of $1$ then it would simply be the case of changing the density and radius of the grid until this is achieved. This procedure could form part of a dialogue between user and model, strengthening trust in the model, or revealing its weaknesses.

\Cref{fig:badgrids} shows a representative sample of these flipped classifications. Further investigation is required to discover why the grid changes the class in this year. Likely this is due to a disruption, by the grid, of some learned concept.

% \section{Calculating \texorpdfstring{$\alpha$}{alpha}}\label{app:alpha}The probability that an image in $\K$ has both $\vec{X} = \vec{x} \land \varphi$ is given by $\alpha$. For simplicity, let us say that $\varphi$ means ``tumor''. $\vec{X} = \vec{x}$ then will be a set of pixels and their (RGB) values. Even if we ignore the spatial dimension of of pixels, the chances of $\vec{X} = \vec{x}$ being the same in a different image for which $\varphi$ holds is very slim. Explanations are not usually identical between images with the same classification. This can be seen immediately when looking atk images of tumors. If we take into account the spatial dimension as well, then $\alpha$ is likely to be near $0$ on any dataset. If we are interested in $\vec{G} = \vec{g}$, our absence grid, then we can approximately control the value of $\alpha$ if either $\K$ does not contain any healthy slices (in which case $\alpha = 1$), or we use a neutral, constant, masking value for the grid. In either case, $\alpha$ does not reveal as much information about our grid as we would like. 

% \section{Calculating \texorpdfstring{$\beta$}{beta}}\label{app:beta}
% We present a simple visual example of a \betagood calculation (\Cref{fig:example}). 
% We also superimpose the smallest explanation of a tumor onto the original healthy slice as a sanity check. This procedure is obviously not perfect as brains even at the same slice may be a different sizes. Also, the context itself plays a part in the classification: an explanation of a tumor on a completely blank background does not necessarily carry the same explanatory weight on a different background which is a question of robustness. Causal explanations, by virtue of their minimality, tend not to be robust. Hence, the explanation of the smallest tumor is itself a \emph{partial} explanation. Future work will examine the usefulness of explanation \emph{vs.} partial explanation of tumor on grid calculation from the point of view of the clinician.
