\section{Additive Noise on Abstract Observations}\label{app:noisy}

\begin{figure}
  \centering
  \begin{subfigure}{\linewidth}
    \centering
    \includegraphics[width=\linewidth]{figures/exp0_abstract_roc_auc_small.pdf}
  \end{subfigure}
  \begin{subfigure}{\linewidth}
    \includegraphics[width=\linewidth]{figures/exp0_tau_roc_auc_small.pdf}
  \end{subfigure}
  \begin{subfigure}{\linewidth}
    \includegraphics[width=\linewidth]{figures/exp0_pk_precision_small.pdf}
  \end{subfigure}
  \begin{subfigure}{\linewidth}
    \includegraphics[width=\linewidth]{figures/exp0_concrete_roc_auc_small.pdf}
  \end{subfigure}
  \caption{%
    Results of Abs-LiNGAM
    over pairs of abstract ($b=5$ nodes)
    and concrete ($d\in[25,50]$ nodes)
    linear SCMs
    after perturbing the abstract observations
    with normal noise
    of increasing variance $\sigma^2$.
    We denote as ``Top-1'' the strategy
    where we force the selection
    of at most a single abstract variable
    per concrete one and as ``Top-1-Refit''
    the one where we then refit each abstraction vector.
    All results are averaged over 30 independent runs
    with $|\dset_{\scm{L}}| = 20000$ concrete samples
    and $|\dset_{\scm{J}}| = 150$ paired samples.
  }\label{fig:noisy}
\end{figure}

In this section, we discuss strategies to handle a further scenario where we consider abstract observations to be further perturbed by random noise. We consider the following generative model for the abstract observations:
\begin{align}
  \vec{e}^{(i)} &\sim \operatorname{Exponential}
                &\text{for } i=1,\dots,|\dset_{\scm{L}}|,\\
  \vec{x}^{(i)} &= \scm{L}(\vec{e}^{(i)})
                &\text{for } i=1,\dots,|\dset_{\scm{L}}|,\\
  \vec{y}^{(i)} &= \scm{H}(\gamma(\vec{e}^{(i)})) + \vec{\epsilon}^{(i)}
                &\text{for } i=1,\dots,|\dset_{J}|,
\end{align}
where $\vec{\epsilon} \sim \mathcal{N}(0, \sigma^2)$
is a Gaussian noise term
and the data-generating process
is the same of \Cref{subsec:dgp}.
Due to the presence of noise,
minimizing the least-squares error
does not ensure to recover
the true abstraction function.
We thus propose two strategies
to identify the concrete blocks
of each abstract variable.
By exploiting the fact
that each concrete variable
pertains to a single abstract variable,
we can filter the resulting matrix $\hat{\mat{T}}$
to select only the largest component per row
if it is above the threshold.
We find then beneficial
to refit the model once we have identified
the block in this way,
as in
\begin{align}
  \vec{t}_i = \argmin_{\vec{t}_i} \left\|
    \vec{x}_{\Pi_R{Y_i}} - \vec{t}_i^\tr \vec{y}_i
  \right\|^2_2.
\end{align}
In \Cref{fig:noisy},
we report results
for the reconstruction
of the blocks from the paired samples
for increasing
variance~$\sigma^2$
of the noise term
for these strategies.
