\begin{figure*}[t]
  \centering
  \begin{subfigure}[b]{0.48\textwidth}
    \includegraphics[width=\linewidth]{tikzbuilt/main-figure0.pdf}
    \caption{%
      $\mat{T}$-Abstraction
    }\label{subfig:a}
  \end{subfigure}
  \hfill
  \begin{subfigure}[b]{0.48\textwidth}
    \includegraphics[width=\linewidth]{tikzbuilt/main-figure1.pdf}
    \caption{
      Abs-LiNGAM
    }\label{subfig:b}
  \end{subfigure}
  \caption{%
    An overview of our contributions:
    (a.) A linear SCM~$\mathcal{H}$, representing the \emph{abstract} causal model,
    is a $\mat{T}$-abstraction
    of a linear SCM~$\mathcal{L}$, representing the \emph{concrete} causal model,
    whenever the
    linear transformation~$\mat{T}$
    from
    concrete
    to abstract variables
    is interventionally consistent, i.e., whenever it relates
    both values and
    interventions on the abstract model and the concrete model.
    We prove
    that,
    for each abstract variable~$Y$,
    the transformation $\mat{T}$
    induces
    a block~$\Pi(Y)$ of concrete causal variables
    that necessarily follows
    the causal ordering
    of the abstract model
    and whose parameters
    are constrained
    by the abstract coefficients.
    For each block,
    the abstraction function
    depends on a possibly smaller
    subset of \emph{relevant} variables,
    which we portray as dashed.
    (b.) We
    propose
    Abs-LiNGAM,
    a method
    to speedup the causal discovery
    of the concrete model~$\mathcal{L}$
    given an additional dataset $\dset_{J}$
    sampled from the joint distribution
    of the abstract and the concrete model.
    In order,
    Abs-LiNGAM
    (i.) reconstructs the transformation $\mat{T}$,
    (ii.) fits the abstract model by abstracting the concrete dataset $\dset_{\scm{L}}$,
    (iii.) infers a set of constraints $\mat{K}$ on which paths cannot exist in the concrete graph,
    and finally (iv.) discovers the concrete model in a search space reduced by the constraints.
  }\label{fig:mainfig}
\end{figure*}

\section{Introduction}

Causal Abstraction formalizes the property of distinct causal models to describe the same phenomenon with different levels of detail~\citep{beckers2019abstracting}. Despite having different variables and mechanisms, whenever two Structural Causal Models (SCMs) are in an abstraction relation, there must always exist at least one implementation on the low-level \emph{concrete} model of any property of the high-level \emph{abstract} one --- such as values, interventions, mechanisms, and endogenous or exogenous distributions. 

Abstract causal models allow the interpretation of causal models with large number of variables, such as in climate phenomena~\citep{chalupka2016unsupervised} or brain activation patterns~\citep{dubois2020personality}. Causal Abstraction has also found wide interest in explainable AI to align machine representations with human-interpretable concepts in feedforward neural networks~\citep{geiger2021causal,geiger2023causal}, concept-based neural networks~\citep{marconato2023interpretability}, and Large Language Models~\citep{wu2024interpretability,geiger2024finding}.

Previous works on the definition of Causal Abstraction do not focus on the graphical and parametrical conditions for two models to be in an abstraction relation. Furthermore, the problem of learning abstractions from data, when the high-level model is not known, is still open. In this context, \citet{zennaro2022abstraction} and \citet{geiger2023causal} propose methods to learn an abstraction function assuming to know both the low-level and the abstract model, while \citet{chalupka2016unsupervised,kekic2023targeted} and \citet{felekis2024causal} assume to have at least the graphical structure of the abstract model.

In this paper, we tackle these issues by focusing on the scenario where two linear SCMs are abstracted by a linear transformation, as shown in \Cref{fig:mainfig}.
In particular,
we study necessary and sufficient conditions
for abstraction
in terms of the edges
and the coefficients of the models.
We then propose Abs-LiNGAM,
a strategy
to learn from data
the abstract model,
the concrete model,
and their abstraction function
under the further assumption
of non-Gaussian exogenous noise.
We summarize our contributions as follows:


\begin{enumerate}
  \item%
    We first prove that abstract edges
    necessarily require
    edges in the low-level model
    to connect relevant variables,
    i.e., variables on which
    the abstraction function
    directly depends (\Cref{theo:connectivity}).
    Then,
    we show that the abstraction necessarily arranges concrete variables in adjacent and disjoint blocks that must follow the abstract causal ordering (\Cref{theorem:absord}).
  \item%
    We then prove a necessary and sufficient condition
    for causal abstraction
    that relates
    the coefficients
    of the linear models and
    the abstraction function (\Cref{theo:concretization}).
    In this way,
    we can characterize
    the set of all concrete models
    that are abstracted by a given abstract SCM
    and design a complete and correct algorithm to sample any model from this set (\Cref{alg:samplingblocks}).
  \item%
    We introduce Abs-LiNGAM,
    a method to speedup the causal discovery
    of large linear non-Gaussian models
    given an additional and small dataset sampled
    from the observational joint distribution
    of the model
    and one of its abstractions.
    Abs-LiNGAM recovers the abstraction function,
    learns the abstract model
    using low-level data,
    and finally
    constrains the recovery
    of the concrete model
    by ensuring that the necessary conditions
    we introduced
    are satisfied (\Cref{alg:abslingam}).
    \item%
    As we report in \Cref{sec:experiments},
    experiments in simulated settings
    show that Abs-LiNGAM substantially reduces the search space, and thus the execution time, compared to directly solving
    the problem on the low-level dataset
    with DirectLiNGAM~\citep{shimizu2011directlingam}.
\end{enumerate}

We also publicly release online
the code 
of Abs-LiNGAM and the experimental settings\footnote{\url{https://github.com/rmassidda/causabs}}.

