\section{Related Work}

Several works have addressed the problem
of clustering variables
to reduce dimensionality
while maintaining the
identifiability of causal effects.
Both \citet{anand2023causal} and \citet{wahl2023foundations}
deal with the problem of partitioning
a causal graph into clusters
where causal relations at the micro-level
are translated
into causal edges at the macro-level.
\citet{tikka2023clustering}
instead study a particular class of groups,
which they define as \emph{transit clusters},
where only a subset of the variables
is allowed to have
edges entering or leaving
the cluster.

Unlike these works,
we focus
on the necessary conditions
for causal abstraction,
which leads to different
definitions for the grouping of micro-variables.
It remains, however,
an interesting direction
to assess
whether different assumptions,
for instance on the intervention map,
might lead to comparable definitions.

In parallel, several recent papers have explored the problem of fitting an abstraction function from data, focusing on either discrete~\citep{zennaro2023jointly,felekis2024causal} or linear~\citep{kekic2023targeted,geiger2024finding} SCMs. Notably, in addition to interventional samples, all these works assume at least partial knowledge of the graphs, the intervention map, or the set of relevant concrete variables corresponding to each abstract one.

Based on our theoretical results on the graphical and parametric conditions of linear abstraction for linear causal models,
we instead propose 
to learn the abstract model, the concrete model,
and their abstraction function directly from observational data,
without any prior knowledge of, or constraint on, the graphical structure of the two models.
