% \documentclass{uai2024} % for initial submission
\documentclass[accepted]{uai2024} % after acceptance, for a revised version; 
% also before submission to see how the non-anonymous paper would look like 
                        
%% There is a class option to choose the math font
% \documentclass[mathfont=ptmx]{uai2024} % ptmx math instead of Computer
                                         % Modern (has noticeable issues)
% \documentclass[mathfont=newtx]{uai2024} % newtx fonts (improves upon
                                          % ptmx; less tested, no support)
% NOTE: Only keep *one* line above as appropriate, as it will be replaced
%       automatically for papers to be published. Do not make any other
%       change above this note for an accepted version.

%% Choose your variant of English; be consistent
\usepackage[american]{babel}
% \usepackage[british]{babel}

%% Some suggested packages, as needed:
\usepackage{natbib} % has a nice set of citation styles and commands
    \bibliographystyle{plainnat}
    \renewcommand{\bibsection}{\subsubsection*{References}}
\usepackage{mathtools} % amsmath with fixes and additions
% \usepackage{siunitx} % for proper typesetting of numbers and units
\usepackage{booktabs} % commands to create good-looking tables
\usepackage{tikz} % nice language for creating drawings and diagrams

%\usepackage{ulem}

\usepackage{amsmath,amsfonts,amssymb,dsfont,bm,amsthm}
\usepackage{csquotes}

\usepackage{subcaption}
\captionsetup{compatibility=false}


\usepackage{thmtools,thm-restate}


\PassOptionsToPackage{hyphens}{url}
\usepackage{hyperref}
\hypersetup{
 colorlinks,
 citecolor=blue,
 linkcolor=red,
 urlcolor=blue,
 breaklinks=true}

\newtheorem{remark}{Remark}
\newtheorem{definition}{Definition}
\newtheorem*{notation}{Notation}
\newtheorem{property}{Property}
\newtheorem{proposition}{Proposition}
\newtheorem{corollary}{Corollary}
\newtheorem{assumption}{Assumption}
\newtheorem{lemma}{Lemma}[section]
\newtheorem{theorem}{Theorem}
\newtheorem{claim}{Claim}
\newtheorem{example}{Example}[section]
\newtheorem{problem}{Problem}
\newtheorem{theoremV2}{Theorem 2 - Version} \setcounter{theoremV2}{1}

\counterwithin{figure}{section}
\newcommand{\indep}{\perp \!\!\! \perp}
\newcommand{\notindep}{\not\!\perp\!\!\!\perp}
%% Provided macros
% \smaller: Because the class footnote size is essentially LaTeX's \small,
%           redefining \scriptsize, we provide the original \scriptsize
%           using this macro.
%           (Use only sparingly, e.g., in drawings, as it is quite small.)

%% Self-defined macros
\newcommand{\swap}[3][-]{#3#1#2} % just an example

\title{Identifiability of total effects from abstractions of time series\\ causal graphs}

% The standard author block has changed for UAI 2024 to provide
% more space for long author lists and allow for complex affiliations
%
% All author information is automatically removed by the class for the
% anonymous submission version of your paper, so you can already add your
% information below.
%
% Add authors
\author[1,2]{Charles~K.~Assaad}
\author[3]{Emilie~Devijver}
\author[3]{Eric~Gaussier}
\author[4]{Gregor~G\"ossler}
\author[5]{Anouar~Meynaoui}
% Add affiliations after the authors
\affil[1]{%
    Sorbonne Université, INSERM, Institut Pierre Louis d’Epidémiologie et de Santé Publique\\
    F75012, Paris, France
}
\affil[2]{%
    EasyVista\\
    F38000, Grenoble, France
}
\affil[3]{%
    Univ Grenoble Alpes, CNRS, Grenoble INP, LIG\\
    F38000, Grenoble, France
  }
\affil[4]{%
    Univ. Grenoble Alpes, INRIA, CNRS, Grenoble INP, LIG\\
    F38000, Grenoble, France
  }
\affil[5]{%
    Université of Rennes 2\\
    F35000, Rennes, France
  }
  \begin{document}
\maketitle

\begin{abstract}
We study the problem of identifiability of the total effect of an intervention from observational time series in the situation, common in practice, where one only has access to abstractions of the true causal graph. We consider here two abstractions: the extended summary causal graph, which conflates all lagged causal relations but distinguishes between lagged and instantaneous relations, and the summary causal graph, which does not give any indication about the lag between a cause and its effect. We show that the total effect is always identifiable in extended summary causal graphs and provide sufficient conditions for identifiability in summary causal graphs. We furthermore provide adjustment sets that allow one to estimate the total effect whenever it is identifiable.
\end{abstract}

\section{Introduction}
Over the last century and across numerous disciplines, experimentation has emerged as a potent methodology for estimating without bias the total effect of an intervention on a specific component of a given system \citep{Neyman_1990}. However, experimentation can be costly, unethical or even unfeasible. Both researchers and experts are thus interested in estimating the effect of an intervention directly from observational data. 
This can be done under some assumptions when relying on a complete causal graph \citep{Pearl_2000}, and typically relies on two sequential steps: identifiability and estimation \citep{Pearl_2019seven}.
The identifiability step involves distinguishing cases where a solution is possible and, when it exists, providing an estimand, that is, an expression enabling the estimation of intervention effects from observational data. The subsequent step involves the actual estimation of this estimand from the available data.

\input{figure0}

The identifiability step has received much attention for non-temporal causal graphs \citep{Pearl_1993StatScience,Pearl_1995,Spirtes_2000, Pearl_2000,Shpitser_2008}. For abstractions of causal graphs, \cite{Perkovic_2020} derived necessary and sufficient conditions for identifying total effects in maximally oriented partially directed acyclic graphs, and \cite{Anand_2023} provided necessary and sufficient conditions for directed acyclic graphs in which each vertex represents a cluster of variables, where relationships between clusters of variables are specified but relationships between the variables within a cluster are left unspecified. 

For temporal causal graphs, \citet{Blondel_2016} developed the do-calculus for full-time causal graphs (FTCGs, Figure \ref{fig:example_FTCG}).
However, in dynamic systems, experts have difficulties in building full-time causal graphs, while they can usually build an abstraction of those graphs, such as an extended summary causal graph (ESCG, as in Figure \ref{fig:example_ESCG}), where all lagged causal relations are conflated but lagged and instantaneous relations are clearly distinguished, or a summary causal graph (SCG, as in Figure \ref{fig:example_SCG}), where all temporal information is omitted. Assuming no instantaneous relations, \citet{Eichler_2007} demonstrated that the total effect is identifiable from an ESCG or an SCG, and \cite{Assaad_2023} established identifiability in the presence of instantaneous relations for acyclic SCGs. \cite{Ferreira_2024} addressed the identifiability problem of the direct effect for general SCGs, including cycles and instantaneous relations; however, the identifiability of total effects in this context remains unexplored.

%The identifiability problem has only been solved for abstractions of causal graphs either under the assumption that there is no instantaneous relations~\citep{Eichler_2007} or under an acyclicity assumption~\citep{Assaad_2023}. 

Our main contributions consist in demonstrating, under causal sufficiency, that the total effect is always identifiable when working with an extended summary causal graph and providing sufficient conditions for identifying the total effect when working with a summary causal graph. The main difficulty lies in the fact that these abstractions may represent different full-time causal graphs with potentially different skeletons and orientations. %compatibility wrt to the true underlying probability distribution.
%sponding to on the missing information, and particularly the potential cycles in those abstract graphs, that have not been, to our knowledge,  considered in the literature dealing with abstract graphs. 
 
%Total effects, in time series, take the following form: $P (Y_t=y_t | do (X_{t-\gamma}=x_{t-\gamma}))$, latter denoted $P (y_t | do (x_{t-\gamma}))$ by a slight abuse of notations, where $do(X_{t-\gamma}=x_{t-\gamma})$ represents an intervention (as properly defined in \citet[Assumption 2.3]{Eichler_2007}) on the variable $X$ at time $t-\gamma$, with $\gamma\geq 0$. Here, $Y_t$ corresponds to the response and $do()$ denotes the standard intervention operator \citep{Pearl_2000}. 
%Our main contribution lies in demonstrating when the total effect can be identified when working with two specific types of abstraction known as the extended summary causal graph and the summary causal graph, under causal sufficiency (no hidden confounding). 

The remainder of the paper is structured as follows: Section \ref{sec:notions} introduces the main notions, Section \ref{sec:setup} presents the problem setup, and Sections~\ref{sec:ESCG} and \ref{sec:SCG} present identifiability conditions in ESCGs and SCGs, respectively. Section~\ref{sec:real_app} discusses real applications of our theoretical results, and Section~\ref{sec:conclusion} concludes the paper. 
Omitted proofs can be found in the Supplementary Material.

\section{Preliminaries}
\label{sec:notions}
\paragraph{Graph notions}

For a \emph{directed acyclic graph} $\mathcal{G}$, a \emph{path} from $X$ to $Y$ in $\mathcal{G}$ is a sequence of distinct vertices $\langle X,\ldots, Y\rangle$ in which every pair of successive vertices is adjacent. A \emph{directed path} from $X$ to $Y$ is a path from $X$ to $Y$ in which all edges are directed towards $Y$ in $\mathcal{G}$, that is $X \rightarrow \ldots \rightarrow Y$. A \emph{backdoor path} between $X$ and $Y$ is a path between $X$ and $Y$ with an arrowhead into $X$ in $\mathcal{G}$. If $X\rightarrow Y$, then $X$ is a \emph{parent} of $Y$. If there is a directed path from $X$ to $Y$, then $X$ is an \emph{ancestor} of $Y$, and $Y$ is a \emph{descendant} of $X$. A vertex counts as its own descendant and as its own ancestor. The sets of parents, ancestors and descendants of $X$ in $\mathcal{G}$ are denoted by $\text{Par}(X,\mathcal{G})$, $\text{Anc}(X,\mathcal{G})$ and $\text{Desc}(X,\mathcal{G})$ respectively. 
If a path $\pi$ contains $X \rightarrow Z \leftarrow Y$ as a subpath, then $Z$ is a \emph{collider} on $\pi$. A vertex $Z$ is a \emph{definite non-collider} on a path $\pi$ if the edge $X \leftarrow Z$ or the edge $Z \rightarrow Y$ is on $\pi$. A vertex is of \emph{definite status} on a path if it is a collider, a definite non-collider or an endpoint on the path. A path $\pi$ is of \emph{definite status} if every vertex on $\pi$ is of definite status. A path $\pi$ from $X$ to $Y$ of definite status is \emph{active} given a vertex set $\mathcal{Z}$, with $X,Y \notin \mathcal{Z}$, if every definite non-collider on $\pi$ is not in $\mathcal{Z}$, and every collider on $\pi$ has a descendant in $\mathcal{Z}$. Otherwise, $\mathcal{Z}$ \emph{blocks} $\pi$. By a slight abuse of notation, we denote by $\mathcal{G} \backslash \{Y\}$ the subgraph of $\mathcal{G}$ obtained by removing the vertex $Y$ and its incident edges. Lastly, the \emph{skeleton} of a graph corresponds to all vertices and edges of the graph without considering edge orientations.


For a \emph{directed graph} $\mathcal{G}$, a directed path from $X$ to $Y$ and the edge $Y\rightarrow X$ form a \emph{directed cycle} in $\mathcal{G}$.  A self-loop on $X$ also forms a directed cycle.
We denote by $Cycles(X,\mathcal{G})$ the set of all directed cycles containing $X$ in $\mathcal{G}$, and by $Cycles^>(X,\mathcal{G})$ the subset of $Cycles(X,\mathcal{G})$ with at least two different vertices (i.e., excluding self-loops).
In addition, all notions introduced above for directed acyclic graphs carry over to \emph{directed graphs}, which may contain cycles. To avoid any ambiguity, we make some clarifications. For a \emph{directed graph} $\mathcal{G}$, a \emph{backdoor path} between $X$ and $Y$ is a path between $X$ and $Y$ which starts with either $X\leftarrow$ or $X \leftrightarrows$. A path is blocked by the empty set if there exists a vertex $Z$ such that $\rightarrow Z\leftarrow$ is on the path. Note that this means that $\leftrightarrows Z\leftarrow$, $\rightarrow Z\leftrightarrows$ and $\leftrightarrows Z\leftrightarrows$ are not on the path. For clarity, whenever a path is blocked by the empty set in a directed graph we will say that it is $\sigma$-blocked\footnote{The notion of $\sigma$-blocked path by a set $\mathcal{Z}$ is a generalization of the notion of blocked path by a set $\mathcal{Z}$ (which was introduced for directed acyclic graphs) to directed graphs~\citep{Forre_2017}. These two notions become equivalent when $\mathcal{Z}=\emptyset$. In this paper, we will use the notion of $\sigma$-blocked only when $\mathcal{Z}=\emptyset$.}.
Note that $X \rightleftarrows Y$ and $X \leftarrow Y$ are the only $\sigma$-active backdoor paths of size $2$ in $\mathcal{G}$. 


If each vertex in a directed acyclic graph corresponds to an observed variable then, given an ordered pair of vertices $(X, Y)$ in $\mathcal{G}$, a set of vertices $\mathcal{Z}$ satisfies the \emph{standard backdoor criterion} relative to $(X, Y)$ if no vertex in $\mathcal{Z}$ is a descendant of $X$, and $\mathcal{Z}$ blocks every backdoor path between $X$ and $Y$.
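
As a small illustration of the criterion, the following example checks three candidate sets on a hypothetical graph, chosen only for this purpose and not taken from the figures of this paper.
\begin{example}
Consider the directed acyclic graph with edges $Z \rightarrow X$, $Z \rightarrow Y$, $X \rightarrow M$ and $M \rightarrow Y$. The only backdoor path between $X$ and $Y$ is $X \leftarrow Z \rightarrow Y$, on which $Z$ is a definite non-collider.
\begin{itemize}
\item $\mathcal{Z}=\{Z\}$ satisfies the standard backdoor criterion relative to $(X,Y)$: $Z$ is not a descendant of $X$, and $\{Z\}$ blocks $X \leftarrow Z \rightarrow Y$.
\item $\mathcal{Z}=\emptyset$ does not satisfy it: the path $X \leftarrow Z \rightarrow Y$ contains no collider and its only definite non-collider is not in $\mathcal{Z}$, so the path is active.
\item $\mathcal{Z}=\{M\}$ does not satisfy it either: $M$ is a descendant of $X$, and $\{M\}$ leaves $X \leftarrow Z \rightarrow Y$ active.
\end{itemize}
\end{example}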

% SCGs are the most abstract version of causal graphs considered in this paper. Due to the potential cycles, we use the notion of $\sigma$-blocked path introduced in \citet{Forre_2017}, considered here in the context of causal sufficiency. 

% \begin{definition}[$\sigma$-blocked path with no hidden confounding] 
% \label{Def:Sigma-blocked}
%  Consider an SCG $\mathcal{G}^s=(\mathcal{V}^s,\mathcal{E}^s)$. A path $\pi = \langle X, \cdots, Y \rangle$ in $\mathcal{G}^s$ is \emph{$\sigma$-blocked} by a set $\mathcal{Z}\subseteq \mathcal{V}^s$ if:
%  \begin{itemize}
%  \item at least one of the endpoints $X,Y$ is in $\mathcal{Z}$; or
%  \item there exists a vertex $W\in\pi$, such that 
%  $W \not \in Anc(\mathcal{Z}, \mathcal{G}^s)$ with two adjacent edges in $\pi$ that form a collider at $W$, (i.e., $\rightarrow W \leftarrow$); or
%  \item there exists a vertex $Z \in \pi \cap \mathcal{Z}$ with two adjacent edges in $\pi$ that forms a non-collider at $Z$, and such that there exists a directed edge pointing from $Z$ to a vertex not in the strongly connected component $Anc(Z, \mathcal{G}^s) \cap Desc(Z,\mathcal{G}^s)$.
%   \end{itemize}
%  If none of the above holds then the path $\langle X, \cdots, Y \rangle$ is \emph{$\sigma$-active} by $\mathcal{Z}$. 
% \end{definition}
% %
% Note that a backdoor path can start with the edge $\rightleftarrows$, and particularly $X \rightleftarrows Y$ is the only $\sigma$-active backdoor path of size 2 in $\mathcal{G}^s$ with $\mathcal{Z}\subseteq \mathcal{V}^s\backslash\{X, Y\}$. 

%
\paragraph{Causal graphs in time series}
Consider $\mathcal{V}$ a set of $p$ observational time series and $\mathcal{V}^f=\{\mathcal{V}_{t-\ell} | \ell \in \mathbb{Z}\}$ the set of temporal instances of $\mathcal{V}$, where $\mathcal{V}_{t-\ell}$ corresponds to the variables of the time series at time $t-\ell$. 
%
We suppose that the time series are generated from an \emph{unknown} dynamic structural causal model (DSCM, \citet{DSCM}), an extension of structural causal models (SCM, \citet{Pearl_2000}) to time series. 
This DSCM defines a full-time causal graph (FTCG, see below) which we call the \emph{true} FTCG~\citep{Runge_2019,Runge_2021,Assaad_2022journal} and a joint distribution $P$ over its vertices which we call the \emph{true} probability distribution.

%The graph that is used to qualitatively represent causal relations described in a DSCM is known as the full-time causal graph (FTCG) which we call the \emph{true} FTCG. 
%
\begin{definition}[Full-time causal graph (FTCG), Figure~\ref{fig:example_FTCG}]
Let $\mathcal{V}$ be a set of $p$ observational time series and $\mathcal{V}^f=\{\mathcal{V}_{t-\ell} | \ell \in \mathbb{Z}\}$. The \emph{full-time causal graph} (FTCG) $\mathcal{G}^f=(\mathcal{V}^f, \mathcal{E}^f)$ representing a given DSCM is defined by: 
%set of time series for vertices $\mathcal{V}^f$ and the set of edges $\mathcal{E}^f$ defined as follows: 
$X_{t-\gamma} \rightarrow Y_t \in \mathcal{E}^f$ if and only if $X$ directly causes $Y$ at time $t$ with a time lag of $\gamma>0$ if $X=Y$ and with a time lag of $\gamma \geq 0$ for $X \neq Y$. 
\end{definition} 

As is common in causality studies on time series, we consider in the remainder acyclic FTCGs with potential instantaneous causal relations. Note that acyclicity is guaranteed for relations between variables at different time stamps. In addition, note that for any time series $X$ and any $i>0$, $X_{t-i}$ can cause $X_t$; for example, the stock price yesterday can affect the stock price today. We furthermore assume causal sufficiency:
%
\begin{assumption}[Causal sufficiency] \label{ass:cs}
    There is no hidden common cause between any two observed variables.
\end{assumption}

In practice, it is usually impossible to work with FTCGs and people have resorted to simpler causal graphs, exploiting the fact that causal relations between time series hold throughout time, as formalized in the following assumption which allows one to focus on a finite number of past slices, given by the maximum lag. We fix it to $\gamma_{\max}$ in the remainder.

\begin{assumption}[Consistency throughout time]
	\label{ass:Consistency_Time}
 All the causal relationships in the FTCG $\mathcal{G}^f$ remain constant in direction and magnitude throughout time\footnote{In our context we consider a dynamic system with several univariate observational time series; the problem of finding a unique total effect would thus be ill-posed if Assumption~\ref{ass:Consistency_Time} were not satisfied, since violating the assumption would mean that the total effect changes over time.
}.
\end{assumption}



Experts are used to working with abstractions of causal graphs which summarize the information into a smaller graph that is interpretable, often with the omission of precise temporal information. We consider in this study two known causal abstractions for time series, namely \textit{extended summary causal graphs} and \textit{summary causal graphs}. An extended summary causal graph \citep{Assaad_2022uai} distinguishes between past time slices, denoted as $ \mathcal{V}^e_{t^-}$, and present time slices, denoted as $ \mathcal{V}^e_{t}$, thus enabling the differentiation between lagged and instantaneous causal relations. 
%
\begin{definition}[Extended summary causal graph (ESCG), Figure~\ref{fig:example_ESCG}]
	\label{Ext_Summary_G}
	Let $\mathcal{G}^f=(\mathcal{V}^f,\mathcal{E}^f)$ be an FTCG built from the set of time series $\mathcal{V}$ satisfying Assumption \ref{ass:Consistency_Time} with maximal temporal lag $\gamma_{\max}$. The \emph{extended summary causal graph} (ESCG) $\mathcal{G}^{e}=(\mathcal{V}^{e},\mathcal{E}^{e})$ associated to $\mathcal{G}^f$ is given by $\mathcal{V}^{e}=( \mathcal{V}^e_{t^-},\mathcal{V}^e_t)$ and $\mathcal{E}^{e}$ defined as follows:
	\begin{itemize}
\item for any $X$ in $\mathcal{V}$, we define two vertices, $X_{t^-}$ and $X_t$, respectively in $\mathcal{V}^e_{t^-}$ and $\mathcal{V}^e_t$; 
\item for all $X_t, Y_t \in \mathcal{V}_t^{e}$, $X_{t} \rightarrow Y_{t} \in \mathcal{E}^{e}$ if and only if $X_{t} \rightarrow Y_{t} \in \mathcal{E}^f$;
\item for all $X_{t^-} \in \mathcal{V}_{t^-}^e$ and $Y_t \in \mathcal{V}_{t}^e$, $X_{t^-} \rightarrow Y_{t} \in \mathcal{E}^{e}$ if and only if there exists at least one temporal lag $0<\gamma\leq \gamma_{\max}$ such that $X_{t-\gamma} \rightarrow Y_t \in \mathcal{E}^f$.
	\end{itemize}
	In that case, we say that $\mathcal{G}^e$ is \emph{derived from} $\mathcal{G}^{f}$.
\end{definition}
%
At a higher level of abstraction, a summary causal graph \citep{Peters_2013, Meng_2020, Assaad_2022journal,Assaad_2022survey} represents causal relationships among time series, regardless of the time delay between the cause and its effect. 
\begin{definition}[Summary causal graph (SCG), Figure~\ref{fig:example_SCG}]
	\label{Summary_G}
	Let $\mathcal{G}^f=(\mathcal{V}^f,\mathcal{E}^f)$ be an FTCG built from the set of time series $\mathcal{V}$ satisfying Assumption \ref{ass:Consistency_Time} with maximal temporal lag $\gamma_{\max}$. The  \emph{summary causal graph} (SCG) $\mathcal{G}^s=(\mathcal{V}^s, \mathcal{E}^s)$ associated to $\mathcal{G}^f$ is such that 
 \begin{itemize}
 \item $\mathcal{V}^s$ corresponds to the set of time series $\mathcal{V}$,
 \item $X \rightarrow Y \in \mathcal{E}^s$ if and only if there exists at least one temporal lag $0\leq \gamma\leq \gamma_{\max}$ such that $X_{t-\gamma} \rightarrow Y_t \in \mathcal{E}^f$.
 \end{itemize} 
In that case, we say that $\mathcal{G}^s$ is \emph{derived from} $\mathcal{G}^f$ as well as from the ESCG derived from $\mathcal{G}^f$. 
\end{definition}
%
Since an FTCG is assumed to be a directed acyclic graph, an ESCG is inherently a directed acyclic graph. In contrast, an SCG is a directed graph as it may include directed cycles and even self-loops. For example, the three FTCGs in Figure \ref{fig:example_FTCG} and the three ESCGs in Figure  \ref{fig:example_ESCG} are acyclic, while the SCG in Figure \ref{fig:example_SCG} has a cycle. 
We use the notation $X \rightleftarrows Y$ to indicate situations where there are time lags at which $X$ causes $Y$ and time lags at which $Y$ causes $X$. Additionally, an SCG is an abstraction of an ESCG and, in cases where there is no instantaneous relation, ESCGs and SCGs convey the same information.

It is worth noting that, while there is a single ESCG or SCG derived from a given FTCG, different FTCGs, with possibly different orientations and skeletons, can yield the same ESCG or SCG. 
For example, the SCG in Figure \ref{fig:example_SCG}  can be derived from any FTCG and any ESCG in Figures \ref{fig:example_FTCG} and \ref{fig:example_ESCG}, even though they may have different skeletons (for example, $\mathcal{G}_{1}^f$ and $\mathcal{G}_{3}^f$ or $\mathcal{G}_{1}^e$ and $\mathcal{G}_{3}^e$) and different orientations (for example, $\mathcal{G}_{1}^f$ and $\mathcal{G}_{2}^f$ or $\mathcal{G}_{1}^e$ and $\mathcal{G}_{2}^e$).
Therefore, even if each vertex in an FTCG is assumed to represent a single observed variable, a vertex in the past slice of an ESCG represents a set of variables while a vertex in the present time slice represents a single variable, and a vertex in the SCG corresponds to a time series.
In the remainder, for a given ESCG or SCG $\mathcal{G}$, we call any FTCG from which $\mathcal{G}$ can be derived a \textit{candidate FTCG} for $\mathcal{G}$. For example, in Figure \ref{fig:example}, $\mathcal{G}_{1}^f$, $\mathcal{G}_{2}^f$ and $\mathcal{G}_{3}^f$ are all candidate FTCGs for $\mathcal{G}^s$. The set of all candidate FTCGs for $\mathcal{G}$ is denoted by $\mathcal{C}(\mathcal{G})$.
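
To make the interplay between the three graph types concrete, the following example works through two hypothetical FTCGs over two time series. They are chosen for illustration only and are not the graphs shown in the figures.
\begin{example}
Let $\mathcal{V}=\{X,Y\}$ and $\gamma_{\max}=1$. Consider the FTCG $\mathcal{G}^f_a$ whose edges are, for all $t$,
\[
X_{t-1}\rightarrow X_t, \quad Y_{t-1}\rightarrow Y_t, \quad X_{t-1}\rightarrow Y_t, \quad Y_{t-1}\rightarrow X_t,
\]
and the FTCG $\mathcal{G}^f_b$ obtained from $\mathcal{G}^f_a$ by replacing $X_{t-1}\rightarrow Y_t$ with the instantaneous edge $X_t\rightarrow Y_t$. Both graphs are acyclic.
The ESCG derived from $\mathcal{G}^f_a$ has the edges $X_{t^-}\rightarrow X_t$, $Y_{t^-}\rightarrow Y_t$, $X_{t^-}\rightarrow Y_t$ and $Y_{t^-}\rightarrow X_t$, whereas the ESCG derived from $\mathcal{G}^f_b$ has the edges $X_{t^-}\rightarrow X_t$, $Y_{t^-}\rightarrow Y_t$, $Y_{t^-}\rightarrow X_t$ and $X_t\rightarrow Y_t$. The two ESCGs therefore differ.
In contrast, both FTCGs yield the same SCG, with self-loops on $X$ and $Y$ and with $X \rightleftarrows Y$: the edge $X\rightarrow Y$ comes from lag $\gamma=1$ in $\mathcal{G}^f_a$ and from lag $\gamma=0$ in $\mathcal{G}^f_b$. Hence $\mathcal{G}^f_a$ and $\mathcal{G}^f_b$ are both candidate FTCGs for this SCG, although their skeletons differ: $X_{t-1}$ and $Y_t$ are adjacent only in $\mathcal{G}^f_a$, and $X_t$ and $Y_t$ are adjacent only in $\mathcal{G}^f_b$.
\end{example}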


\section{Problem setup}
\label{sec:setup}

We focus in this paper on the \emph{total effect}~\citep{Pearl_2000} of the \emph{singleton} variable $X_{t-\gamma}$ on the \emph{singleton} variable $Y_t$, written $P (Y_t=y_t | do (X_{t-\gamma}=x_{t-\gamma}))$ (as well as $P (y_t | do (x_{t-\gamma}))$ by a slight abuse of notation), \emph{when the only knowledge one has of the underlying DSCM consists in the ESCG or SCG derived from the unknown, true FTCG}. $Y_t$ corresponds to the response and $do(X_{t-\gamma}=x_{t-\gamma})$ represents an intervention (as defined in \citet{Pearl_2000} and \citet[Assumption 2.3]{Eichler_2007}) on the variable $X$ at time $t-\gamma$, with $\gamma\geq 0$. 

The above setting is very common in practice and entails that one knows neither the true FTCG nor the true probability distribution. Furthermore, even if one has access to observed data, in practice such observations are finite, which prevents one from discovering the true FTCG, and even from detecting it in the set of candidate FTCGs, as no existing causal discovery method is guaranteed to yield the true FTCG in the finite data setting \citep{Ait_Bachir_2023}. In the purely theoretical context of infinite data, discovering the true FTCG is only possible with additional assumptions, which are beyond the scope of this study~\citep{Assaad_2022survey}. 

Each candidate FTCG proposes a particular decomposition of the true joint probability distribution which is given by the standard recursive decomposition that characterizes Bayesian networks. 
%
%\begin{assumption}[Markov condition] %\label{ass:cmc}
%    Every vertex in an FTCG is conditionally independent of its non-descendants, given its parents.
%\end{assumption}
%
Not all decompositions are however correct with respect to the true probability distribution $P$. % We meet there the notion of compatibility, which we restrict here to $P$ and candidate FTCGs.
% %
% \begin{definition}[Compatibility with $P$, \citet{Pearl_2000}]
% If the probability distribution $P$ admits the decomposition of a candidate FTCG $\mathcal{G}^f$, we say that $\mathcal{G}^f$ is compatible with $P$.
% \end{definition}
% %
% Note that only the (unknown) true FTCG is guaranteed to be compatible with $P$. 

%\subsection{Identifiability, summary back-door}

In general, a total effect $P(y_t \mid do(x_{t-\gamma}))$ is said to be identifiable from a graph if it can be uniquely computed with a do-free formula from the observed distribution
\citep{Pearl_1995,Perkovic_2020}. In our context, this means that the same do-free formula should hold in all candidate FTCGs so as to guarantee that it holds for the true one.
%
\begin{definition}[Identifiability of total effects in ESCGs and SCGs] In a given ESCG or SCG $\mathcal{G}$, $P(y_t \mid do(x_{t-\gamma}))$ is \emph{identifiable} iff it can be rewritten with a do-free formula that is valid for any FTCG in $\mathcal{C}(\mathcal{G})$. %compatible with the given ESCG or SCG.
\end{definition}
%
One way to rewrite $P(y_t \mid do(x_{t-\gamma}))$ with a do-free formula is by finding an adjustment set of variables for which:
\begin{equation}
\label{eq:adjustment_formula}
P(y_t | do(x_{t-\gamma})) = \sum_\mathbf{z} P(y_t|x_{t-\gamma}, \mathbf{z}) P(\mathbf{z}).    
\end{equation}
Whenever a set of variables satisfies Equation~\eqref{eq:adjustment_formula}, we call it a \emph{valid adjustment set}.
%do-free formulas are typically obtained through an adjustment set which contains variables that are sufficient  to adjust for computing the total effect from $x_{t-\gamma}$ to $y_t$. 
The standard backdoor criterion, introduced in \cite{Pearl_1995}, allows one to obtain valid adjustment sets using the true FTCG. 
%We introduce here a new backdoor criterion specific to ESCGs and SCGs which we call the \emph{backdoor criterion over all possible FTCGs}. 
We provide here another version of the backdoor criterion that allows us to find a valid adjustment set given all candidate FTCGs without knowing which one is the true FTCG.
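
As a purely numerical illustration of Equation~\eqref{eq:adjustment_formula}, the following example uses hypothetical probabilities that do not come from any data set or from the figures.
\begin{example}
Suppose that a valid adjustment set consists of a single binary variable $Z$ with $P(Z=1)=0.4$, and that $P(y_t \mid x_{t-\gamma}, Z=0)=0.2$ and $P(y_t \mid x_{t-\gamma}, Z=1)=0.7$. Equation~\eqref{eq:adjustment_formula} then gives
\[
P(y_t \mid do(x_{t-\gamma})) = 0.2 \times 0.6 + 0.7 \times 0.4 = 0.40.
\]
By contrast, the observational conditional probability $P(y_t \mid x_{t-\gamma}) = \sum_{z} P(y_t \mid x_{t-\gamma}, z) P(z \mid x_{t-\gamma})$ weights each stratum by $P(z \mid x_{t-\gamma})$ instead of $P(z)$. If, for instance, $P(Z=1 \mid x_{t-\gamma})=0.8$, then
\[
P(y_t \mid x_{t-\gamma}) = 0.2 \times 0.2 + 0.7 \times 0.8 = 0.60,
\]
so that, in this example, conditioning alone would overstate the effect of the intervention.
\end{example}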

\begin{definition}[Backdoor criterion over all candidate FTCGs]
Let $\mathcal{G} = (\mathcal{V},\mathcal{E})$ be an ESCG or SCG. % and $\mathcal{G}^f = (\mathcal{V}^f,\mathcal{E}^f)$ an FTCG in $\mathcal{C}(\mathcal{G})$. 
A set of vertices $\mathcal{Z}$ % in $\mathcal{V}^f$
satisfies the \emph{backdoor criterion over all candidate FTCGs} relative to $(X_{t-\gamma},Y_t)$ if %\textcolor{green}{$(X_{t-\gamma}, Y_t)$??}
 \begin{itemize}
 \item[(i)] $\mathcal{Z}$ blocks all backdoor paths between $X_{t-\gamma}$ and $Y_t$ in any FTCG in $\mathcal{C}(\mathcal{G})$,
 \item[(ii)] $\mathcal{Z}$ does not block any directed path between $X_{t-\gamma}$ and $Y_t$ in any FTCG in $\mathcal{C}(\mathcal{G})$,
 \item[(iii)] $\mathcal{Z}$ does not contain any descendant of $X_{t-\gamma}$ in any FTCG in $\mathcal{C}(\mathcal{G})$.
 \end{itemize}
\end{definition}
%
Note that when there is no backdoor path between $X_{t-\gamma}$ and $Y_t$ in any FTCG in $\mathcal{C}(\mathcal{G})$, $\mathcal{Z}=\emptyset$ satisfies the backdoor criterion over all candidate FTCGs. 

The backdoor criterion over all candidate FTCGs is sound for  the identification of the total effect $P(y_t|do(x_{t-\gamma}))$ in an ESCG or SCG, as stated in the following corollary that can be deduced from \cite[Theorem~1]{Pearl_1995}.

\begin{restatable}{corollary}{mypropositioninit}
Let $X$ and $Y$ be distinct vertices in an ESCG or SCG $\mathcal{G}$ of a DSCM with true (unknown) probability distribution $P$. Under Assumptions \ref{ass:cs} and \ref{ass:Consistency_Time} for $\mathcal{G}$, if there exists a set $\mathcal{Z}$ satisfying the backdoor criterion over all candidate FTCGs relative to $(X_{t-\gamma},Y_t)$, then the total effect of $X_{t-\gamma}$ on $Y_t$ is identifiable in $\mathcal{G}$, and 
%$$ P(y_t | do(x_{t-\gamma})) = \sum_\mathbf{z} P(y_t|x_{t-\gamma}, \mathbf{z}) P(\mathbf{z}). $$
$\mathcal{Z}$ is a valid adjustment set for the formula given in Equation \eqref{eq:adjustment_formula}. 
\end{restatable}
%

However, enumerating all candidate FTCGs is computationally expensive \citep{Robinson_1977}, even when considering the constraints given by an ESCG or an SCG.

Formally, we address the following technical problem:
\begin{problem}
    Consider an ESCG or an SCG $\mathcal{G}$ and the total effect $P(y_t | do(x_{t-\gamma}))$. We aim to find conditions under which $P(y_t | do(x_{t-\gamma}))$ is identifiable when having access solely to $\mathcal{G}$, without enumerating all candidate FTCGs in $\mathcal{C}(\mathcal{G})$.
\end{problem}


\noindent \textbf{Remarks}
\begin{enumerate}
%\item \textcolor{red}{Charles n'a pas validé} Note that if the response or the intervention are non-singleton sets, the backdoor criterion over all candidate FTCGs may fail to identify the total effect. This is due to the fact that orientation (and thus causal ordering) may be different in different candidate FTCGs, similarly to \cite{Perkovic_2020}. 
%
\item 
Our context is different from the one considered in \citet{Perkovic_2020} since the graphs we have to consider for a given ESCG or SCG, namely the candidate FTCGs, may have different skeletons and may not all be compatible with the true underlying distribution. Furthermore, in ESCGs and SCGs, each vertex does not necessarily correspond to a single variable. 
%This contrasts with the graphs associated with CPDAGs and MPDAGs %\footnote{PDAG: partially directed acyclic graph; CPDAG: completed PDAG; MPDAG: maximally oriented PDAG.} 
%studied in particular in \citet{Perkovic_2020}. 
%This also explains why we need a specific assumption on the type of distributions on which completeness of the summary backdoor criterion can be established\footnote{Note that the summary backdoor criterion is sound for any type of distributions.}. 

\item Our context is different from the one considered in \citet{Anand_2023}. They consider clusters of variables, even for the response and the intervention variable, while we are interested in the total effect $P(y_t\mid do(x_{t-\gamma}))$ where the response variable and the intervention variable are singletons. 
Furthermore, we may have cycles in the SCGs, while they assume acyclic graphs. 

\item The cycles that we consider in this work, namely in SCGs,  do not hold the same conceptual meaning as the cycles considered in \cite{Bongers_2021}, as in our case, cyclicity comes from the abstraction of an acyclic graph. %only because of the partial specification of the graph.

\end{enumerate}



%\input{figure0bis}

\section{Identifiability in ESCG}
\label{sec:ESCG}

The total effect is always identifiable by adjustment in ESCGs, as stated in the following theorem.
%
\begin{restatable}{theorem}{mytheoremtwo}{(Identifiability in ESCG)}
\label{Identification_Ges}
Consider an ESCG $\mathcal{G}^{e}$. Under Assumptions \ref{ass:cs} and \ref{ass:Consistency_Time} for $\mathcal{G}^e$, the total effect $P(y_t | do(x_{t - \gamma}))$  is identifiable in $\mathcal{G}^{e}$ for any $\gamma \geq 0$. Furthermore, the set \begin{align*}
 \mathcal{B}_{\gamma} =& \{ (Z_{t-\gamma-\ell})_{1\leq \ell \leq \gamma_{\max}} | Z_{t^-} \in Par(X_t, \mathcal{G}^{e})\}\\
 &\cup \{Z_{t-\gamma} | Z_{t} \in Par(X_t, \mathcal{G}^{e}) \},
\end{align*}
is a valid adjustment set for $P(y_t | do(x_{t - \gamma}))$ when used in the adjustment formula given in Equation~\eqref{eq:adjustment_formula}. 
\end{restatable}
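As a purely illustrative instantiation (the graph considered here is hypothetical and is not one of the figures of this paper), suppose that $Par(X_t, \mathcal{G}^{e}) = \{X_{t^-}, Z_{t^-}, W_t\}$, with $\gamma_{\max} = 2$ and $\gamma = 1$. Reading off the definition of $\mathcal{B}_{\gamma}$, the lagged parents $X_{t^-}$ and $Z_{t^-}$ contribute their instances at times $t-2$ and $t-3$, while the instantaneous parent $W_t$ contributes its instance at time $t-1$:
\begin{equation*}
 \mathcal{B}_{1} = \{X_{t-2}, X_{t-3}, Z_{t-2}, Z_{t-3}\} \cup \{W_{t-1}\}.
\end{equation*}
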
%
%
Although $\mathcal{B}_{\gamma}$ is a valid adjustment set, it may be very large. Additional adjustment sets, potentially smaller than $\mathcal{B}_{\gamma}$, can however be obtained from the densest candidate FTCG, that is, the candidate FTCG that contains all potential edges and is thus maximal in the number of edges. 
%, as stated in the following proposition.
%s every total effect is identifiable in ESCG, it is relevant to determine an adjustment set. The set, though this set is very large. We consider the densest compatible FTCG, defined as follows.
%\begin{definition}
%    For an ESCG $\mathcal{G}^{e}$ and a maximal lag $\gamma_{\max}$, the \emph{densest compatible FTCG} $\mathcal{G}^{f}_d$ is the FTCG compatible with $\mathcal{G}^{e}$ with all potential edges. \textcolor{gray}{maximal au sein de l'inclusion d'aretes. Enlever la definition ?}
%\end{definition}
%The following proposition gives a range of adjustment sets, which are potentially smaller. 

\begin{restatable}{proposition}{mypropositionone}%{(backdoor Criterion for ESCG)}
\label{backdoor_Ges}
Consider an ESCG $\mathcal{G}^{e}$ and a maximal lag $\gamma_{\max}$ and let $\gamma \geq 0$. Any adjustment set $\mathcal{B}_{\gamma}'$ for the total effect $P(y_t | do(x_{t - \gamma}))$ that satisfies the standard backdoor criterion on the densest candidate FTCG in $\mathcal{C}(\mathcal{G}^{e})$ is a valid adjustment set for the total effect. 
% and 
% $$P(y_t | do(x_{t - \gamma})) = \sum_{\mathbf{b} \in \mathcal{B}_{\gamma}'} P(\mathbf{b}) P(y_t | x_{t-\gamma},\mathbf{b}).$$ 
In addition, $\mathcal{B}_{\gamma}$ is a valid adjustment set with respect to the standard backdoor criterion on the densest candidate FTCG.
\end{restatable}
%
Note however that smaller (in the number of variables) adjustment sets may exist in the true FTCG when it is different from the densest candidate FTCG.

%\textcolor{red}{Moreover,  this adjustment set $\mathcal{B}_{\gamma}'$ is distinct from the standard backdoor criterion for the total effect $P(y_t | do(x_{t - \gamma}))$ as applied to the true FTCG, which we do not know}. 

\section{Identifiability in SCG}
\label{sec:SCG}

%In this condition, the only $\sigma$-active backdoor path from $X$ to $Y$ with $n=2$ is $X \leftrightarrows Y$ as the path is both active and a backdoor path. In fact, the two propositions ''there exists a $\sigma$-active backdoor path from $X$ to $Y$ with $n=2$'' and ''$X \leftrightarrows Y$'' are equivalent.

In this section, we first present the main result of the paper, which provides sufficient conditions for identifying the total effect using only an SCG, together with an adjustment set that can be used whenever these conditions are satisfied. We then provide another adjustment set that is more suitable in practice. Finally, we discuss several examples where the total effect is not identifiable using an SCG. 


%\textcolor{red}{motivation pourquoi suffisantes : Nous ne connaissons pas de preuve que le critere backdoor est complet dans la classe des candidats FTCG et donc nous exhibons ici seulement des conditions suffisantes pour avoir un backdoor commun et donc l'identifiabilité}

Note that we are only considering sufficient conditions because the backdoor criterion is not complete, meaning it does not provide all possible valid adjustment sets. Therefore, the backdoor criterion over all candidate FTCGs is not necessarily complete.


\subsection{Main result: sufficient conditions for identifiability}\label{sec:suff}

We provide sufficient conditions\footnote{In the Supplementary Material, we provide an equivalent version of Theorem~\ref{Thm:identification_summary} which might be easier to read for some readers.} for identifiability in SCGs. Recall that $Cycles(X,\mathcal{G}^s)$ is the set of all directed cycles containing $X$ in $\mathcal{G}^s$, and $Cycles^>(X,\mathcal{G}^s)$ is the subset of cycles that contain at least 2 different vertices.% (i.e., excluding self-loops). %, as introduced in Section \ref{sec:notions}.
%


\begin{restatable}{theorem}{mytheoremthree}{(Identifiability in SCG)}
\label{Thm:identification_summary}
Consider an SCG $\mathcal{G}^s=(\mathcal{V}^s,\mathcal{E}^s)$ associated with a DSCM with true (unknown) probability distribution $P$. Under Assumptions \ref{ass:cs} and \ref{ass:Consistency_Time}, the total effect $P(y_t | do(x_{t - \gamma}))$, with $\gamma \geq 0$, is identifiable if either $X \notin Anc(Y,\mathcal{G}^s)$, or $X \in Anc(Y,\mathcal{G}^s)$ and none of the following holds: 
 \begin{enumerate}
 \item $\gamma \ne 0$ and $Cycles^>(X, \mathcal{G}^s\backslash \{Y\}) \neq \emptyset$, or
 \item there exists a $\sigma$-active backdoor path 
 $$\pi^s = \langle V^1=X, \cdots, V^n=Y\rangle$$ from $X$ to $Y$ in $\mathcal{G}^s$ such that $\langle V^2, \cdots, V^{n-1}\rangle\subseteq Desc(X, \mathcal{G}^s)$ and one of the following holds:
 \begin{enumerate}
 \item $n > 2$, i.e. $\langle V^2, \cdots, V^{n-1}\rangle\ne\emptyset$, or
 \item $n=2$ and $\gamma\ne 1$, or
 \item $n=2$, $\gamma = 1$ and $Cycles(Y, \mathcal{G}^s\backslash \{X\}) \neq \emptyset$. 
 \end{enumerate}
\end{enumerate}
\end{restatable}
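
To illustrate how these conditions are read (on a hypothetical graph that is not one of the figures of this paper), consider the SCG with vertices $\{X, Y\}$ and edges $X \leftrightarrows Y$, without self-loops. Here $X \in Anc(Y,\mathcal{G}^s)$. Condition 1 does not hold for any $\gamma$, since $\mathcal{G}^s\backslash \{Y\}$ contains only the vertex $X$ and thus $Cycles^>(X, \mathcal{G}^s\backslash \{Y\}) = \emptyset$. The path $\langle X, Y\rangle$ going through the edge $X \leftarrow Y$ is a $\sigma$-active backdoor path with $n=2$, so only Conditions 2(b) and 2(c) are relevant:
\begin{align*}
 \gamma = 1:&\quad \text{2(b) does not hold, and 2(c) does not hold since } Cycles(Y, \mathcal{G}^s\backslash \{X\}) = \emptyset,\\
 \gamma \neq 1:&\quad \text{2(b) holds.}
\end{align*}
On this graph, the theorem therefore guarantees the identifiability of $P(y_t | do(x_{t-1}))$, whereas it does not allow one to conclude for $\gamma = 0$ or $\gamma \geq 2$.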


In the remainder, we prove the above theorem through Lemmas~\ref{lemma:1}--\ref{lemma:3}. 
To do so, for the total effect $P(y_t|do(x_{t-\gamma}))$, we consider the following set:
\begin{align}\label{AgammaSCG}
 \mathcal{A}_{\gamma} =& \{ (Z_{t-\gamma-\ell})_{1\leq \ell \leq \gamma_{\max}} | Z \in Desc(X, \mathcal{G}^{s})\}\nonumber\\
 &\cup \{(Z_{t-\gamma-\ell})_{0\leq \ell \leq \gamma_{\max}} | Z \in \mathcal{V}^s\backslash Desc(X, \mathcal{G}^s) \}
\end{align}
%
and we prove that it is a valid adjustment set when the total effect is identifiable. 
As one can note, it contains all possible parents of $X_{t-\gamma}$ in all candidate FTCGs of $\mathcal{G}^s$. Thus, $\mathcal{A}_{\gamma}$ blocks any backdoor path $\pi$ between $X_{t-\gamma}$ and $Y_t$ in any candidate FTCG through the parent of $X_{t-\gamma}$ on that path. %, which is the first vertex on the path from $X_{t-\gamma}$ to $Y_{t}$) which cannot be a collider. 
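
As a purely illustrative instantiation (on a hypothetical graph that is not one of the figures of this paper), consider the SCG with vertices $\{X, Y, Z\}$ and edges $Z \rightarrow X$ and $X \rightarrow Y$, with $\gamma = 1$ and $\gamma_{\max} = 2$. Assuming, as the definition of $\mathcal{A}_{\gamma}$ suggests, that a vertex is counted among its own descendants, we have $Desc(X, \mathcal{G}^s) = \{X, Y\}$ and $\mathcal{V}^s\backslash Desc(X, \mathcal{G}^s) = \{Z\}$, so that
\begin{equation*}
 \mathcal{A}_{1} = \{X_{t-2}, X_{t-3}, Y_{t-2}, Y_{t-3}\} \cup \{Z_{t-1}, Z_{t-2}, Z_{t-3}\}.
\end{equation*}
Descendants of $X$ enter only through instances strictly before time $t-1$, whereas the non-descendant $Z$ also enters through its instance at time $t-1$.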


\input{figure5}

We first introduce the notion of ambiguous vertices, represented in gray in every figure, that will be useful for the proofs of most of the lemmas.
%
\begin{definition}[Ambiguous vertices]
\label{def:ambiguous_vertices}
  Consider an SCG $\mathcal{G}^s$ and the total effect $P(y_t\mid do(x_{t-\gamma}))$, for $\gamma \geq 0$.
 A vertex $V_{t'}$ belonging to an active backdoor path for $(X_{t-\gamma},Y_t)$ in a candidate FTCG is \emph{ambiguous} if there exists another candidate FTCG in which $V_{t'}$ belongs to a directed path from $X_{t-\gamma}$ to $Y_t$ or in which $V_{t'}$ is a descendant of $Y_t$. 
% is \emph{ambiguous} for $P(y_t\mid do(x_{t-\gamma}))$ if there exist two candidate FTCGs $\mathcal{G}_1^f$ and $\mathcal{G}_2^f$ such that:
% \begin{itemize}
% \item $V_{t'}$ belongs to a backdoor path of $(X_{t-\gamma},Y_t)$ in $\mathcal{G}_1^f$; and
% \item $V_{t'}$ belongs to a directed path from $X_{t-\gamma}$ to $Y_t$ in $\mathcal{G}_2^f$ or $V_{t'}\in Desc(Y_t, \mathcal{G}_2^f)$. 
% \end{itemize}
\end{definition}
%
Ambiguous vertices are crucial for identifiability.
%, as shown in the next property.
In addition to ambiguous vertices, one can also define ambiguous paths, as follows.
%
\begin{definition}[Ambiguous paths]
\label{def:ambiguous_paths}
Consider an SCG $\mathcal{G}^s$ and a candidate FTCG $\mathcal{G}^f$.
 A path $\pi^f \in \mathcal{G}^f$ between $X_{t-\gamma}$ and $Y_t$, for $\gamma \geq 0$, is an \emph{ambiguous path} if it does not contain any vertex at time $t-\gamma - \ell$ for $\ell \geq 1$.
 We denote by $\Pi^f_{\gamma}$ the set of all ambiguous paths in $\mathcal{G}^f$.
\end{definition}
When $\pi$ is not an ambiguous path ($\pi \notin \Pi^f_{\gamma}$), then the parent of $X_{t-\gamma}$ is in the past of $X_{t-\gamma}$ and thus cannot be ambiguous. One thus has the following property:

\begin{restatable}{property}{mypropertytwo}
 \label{property:non_ambiguous_identif}
 Consider an SCG $\mathcal{G}^s$ and the total effect $P(y_t\mid do(x_{t-\gamma}))$, for $\gamma \geq 0$. Suppose $\pi^f$ is a backdoor path between $X_{t-\gamma}$ and $Y_t$ in a candidate FTCG $\mathcal{G}^f$. If $\pi^f \not\in \Pi^f_{\gamma}$, then $\pi^f$ is blocked by a subset of $\mathcal{A}_{\gamma}$ containing at least one non-ambiguous vertex.
\end{restatable}
\begin{example}
    In Figure~\ref{fig:example_non_identifiable_cond2c_FTCG2}, the path $\pi_1^f = \langle X_{t-1}, X_{t-2}, Y_{t-1}, Y_{t} \rangle$ is not an ambiguous path between $X_{t-1}$ and $Y_t$ since $X_{t-2}$ precedes $X_{t-1}$ in time. On the other hand, $\pi_2^f = \langle X_{t-1}, Y_{t-1}, Y_{t} \rangle$ is an ambiguous path between $X_{t-1}$ and $Y_t$. The path $\pi_1^f $ is blocked by $X_{t-2}$. 
\end{example}

We now introduce the notion of compatible path that will allow us to relate backdoor paths in a given SCG and its candidate FTCGs.

\begin{definition}[Compatible path]
 Consider an SCG $\mathcal{G}^s$, a candidate FTCG $\mathcal{G}^f$, and the total effect $P(y_t\mid do(x_{t-\gamma}))$, for $\gamma \geq 0$. 
We say that a path $\pi^f=\langle X_{t-\gamma}, W^2_{t^2}, \cdots, W^{m-1}_{t^{m-1}}, Y_t \rangle$ in $\mathcal{G}^f$ is \emph{compatible} with a path $\pi^s=\langle X, V^2, \cdots, V^{n-1}, Y\rangle$ in $\mathcal{G}^s$ if for all 
$(W^j_{t^j})_{2\leq j \leq m-1}$: either $W^j\in \langle V^2, \cdots, V^{n-1}\rangle$ or $\exists V \in \langle V^2, \cdots, V^{n-1}\rangle$ such that $W^j\in Cycles(V, \mathcal{G}^s)\backslash Cycles(X, \mathcal{G}^s)$.
\end{definition}

The following property relates backdoor paths in a given SCG and in any of its candidate FTCGs. % compatible with the SCG through graphical conditions on the SCG. 
\begin{restatable}{property}{mypropertythree}
\label{property:compatible_paths}
Consider an SCG $\mathcal{G}^s$ and the total effect $P(y_t\mid do(x_{t-\gamma}))$ for $\gamma \geq 0$. Then $(i) \Rightarrow (ii)$, where: 
\begin{itemize}
 \item[(i)] $\gamma = 0$ or $Cycles^>(X, \mathcal{G}^s\backslash \{Y\}) = \emptyset$,
\item[(ii)] in any candidate FTCG $\mathcal{G}^f$, there exists no backdoor path $\pi^f\in \Pi^f_{\gamma}$ %between $X_{t-\gamma}$ and $Y_t$ 
that is not compatible with any backdoor path in $\mathcal{G}^s$.
\end{itemize}
\end{restatable}

 
The two properties above allow one to prove the following lemmas, which establish that each condition of Theorem~\ref{Thm:identification_summary} is sufficient. The first lemma is rather straightforward and concerns the case where $X\not\in Anc(Y,\mathcal{G}^s)$ for a given SCG $\mathcal{G}^s$.

\begin{restatable}{lemma}{mylemmafive}
\label{lemma:1}
Consider an SCG $\mathcal{G}^s$, $\gamma \geq0$ fixed and the total effect $P(y_t\mid do(x_{t-\gamma}))$.
 If $X\not\in Anc(Y,\mathcal{G}^s)$ then $P(y_t\mid do(x_{t-\gamma}))$ is identifiable, and $P(y_t\mid do(x_{t-\gamma})) = P(y_t)$.
\end{restatable}

The following lemma excludes both Conditions 1 and 2 of Theorem~\ref{Thm:identification_summary} by considering the negation of Condition 1 (in (i)) and the situation in which there is no $\sigma$-active backdoor path from $X$ to $Y$ with $\mathcal{Z} = \emptyset$.

\begin{restatable}{lemma}{mylemmasix}
 \label{lemma:2}
 Consider an SCG $\mathcal{G}^s$, $\gamma \geq0$ fixed and the total effect $P(y_t\mid do(x_{t-\gamma}))$.
 If $X\in Anc(Y,\mathcal{G}^s)$ and 
 \begin{itemize}
 \item[(i)] either $\gamma=0$ or $Cycles^>(X, \mathcal{G}^s\backslash \{Y\}) = \emptyset$ and
 \item[(ii)] $\nexists \sigma$-active backdoor path $\pi^s = \langle V^1=X, \cdots, V^n=Y\rangle$ from $X$ to $Y$ in $\mathcal{G}^s$ with $\mathcal{Z} = \emptyset$ such that $\langle V^2, \cdots, V^{n-1}\rangle\subseteq Desc(X, \mathcal{G}^s)$, 
 \end{itemize}
 then $P(y_t\mid do(x_{t-\gamma}))$ is identifiable by $\mathcal{A}_\gamma$.
\end{restatable}
This lemma is illustrated in Figures~\ref{fig:example_maybe_identifiable:SCG_1}--\ref{fig:example_maybe_identifiable:SCG_2}.

When there is a $\sigma$-active backdoor path from $X$ to $Y$ with $\mathcal{Z} = \emptyset$, the negation of Condition 2 of Theorem~\ref{Thm:identification_summary} is obtained with $n=2$, $\gamma =1$ and $Cycles(Y, \mathcal{G}^s\backslash \{X\}) = \emptyset$. The negation of Condition 1 of Theorem~\ref{Thm:identification_summary} is obtained in this setting with $Cycles^>(X, \mathcal{G}^s\backslash \{Y\}) = \emptyset$. Note that, as before, having a $\sigma$-active backdoor path from $X$ to $Y$ with $\mathcal{Z} = \emptyset$ and $n=2$ is equivalent to $X\leftrightarrows Y$.

\begin{restatable}{lemma}{mylemmaseven}
 \label{lemma:3}
 Consider an SCG $\mathcal{G}^s$ and the total effect $P(y_t\mid do(x_{t-1}))$ ($\gamma=1$). 
 If the only $\sigma$-active backdoor path from $X$ to $Y$ in $\mathcal{G}^s$ with $\mathcal{Z} = \emptyset$ is $X\leftrightarrows Y \in \mathcal{G}^s$ and 
 \begin{itemize}
 \item[(i)] $Cycles^>(X, \mathcal{G}^s\backslash \{Y\}) = \emptyset$ and
 \item[(ii)] $Cycles(Y, \mathcal{G}^s\backslash \{X\}) = \emptyset$,
 \end{itemize}
 then $P(y_t\mid do(x_{t-1}))$ is identifiable by $\mathcal{A}_\gamma$.
\end{restatable}
This lemma is illustrated in Figure \ref{fig:example_maybe_identifiable:SCG_3}.

\subsection{Adjustment set}
When the total effect is identifiable and $X\in Anc(Y, \mathcal{G}^s)$, the set $\mathcal{A}_{\gamma}$ defined in Equation \eqref{AgammaSCG} is a valid adjustment set, but it can be large. We therefore provide a smaller valid adjustment set, defined as follows:
\begin{equation*}
 {\mathcal{A}}'_{\gamma} = \{ V_{t'}\in \mathcal{A}_{\gamma} |V \in Anc(X, \mathcal{G}^s) \cup Anc(Y, \mathcal{G}^s)\}.
\end{equation*}
\begin{restatable}{proposition}{mypropositiontwo}
 Consider an SCG $\mathcal{G}^s$ and the total effect $P(y_t\mid do(x_{t-\gamma}))$, with $\gamma \geq 0$. Under conditions of identifiability provided by Theorem~\ref{Thm:identification_summary}, the set $ {\mathcal{A}}'_{\gamma}$ is a valid adjustment set for the total effect. 
\end{restatable}
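
As a purely illustrative instantiation (on a hypothetical graph that is not one of the figures of this paper), consider the SCG with vertices $\{X, Y, Z, W\}$ and edges $Z \rightarrow X$, $X \rightarrow Y$ and $X \rightarrow W$, with $\gamma = 1$ and $\gamma_{\max} = 1$. This graph is acyclic and contains no backdoor path from $X$ to $Y$, so neither Condition 1 nor Condition 2 of Theorem~\ref{Thm:identification_summary} holds. Assuming that a vertex is counted among its own ancestors and descendants, we have $\mathcal{A}_{1} = \{X_{t-2}, Y_{t-2}, W_{t-2}, Z_{t-1}, Z_{t-2}\}$ and $Anc(X, \mathcal{G}^s) \cup Anc(Y, \mathcal{G}^s) = \{X, Y, Z\}$, hence
\begin{equation*}
 {\mathcal{A}}'_{1} = \{X_{t-2}, Y_{t-2}, Z_{t-1}, Z_{t-2}\}.
\end{equation*}
Only $W_{t-2}$ is removed, since $W$ is an ancestor of neither $X$ nor $Y$.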


\subsection{Non-identifiable examples}\label{sec:nec}
%
In this section, we provide several examples of SCGs where the total effect cannot be identified by finding a valid adjustment set.

\input{figure1} 

%This enables us to give necessary conditions. We detail in Lemmas \ref{lemma:1} - \ref{lemma:4} all the cases, and depict them in Figures \ref{fig:example_non_identifiable_2}-\ref{fig:example_identifiable_6}. We present  an SCG and two FTCGs compatible with the SCG that do not share any adjustment set. Ambiguous vertices are highlighted in gray. The proofs of the lemmas follow a similar approach: we provide two FTCGs that are compatible with the SCG, along with associated paths in which all vertices are ambiguous. Thus, by Property \ref{property:ambiguous_non_identif}, the total effect is not identifiable. 



%This lemma is illustrated in Figure \ref{fig:example_non_identifiable_2}  with the SCG $\mathcal{G}^s_1$ and the total effect $P(y_t | do(x_{t - 1}))$.

\begin{example}
\label{example:1}
Consider the SCG in Figure~\ref{fig:example_non_identifiable_2_SCG} and the two candidate FTCGs given in Figures~\ref{fig:example_non_identifiable_2_FTCG1} and~\ref{fig:example_non_identifiable_2_FTCG2}. Suppose we are interested in the total effect $P(y_t\mid do(x_{t-1}))$.
In the first FTCG, depicted in Figure~\ref{fig:example_non_identifiable_2_FTCG1}, the path $\langle X_{t-1}, Z_{t-1}, Y_t \rangle$ is an active backdoor path. Since $Z_{t-1}$ is the only vertex on this path that is not an endpoint, we need to adjust for it to eliminate the confounding bias induced by this path.
However, in the second FTCG, depicted in Figure~\ref{fig:example_non_identifiable_2_FTCG2}, $\langle X_{t-1}, Z_{t-1}, Y_t \rangle$ forms a directed path. This implies that we should not adjust for $Z_{t-1}$, in order to preserve the influence of $X_{t-1}$ on $Y_t$ through the path passing through $Z_{t-1}$.
Since we do not know which FTCG is the true one, we cannot determine whether we should adjust for $Z_{t-1}$ or not. Consequently, there is no valid adjustment set to identify the total effect $P(y_t\mid do(x_{t-1}))$.
\end{example}

\input{figure2}



%This lemma is illustrated in Figure~\ref{fig:example_identifiable_2.2} with the SCG $\mathcal{G}^s_2$ and the total effect $P(y_t | do(x_{t - 1}))$. 

\begin{example}
\label{example:2}
Consider the SCG in Figure~\ref{fig:example_non_identifiable_cond2a_SCG} and the two candidate FTCGs in Figures~\ref{fig:example_non_identifiable_cond2a_FTCG1} and \ref{fig:example_non_identifiable_cond2a_FTCG2}. Suppose we are interested in the total effect $P(y_t\mid do(x_{t-1}))$.  
The path $\langle X_{t-1}, Z_{t-1}, X_t, Y_t \rangle$ is an active backdoor path in the first FTCG, depicted in Figure~\ref{fig:example_non_identifiable_cond2a_FTCG1}. Since $Z_{t-1}$ is the only vertex on this path that is not an endpoint and that does not belong to a directed path in the same graph, we need to adjust for it to eliminate the confounding bias induced by this path.
However, in the second FTCG, depicted in Figure~\ref{fig:example_non_identifiable_cond2a_FTCG2}, $\langle X_{t-1}, Y_{t-1}, Z_{t-1}, X_t, Y_t \rangle$ forms a directed path. This implies that we should not adjust for $Z_{t-1}$, in order to preserve the influence of $X_{t-1}$ on $Y_t$ through the path passing through $Z_{t-1}$.
Since we do not know which FTCG is the true one, we cannot determine whether we should adjust for $Z_{t-1}$ or not. Consequently, there is no valid adjustment set to identify the total effect $P(y_t\mid do(x_{t-1}))$.

\end{example}

\input{figure3}

%In this condition, the only $\sigma$-active backdoor path from $X$ to $Y$ with $n=2$ is $X \leftrightarrows Y$ as the path is both active and a backdoor path. In fact, the two propositions ''there exists a $\sigma$-active backdoor path from $X$ to $Y$ with $n=2$'' and ''$X \leftrightarrows Y$'' are equivalent.



%This lemma is illustrated in Figure \ref{fig:example_non_identifiable_3} with the SCG $\mathcal{G}^s_3$ and the t

\begin{example}
\label{example:3}
Consider the SCG in Figure~\ref{fig:example_non_identifiable_cond2b_SCG} and the two candidate FTCGs in Figures~\ref{fig:example_non_identifiable_cond2b_FTCG1} and \ref{fig:example_non_identifiable_cond2b_FTCG2}. Suppose we are interested in the total effect $P(y_t\mid do(x_{t-2}))$.  
The path $\langle X_{t-2}, Y_{t-2}, X_{t-1}, Y_t \rangle$ is an active backdoor path in the first FTCG, depicted in Figure~\ref{fig:example_non_identifiable_cond2b_FTCG1}. Since $Y_{t-2}$ is the only vertex on this path that is not an endpoint and that does not belong to a directed path in the same graph, we need to adjust for it to eliminate the confounding bias induced by this path.
However, in the second FTCG, depicted in Figure~\ref{fig:example_non_identifiable_cond2b_FTCG2}, $\langle X_{t-2}, Y_{t-2}, X_{t-1}, Y_t \rangle$ forms a directed path. This implies that we should not adjust for $Y_{t-2}$, in order to preserve the influence of $X_{t-2}$ on $Y_t$ through the path passing through $Y_{t-2}$.
Since we do not know which FTCG is the true one, we cannot determine whether we should adjust for $Y_{t-2}$ or not. Consequently, there is no valid adjustment set to identify the total effect $P(y_t\mid do(x_{t-2}))$.
\end{example}




%This lemma is illustrated in Figure \ref{fig:example_identifiable_6} with the SCG $\mathcal{G}^s_4$ and the total effect $P(y_t | do(x_{t - 1}))$.



\begin{example}
\label{example:4}
Consider the SCG in Figure~\ref{fig:example_non_identifiable_cond2c_SCG} and the two candidate FTCGs in Figures~\ref{fig:example_non_identifiable_cond2c_FTCG1} and \ref{fig:example_non_identifiable_cond2c_FTCG2}. Suppose we are interested in the total effect $P(y_t\mid do(x_{t-1}))$.  
The path $\langle X_{t-1}, Y_{t-1}, Y_t \rangle$ is an active backdoor path in the first FTCG, depicted in Figure~\ref{fig:example_non_identifiable_cond2c_FTCG1}. Since $Y_{t-1}$ is the only vertex on this path that is not an endpoint, we need to adjust for it to eliminate the confounding bias induced by this path.
However, in the second FTCG, depicted in Figure~\ref{fig:example_non_identifiable_cond2c_FTCG2}, $\langle X_{t-1}, Y_{t-1}, Y_t \rangle$ forms a directed path. This implies that we should not adjust for $Y_{t-1}$, in order to preserve the influence of $X_{t-1}$ on $Y_t$ through the path passing through $Y_{t-1}$.
Since we do not know which FTCG is the true one, we cannot determine whether we should adjust for $Y_{t-1}$ or not. Consequently, there is no valid adjustment set to identify the total effect $P(y_t\mid do(x_{t-1}))$.
\end{example}

Notice that in Figure~\ref{fig:example_identifiable_6}, removing the self-loop on $Y$ makes the total effect identifiable. This is because the active backdoor path and the directed path discussed in Example \ref{example:4} would no longer exist, leaving only directed paths or blocked (due to a collider) backdoor paths between $X_{t-1}$ and $Y_t$.



\input{figure4}

\section{Discussion on real-world applications}
\label{sec:real_app}




%\textbf{Epidemiology.}
%Understanding the transmission of SARS-CoV-2 from younger to older individuals, and vice versa, can help  define interventions most likely to reduce the number of deaths. Indeed, it has been shown that younger adults tended to be highly infected during the first wave of the pandemic, while older individuals faced a higher risk of death if infected~\citep{Carrat_2021,Lapidus_2021,Glemain_2024}.
%A cyclic relationship representing this interplay is summarized by the SCG in Figure~\ref{fig:real_epi}. Considering sufficiently large time intervals (several weeks), the number of new infections in an age group during a time interval (incidence) may be influenced by incidence in the other age group during the same interval. Incidence in an age group can also be influenced by incidence during the previous time interval in any age group.
%Using Theorem~\ref{Thm:identification_summary}  and assuming no hidden confounding, we can identify the total effect in each direction with a lag of $1$ (if there are confounders that do not form additional cycles, the total effect remains identifiable if we measure them and take them into account in the SCG). Thus, we could use data such as those of repeated serosurveys, like in \cite{Wiegand_2023}
%to estimate the total effect.



\textbf{Nephrology.} 
Hypertension has long been considered a risk factor for kidney function decline. %However, whether intensive blood pressure control could promote kidney health has been uncertain. 
At the same time, the kidney is known to have a major role in affecting blood pressure through sodium extraction and regulating electrolyte balance~\citep{Yu_2020}. This can be represented with the SCG in Figure~\ref{fig:real_nephro} where the kidney function is represented by the creatinine level.
Epidemiologists are interested in knowing whether preventing kidney function decline can reduce the public health burden of hypertension, while nephrologists are interested in knowing how much a treatment related to hypertension can improve the state of the kidney.
Using Theorem~\ref{Thm:identification_summary} and assuming no hidden confounding, we can identify the total effect in each direction with a lag equal to $1$ (if there are confounders that do not form additional cycles, the total effect remains identifiable if we measure them and take them into account in the SCG). We can collect data for estimation by conducting weekly blood tests on patients with kidney insufficiency, especially those whose hypertension and creatinine levels fluctuate.


\textbf{Finance.} 
It has been suggested that there exists a bidirectional causal relationship between the number of unique active wallets associated with bridge protocols and the mean transaction fees within the Ethereum network~\citep{Ante_2024}. Additionally, we consider that mean transaction fees influence their own future values, as depicted in the SCG shown in Figure~\ref{fig:real_eco}. In this scenario, the total effect of mean transaction fees on the number of unique active wallets is identifiable using Theorem~\ref{Thm:identification_summary} with a lag of $1$. However, the same does not hold true for the opposite direction: the total effect of the number of unique active wallets on the mean transaction fees is not identifiable using Theorem~\ref{Thm:identification_summary}.

\textbf{System monitoring.}
Consider a subgraph of the SCG described in \citet{Bystrova_2024}, representing the web activity in an IT system. Suppose that system experts observed a high number of queries at midnight for several weeks, likely due to a Distributed Denial of Service attack. Simultaneously, they noticed that CPU usage at midnight was very high, preventing the system from running some processes. Therefore, the system experts would like to determine (before intervening in the system) how much a reduction in bandwidth in the network would reduce the global CPU usage.
Theorem~\ref{Thm:identification_summary} shows that the total effect of Network input on CPU Global is identifiable for any lag.
In addition, Theorem~\ref{Thm:identification_summary} implies that the total effect between all pairs of variables is identifiable since in the SCG there exist no cycles of size greater than $2$.
We can estimate these total effects using the data introduced in \cite{Bystrova_2024}.
% {Data available \href{https://easyvista2015-my.sharepoint.com/personal/aait-bachir_easyvista_com/_layouts/15/onedrive.aspx?ga=1&id=%2Fpersonal%2Faait%2Dbachir%5Feasyvista%5Fcom%2FDocuments%2FLab%2FPublicData%2FWeb%5FActivity}{here}.}
\input{figure6}

\textbf{Thermoregulation.}
Inspired by the experiment conducted in \cite{Peters_2013}, we consider maintaining a steady temperature in an apartment composed of four rooms: a living room, a kitchen, a bathroom, and an office. The living room is the only room containing a radiator, and all rooms are connected to each other through the living room. Additionally, all rooms contain a window except for the office.
Temperature sensors were placed in the four rooms, plus one outside the apartment, and temperatures were recorded on an hourly basis.
We consider the SCG presented in Figure~\ref{fig:real_thermoregulation} as the true one. Clearly, the outside temperature directly influences all rooms containing a window and the temperature in each room cannot cause the outside temperature.
Since the living room contains a radiator, it can affect the temperatures in all other rooms. Additionally, since we may use fire in the kitchen for cooking, which can increase the temperature, we consider that the temperature in the kitchen can affect the temperature in the living room. Similarly, since we may use hot water in the bathroom, which can increase the temperature, we consider that the temperature in the bathroom can influence the temperature in the living room.
All other vertices representing rooms in the graph are not connected to each other because they are not physically directly connected; they are all connected through the living room.
Suppose we are specifically interested in estimating the total effect of the temperature in the living room on the temperature in the office. Theorem~\ref{Thm:identification_summary} states that this total effect is identifiable for any lag since $Cycles(\text{Living Room}, \mathcal{G}\backslash\{\text{Office}\})=\emptyset$ and there exists no $\sigma$-active backdoor path between Living Room and Office.


\section{Conclusion}
\label{sec:conclusion}

We studied in this paper the identification of total effects between singleton variables, under causal sufficiency, for both extended summary causal graphs and summary causal graphs. We showed that the total effect is always identifiable for extended summary causal graphs. The same does not hold for summary causal graphs for which we established graphical conditions which are sufficient, in any underlying probability distribution, for the identifiability of the total effect. In addition, in case of identifiability, we  provided several valid adjustment sets for estimating the total effect in extended summary causal graphs, and two adjustment sets when considering summary causal graphs. 

These results have significant implications, such as impact analysis in dynamic systems, particularly in scenarios where experts are unable to provide either a full temporal causal graph or an extended summary causal graph. They are also valuable in cases where the assumptions underlying causal discovery methods for inferring causal graphs with time lags are deemed overly restrictive. Furthermore, these results offer insights that can be useful in different disciplines such as Nephrology, Finance, System Monitoring, and Thermoregulation.

For future work, it would be valuable to establish necessary and sufficient conditions for the identifiability of total effects using SCGs, to extend this work to the case where the responses and interventions can be multivariate, and to the case where there is hidden confounding.

%However, those sets can potentially be large. While the estimation of the total effect based on such adjustments remains unbiased, there is a possibility of high variance. Determining the optimal adjustment set, that has minimal asymptotic estimation variance, remains an open question in this context.

\begin{acknowledgements} 
We thank Ali Aït-Bachir from EasyVista for discussions about the application of this work in system monitoring.
We thank Benjamin Glemain and Nathanael Lapidus from IPLESP and Paolo Malvezzi from CHU Grenoble for discussions about the application of this work in Nephrology/Epidemiology.
Finally, we thank Clément Yvernes from LIG for several discussions and five anonymous reviewers for their many insightful comments and suggestions.
This work was partially supported by MIAI@Grenoble Alpes (ANR-19-P3IA-0003), by the CIPHOD project (ANR-23-CPJ1-0212-01), and by the CSPR R\&D Booster Auvergne-Rhône-Alpes project. 
\end{acknowledgements}

\bibliography{References}

\appendix

\section{Supplementary Material}
\input{supplement_Camera_Ready}


\end{document}
