\section{Relational Causal Discovery with Latent Variables}
In this section, we define latent variables in relational causal models, show why existing algorithms cannot perform relational causal discovery with latent variables, define the graphical models necessary for such discovery, and propose an algorithm for it. The full proofs of all theoretical findings in this paper are available in Appendix \ref{proofs}.
\subsection{Latent Variables in Relational Causal Models}
To perform causal discovery with latent confounders, we first define them in the relational context. 
Considering the set of latent variables \textbf{L}, we need to define what constitutes a latent relational variable  $[I_X...I_Y].Y\in$ \textbf{L}. We assume that all entities in $\mathcal{E}$ and relationships in $\mathcal{R}$ are observed in the relational schema and the model and, consequently, in the variable's relational path. We define the set of latent attributes in a schema $\mathcal{S}$ as the set $\mathcal{A}_\textbf{L}$. We can then look at the definition that follows: 
\begin{definition}[Latent Relational Variable]
A relational variable $RV$: $[I_X...I_Y].Y$ is considered latent, i.e., $RV\in $ \textbf{L} if and only if its attribute class $Y\in\mathcal{A}(I_Y)$ is unobserved in the schema, meaning $Y\in\mathcal{A}_\textbf{L}$.
\end{definition}
Consequently, a relational variable $RV$: $[I_W...I_Z].Z$ is observed, i.e., $RV\in $ \textbf{O} if its attribute class $Z\in\mathcal{A}(I_Z)$ is observed, indicating that it is a member of the set of observed attributes classes in the schema, which we respectively define as $\mathcal{A}_\textbf{O}$. A model's set of relational dependencies $\mathcal{D}$ is thus divided into two groups:
\begin{enumerate}
    \item Set $\mathcal{D}_\mathbf{O}$ of observed dependencies $RV_1 \rightarrow RV_2$ defined only over observed relational variables i.e., $RV_1, RV_2\in\textbf{O}$;
    \item Set $\mathcal{D}_\mathbf{L}$ of latent dependencies $RV_1 \rightarrow RV_2$ containing at least one latent relational variable i.e., $RV1\in\textbf{L}\lor RV_2\in\textbf{L}$;
\end{enumerate} 

The modified relational causal model can now be defined as follows:
\begin{definition}[Latent Relational Causal Model (LRCM)]A relational causal model with latent variables $\mathcal{M}_{\Theta L}$ consists of two parts:
\begin{enumerate}
    \item The structure $\mathcal{M}_L=(\mathcal{S}, \mathcal{D})$: the schema $\mathcal{S}$, containing a set of latent attributes $\mathcal{A}_\textbf{L}$; the set of dependencies $\mathcal{D}=\mathcal{D}_\textbf{O}\cup\mathcal{D}_\textbf{L}$ defined over all relational variables;
    \item Parameters $\Theta$: a conditional probability distribution $P([I_j].X\mid parents([I_j].X))$ for all relational variables of the form $[I_j].X$ \citep{maier2014reasoning}.
\end{enumerate}
\end{definition}
An example LRCM can be seen in figure \ref{subfig:LRCM}.
% Except for synthetically generated data, where we assume a specific LRCM, $\mathcal{M}_{\Theta L}$ is typically unknown.
The latent AGG is constructed from LRCM $\mathcal{M}_L$, similarly to conventional relational causal models \citep{maier2014reasoning}. The construction divides the edges of the abstract ground graph into observed and unobserved edges, based on whether the underlying dependency from which the edge is yielded belongs to $\mathcal{D}_{L}$, i.e., is unobserved.
\begin{figure}[ht]
    \centering
    \begin{subfigure}{0.45\textwidth}
        \centering
        \scalebox{0.8}{
            \begin{tikzpicture}
                % Entities
                \node[draw, rounded corners, rectangle, minimum width=2cm, minimum height=2cm] (A) at (0, 0) {};
                \node[above] at (A.north) {USER};
            
                \node[draw, rounded corners, rectangle, minimum width=2cm, minimum height=2cm] (B) at (5, 0) {};
                \node[above] at (B.north) {POST};
            
                % Define the four corners of the rhombus
                \coordinate (N) at (2.5, 0.75);
                \coordinate (E) at (3.55, 0);
                \coordinate (S) at (2.5, -0.75);
                \coordinate (W) at (1.45, 0);
                \node[above] at (N.north) {REACTS};
                \node[above, font=\tiny] at ([xshift=-0.16cm] W.center) {MANY};
                \node[above, font=\tiny] at ([xshift=0.16cm] E.center) {MANY};
            
                % Create attributes inside A
                \node[draw, ellipse, thick, font=\scriptsize] (A1) at ([yshift=-0.66cm, xshift=-0.09cm] A.center) {Sentiment};
                \node[draw, ellipse, thick, font=\scriptsize] (A2) at ([xshift=-0.45cm, yshift=0.65cm] A.center) {Type};
                \node[draw, ellipse, thick, font=\scriptsize] (A3) at ([xshift=0.24cm, yshift=0cm] A.center) {Activity};
            
                % Create attributes inside AB1
                \node[draw, ellipse, font=\scriptsize, thick, dashed] (AB1_1) at ([yshift=-0.75cm] N.center) {Frequency};
                
                % Draw the lines to form the rhombus
                \draw (N) -- (E) -- (S) -- (W) -- cycle;
            
                \node[draw, ellipse, thick, font=\tiny]  (B1) at ([xshift=0cm, yshift=-0.45cm] B.center) {Engagement};
                \node[draw, ellipse, thick, font=\scriptsize] (B2) at ([xshift=0.1cm, yshift=0.5cm] B.center) {Content};
              
                % Lines
                \draw (A) -- (W);
                \draw (E) -- (B);
            
                % Curved edge
                \path (A1) edge[bend right, thick, ->] (B1);
                \path (B2) edge[bend left, thick, ->] (B1);
                \path (AB1_1) edge[bend left, thick, ->, dashed] (A3);
                \path (AB1_1) edge[bend right, thick, ->, dashed] (B1);
                \path (A2) edge[bend right, thick, ->] (A1);
                \path (A2) edge[bend left, thick, ->] (A3);
            \end{tikzpicture}
        }
        \caption{Latent RCM. Unobserved variables and dependencies marked with segmented lines}
        \label{subfig:LRCM}
    \end{subfigure} \\ 
    \begin{subfigure}{0.45\textwidth}
        \centering
        \scalebox{0.5}{
            \begin{tikzpicture}
                % Nodes
                \node[draw, ellipse, thick, font=\large] (1) at (-2.8, -1.5) {[U].Sentiment};
                \node[draw, ellipse, thick, font=\large] (2) at (-2.8, 0) {[U].Type};
                \node[draw, ellipse, thick, font=\large] (3) at (-2.8, 1.5) {[U].Activity};
                \node[draw, ellipse, thick, font=\large] (4) at (1.7, 0) {[U, R, P].Engagement};
                \node[draw, ellipse, thick, font=\large] (5) at (1.6, -1.5) {[U, R, P].Content};
                \node[draw, ellipse, thick, font=\large] (6) at (7.5, -1.5) {[U, R, P, R, U].Sentiment};
                \node[draw, ellipse, thick, font=\large] (7) at (7.5, 0) {[U, R, P, R, U].Type};
                \node[draw, ellipse, thick, font=\large] (8) at (7.5, 1.5) {[U, R, P, R, U].Activity};
                \node[draw, ellipse, thick, dashed, font=\large] (9) at (1.7, 1.5) {[U, R].Frequency};

                \path (2) edge[thick, ->] (1);
                \path (2) edge[thick, ->] (3);
                \path (9) edge[thick, ->, dashed] (3);
                \path (9) edge[thick, ->, dashed] (4);
                \path (1) edge[thick, ->] (4);
                \path (5) edge[thick, ->] (4);
                
                \path (7) edge[thick, ->] (6);
                \path (7) edge[thick, ->] (8);
                \path (9) edge[thick, ->, dashed] (8);
                \path (6) edge[thick, ->] (4);
            \end{tikzpicture}
        }
        \caption{True LAGG for perspective U}
        \label{subfig:True}
    \end{subfigure} \\  
    \begin{subfigure}{0.45\textwidth}
        \centering
        \scalebox{0.5}{
            \begin{tikzpicture}
                % Nodes
                \node[draw, ellipse, thick, font=\large] (1) at (0.5, -1.5) {[U].Sentiment};
                \node[draw, ellipse, thick, font=\large] (2) at (0.25, 0) {[U].Type};
                \node[draw, ellipse, thick, font=\large] (3) at (0.25, 1.5) {[U].Activity};
                \node[draw, ellipse, thick, font=\large] (4) at (5, 0) {[U, R, P].Engagement};
                \node[draw, ellipse, thick, font=\large] (5) at (5.5, -1.5) {[U, R, P].Content};
                \node[draw, ellipse, thick, font=\large] (6) at (10.5, -1) {[U, R, P, R, U].Sentiment};
                \node[draw, ellipse, thick, font=\large] (7) at (11, 0.5) {[U, R, P, R, U].Type};
                \node[draw, ellipse, thick, font=\large] (8) at (6, 1.5) {[U, R, P, R, U].Activity};
            
                % % Curved edge
                % \path (A1) edge[bend right, thick, ->] (B1);
                % \path (B2) edge[bend right, thick, ->] (B1);
                % % \path (AB_1) edge[bend left, thick, ->, dashed] (A3);
                % % \path (AB_1) edge[bend right, thick, ->, dashed] (B1);
                % \path (A2) edge[bend right, thick, {Circle[open]}-{Circle[open]}] (A1);
                % \path (A2) edge[bend left, thick, ->] (A3);
                % \path (A3) edge[bend left, thick, <->, line width=0.6mm] (B1);

                \path (1) edge[thick, ->] (4);
                \path (3) edge[thick, ->] (4);
                \path (5) edge[thick, ->] (4);
                \path (6) edge[thick, ->] (4);
                \path (8) edge[thick, ->] (4);
                \path (6) edge[thick, ->] (4);
                % \path (8) edge[thick, ->] (4);
                % \path (7) edge[thick, ->] (6);
                % \path (7) edge[thick, ->] (8);
            \end{tikzpicture}
        }
        \caption{RCD output AGG for perspective U}
        \label{subfig:RCD}
    \end{subfigure}
    \caption{Counterexample that shows RCD does not produce the correct output AGG for LRCM with a faithful oracle}
    \label{fig:RCD}
\end{figure}
\begin{definition}[Latent Abstract Ground Graph (LAGG)]
    Given a relational causal model $\mathcal{M}_L$ a maximum hop threshold $h$, and a perspective $\mathcal{B}$, the $LAGG_{\mathcal{M}_\textbf{L}\mathcal{B}h}$ is the abstract ground graph of the latent relational causal model. It contains both variables in the sets \textbf{O} and \textbf{L} over perspective $\mathcal{B}$, plus intersection variables divided into observed intersection variables (both participating variables in the intersections are observed), and latent ones (i.e., at least one participating variable is latent). The set of edges $E$ yielded from the dependencies in $\mathcal{D}_\textbf{O}$ and $\mathcal{D}_\textbf{L}$, using the \verb|extend| method \citep{maier2014reasoning}, is partitioned respectively into the set of observed ($E_\textbf{O}$) and unobserved ($E_\textbf{L}$) edges.
\end{definition} 
Consider the LRCM shown in figure \ref{subfig:LRCM} and a hop threshold $h=2$. 
% It has two entity types, A and B, one MANY-TO-MANY relationship AB, and maximum hop threshold $h=2$. The entities and relationship have three, two, and one attribute for A, B, and AB, respectively. 
The Frequency attribute for REACT is unobserved. There are six relational dependencies in the model: 1) [U].Type $\rightarrow$ [U].Activity, 2) [U].Type $\rightarrow$ [U].Sentiment, 3) [P, R, U].Sentiment $\rightarrow$ [P].Engagement, 4) [P].Content $\rightarrow$ [P].Engagement, 5) [U, R].Frequency $\rightarrow$ [U].Activity, 6) [P, R].Frequency $\rightarrow$ [P].Engagement. The last two are unobserved dependencies in $\mathcal{D}_\textbf{L}$. The respective LAGG for the described LRCM is shown in figure \ref{subfig:True}. 
\subsection{Latent Relational Causal Discovery}
% Describe why RCD is not sufficient for discovering causal relations in the presence of latent variables in relational models -> counterexample, maybe not complex enough underlying model
The RCD algorithm is the first sound and complete procedure that learns the dependencies of a relational causal model \citep{maier2013sound}. It works under several assumptions, described in detail in Appendix \ref{assumptions}: maximum hop threshold $h$, $d$-faithfulness, acyclicity, and causal sufficiency. Causal sufficiency in particular implies that RCD was not originally designed for models with latent variables. Given that some forms of latent confounding can be detected via simple dependence tests~\citep{arbour2016inferring}, it is natural to ask whether RCD is still sound and complete when the casual sufficiency assumption is lifted. To the best of our knowledge, no prior research has addressed this question in detail. 
% To answer, we need to assume the existence of an oracle as defined in \citet{spirtes2000causation}, which is faithful to the relations defined by G(\textbf{O},\textbf{L}) and, as such, can reliably determine the conditional independence of any two observed variables in \textbf{O}. 
The following counterexample shows that RCD is neither sound nor complete for relational causal discovery with latent variables.

\textbf{Counterexample} Figure \ref{subfig:RCD} shows the output AGG produced by RCD using an oracle faithful to the underlying distribution. As we can see, the actual LAGG in figure \ref{subfig:True} contains outgoing edges from [U].Type and from [U, R, P, R, U].Type; however, the output AGG (Fig. \ref{subfig:RCD} lacks these edges. This indicates that RCD fails to identify the relational dependencies 1) and 2).
% [A].A2 $\rightarrow$ [A].A1 and [A].A2 $\rightarrow$ [A].A3 relational dependencies.
Furthermore, without latent variables, RCD cannot capture and detect the presence of a latent confounder on the AGG using only directed edges. As seen in Figure \ref{fig:RCD}, the fundamental problem that renders RCD neither sound nor complete for LRCM is the lack of identification of latent variables. This suggests that a more expressive representation than AGGs is required for the correct causal discovery in the presence of latent variables. 

% The representations required to expand RCD with FCI to enable relational causal discovery with latent variables are shown in the next section.
\subsection{Partial Ancestral Abstract Ground Graphs}

\begin{figure}[ht]
    \centering
    \begin{subfigure}{0.45\textwidth}
        \centering
        \scalebox{0.5}{
            \begin{tikzpicture}
                % Nodes
                \node[draw, ellipse, thick, font=\large] (1) at (0.5, -1.5) {[U].Sentiment};
                \node[draw, ellipse, thick, font=\large] (2) at (0.25, 0) {[U].Type};
                \node[draw, ellipse, thick, font=\large] (3) at (0.25, 1.5) {[U].Activity};
                \node[draw, ellipse, thick, font=\large] (4) at (5, 0) {[U, R, P].Engagement};
                \node[draw, ellipse, thick, font=\large] (5) at (5.5, -1.5) {[U, R, P].Content};
                \node[draw, ellipse, thick, font=\large] (6) at (10.5, -1) {[U, R, P, R, U].Sentiment};
                \node[draw, ellipse, thick, font=\large] (7) at (11, 0.5) {[U, R, P, R, U].Type};
                \node[draw, ellipse, thick, font=\large] (8) at (6, 1.5) {[U, R, P, R, U].Activity};
            
            

                \path (2) edge[thick, ->] (3);
                \path (2) edge[thick, ->] (1);
                \path (4) edge[thick, <->] (3);
                \path (1) edge[thick, ->] (4);
                \path (5) edge[thick, ->] (4);
                \path (6) edge[thick, ->] (4);
                \path (8) edge[thick, <->] (4);
                \path (7) edge[thick, ->] (6);
                \path (7) edge[thick, ->] (8);
            \end{tikzpicture}
        }
        \caption{MAAGG from the LRCM in Figure \ref{subfig:LRCM} for perspective U}
    \end{subfigure} \\
    \begin{subfigure}{0.45\textwidth}
        \centering
        \scalebox{0.5}{
            \begin{tikzpicture}
                % Nodes
                \node[draw, ellipse, thick, font=\large] (1) at (0.5, -1.5) {[U].Sentiment};
                \node[draw, ellipse, thick, font=\large] (2) at (0.25, 0) {[U].Type};
                \node[draw, ellipse, thick, font=\large] (3) at (0.25, 1.5) {[U].Activity};
                \node[draw, ellipse, thick, font=\large] (4) at (5, 0) {[U, R, P].Engagement};
                \node[draw, ellipse, thick, font=\large] (5) at (5.5, -1.5) {[U, R, P].Content};
                \node[draw, ellipse, thick, font=\large] (6) at (10.5, -1) {[U, R, P, R, U].Sentiment};
                \node[draw, ellipse, thick, font=\large] (7) at (11, 0.5) {[U, R, P, R, U].Type};
                \node[draw, ellipse, thick, font=\large] (8) at (6, 1.5) {[U, R, P, R, U].Activity};
            

                \path (2) edge[thick, ->] (3);
                \path (2) edge[thick, {Circle[open]}-{Circle[open]}] (1);
                \path (4) edge[thick, <->] (3);
                \path (1) edge[thick, ->] (4);
                \path (5) edge[thick, ->] (4);
                \path (6) edge[thick, ->] (4);
                \path (8) edge[thick, <->] (4);
                \path (7) edge[thick, {Circle[open]}-{Circle[open]}] (6);
                \path (7) edge[thick, ->] (8);
            \end{tikzpicture}
        }
        \caption{PAAGG of \textit{O-Equiv}($\mathcal{D}_{\textbf{O}}$) for perspective U}
    \end{subfigure} \\
    \begin{subfigure}{0.45\textwidth}
        \centering
        \scalebox{0.9}{
            \begin{tikzpicture}
                % Entities
                \node[draw, rounded corners, rectangle, minimum width=2cm, minimum height=2cm] (A) at (0, 0) {};
                \node[above] at (A.north) {USER};
            
                \node[draw, rounded corners, rectangle, minimum width=2cm, minimum height=2cm] (B) at (5, 0) {};
                \node[above] at (B.north) {POST};
            
 % Define the four corners of the rhombus
                \coordinate (N) at (2.5, 0.75);
                \coordinate (E) at (3.55, 0);
                \coordinate (S) at (2.5, -0.75);
                \coordinate (W) at (1.45, 0);
                \node[above] at (N.north) {REACTS};
                \node[above, font=\tiny] at ([xshift=-0.16cm] W.center) {MANY};
                \node[above, font=\tiny] at ([xshift=0.16cm] E.center) {MANY};
            
                % Create attributes inside A
                \node[draw, ellipse, thick, font=\scriptsize] (A1) at ([yshift=-0.66cm, xshift=-0.09cm] A.center) {Sentiment};
                \node[draw, ellipse, thick, font=\scriptsize] (A2) at ([xshift=-0.45cm, yshift=0.65cm] A.center) {Type};
                \node[draw, ellipse, thick, font=\scriptsize] (A3) at ([xshift=0.24cm, yshift=0cm] A.center) {Activity};
                
                % Draw the lines to form the rhombus
                \draw (N) -- (E) -- (S) -- (W) -- cycle;
            
                \node[draw, ellipse, thick, font=\tiny]  (B1) at ([xshift=0cm, yshift=-0.45cm] B.center) {Engagement};
                \node[draw, ellipse, thick, font=\scriptsize] (B2) at ([xshift=0.1cm, yshift=0.5cm] B.center) {Content};
              
                % Lines
                \draw (A) -- (W);
                \draw (E) -- (B);
            
                % Curved edge
                \path (A1) edge[bend right, thick, ->] (B1);
                \path (B2) edge[bend right, thick, ->] (B1);
                % \path (AB_1) edge[bend left, thick, ->, dashed] (A3);
                % \path (AB_1) edge[bend right, thick, ->, dashed] (B1);
                \path (A2) edge[bend right, thick, {Circle[open]}-{Circle[open]}] (A1);
                \path (A2) edge[bend left, thick, ->] (A3);
                \path (A3) edge[bend left, thick, <->, line width=0.6mm] (B1);
            \end{tikzpicture}
        }
        \caption{PARM of \textit{O-Equiv}($\mathcal{D}_{\textbf{O}}$)}
    \end{subfigure}%
    \caption{New representations to enable relational causal discovery in LRCMs}
    \label{fig:models}
\end{figure}

To enable causal discovery in latent relational causal models, we first need to define the necessary graphical models. Figure 3 shows an example of the models we introduce in this section. We will do so by considering an extension of the graphical models used in FCI to the relational setting, with a representation that expresses the underlying dependencies $\mathcal{D}$ over a set of observed variables \textbf{O} (i.e., $\mathcal{D}_{\textbf{O}}$):
\begin{definition}[Maximal Ancestral Abstract Ground Graph (MAAGG)]
Given a 
% relational schema $\mathcal{S} = (\mathcal{E}, \mathcal{R}, \mathcal{A}, \textit{card})$, a 
latent relational causal model $\mathcal{M}_\textbf{L}(\mathcal{S},\mathcal{D})$ with a hop threshold $h$, any perspective $\mathcal{B}$, and the resulting Latent Abstract Ground Graph $LAGG_{\mathcal{M}_L\mathcal{B}h}$, the maximal ancestral abstract ground graph $MAAGG_{\mathcal{M}\mathcal{B}h'}$ is a graph, with a hop threshold $h'\geq h$ comprising:
\begin{itemize}
    \item One node for each relational variable of the LAGG in \textbf{O} and the respective set of observed intersection variables;
    \item Three types of edges: $\rightarrow$, \textemdash , and $\leftrightarrow$, which are used to represent the underlying dependencies $\mathcal{D}_{\textbf{O}}$.
\end{itemize}
\end{definition}
The MAAGG $\mathbfcal{G}$ defined over the variables in \textbf{O}, following the definition of \citet{ZHANG20081873}, probabilistically represents the respective LAGG defined over \textbf{O} and \textbf{L}, specifically:
\begin{itemize}
    \item Two variables $A, B \in \textbf{O}$ are adjacent in $\mathbfcal{G}$ if and only if there is an inducing path relative to $\langle L\rangle$ in the true LAGG;
    \item The orientation entails the same concept of non-causality and ancestry between two variables for MAGs and PAGs.
    % :
    % \begin{enumerate}
    %     \item $A \rightarrow B$ implies that $A$ causes $B$ ($A$ is an ancestor of $B$), and $B$ is not a cause of $A$ ($B$ is not an ancestor of $A$);
    %     \item $A \leftrightarrow B$ implies that $A$ is not a cause of $B$, but also $B$ is not a cause of $A$. Considering the adjacency of $A$ and $B$ in the MAAGG, this implies a common latent confounder between the two observed variables.
    %     \item $A$ \textemdash $B$ implies that $A$ causes $B$, and $B$ also causes $A$.
    % \end{enumerate}
\end{itemize}
We now introduce a lemma that is necessary for proving the soundness and completeness of our proposed method:
\begin{lemma}
    Given a relational causal model structure $\mathcal{M}$ and perspective $\mathcal{B}$, an abstract ground graph $AGG_{\mathcal{M}\mathcal{B}}$ is ancestral if and only if all ground graphs $GG_{\mathcal{M}\sigma}$, with skeleton $\sigma\in\sum_\mathcal{S}$, are ancestral.
\end{lemma}
Generally, the set underlying dependencies $\mathcal{D}_\textbf{O}$ is not associated with a single MAAGG, but with the class of Markov equivalence defined as \textit{O-Equiv}($\mathcal{D}_{\textbf{O}}$). Therefore, we define another abstraction, based on PAGs and AGGs, that represents this equivalence class: 
\begin{definition}[Partial Ancestral Abstract Ground Graph (PAAGG)]
Given a relational causal model $\mathcal{M}_\textbf{L}(\mathcal{S},\mathcal{D})$ with hop threshold $h$, the respective $MAAGG_{\mathcal{M}\mathcal{B}_{h'}$, and its equivalence class \textit{O-Equiv}($\mathcal{D}_{\textbf{O}}$), and a perspective $\mathcal{B}$, the partial ancestral abstract ground graph $PAAGG_{\mathbfcal{M}\mathcal{B}_{h'}}$ is a bidirected PAG, with hop threshold $h'\geq h$ comprising:
\begin{itemize}[wide, labelwidth=!, labelindent=0pt]
    \item The same set of nodes and adjacencies as the MAAGG;
    \item Edges containing three kinds of marks: ◦, \textemdash and $\rightarrow$, which are used to represent the variance and invariance of the equivalence class \textit{O-Equiv}($\mathcal{D}_{\textbf{O}}$).
    % , and thus of models belonging to \textit{O-Equiv}($\mathcal{D}_{\textbf{O}}$).
\end{itemize}
\end{definition}

% To perform causal discovery on the ARM by applying relational $d$-separation, we define a new lifted representation based on $AGG$s:

% From this set, it's possible to obtain the \textit{O-Equiv}($\mathcal{D}_{\textbf{O}}$) class of relational models that share the same observed dependencies. To support the concept of variant and invariant dependency marks between models, a new representation inspired by PAGs is defined as follows: 
% , without including latent variables, can represent the causal and conditional independence relations for observed relational variables:

% the underlying dependencies $\mathcal{D}$ over a set of observed variables \textbf{O} (i.e., $\mathcal{D}_{\textbf{O}}$). From this set, it's possible to obtain the \textit{O-Equiv}($\mathcal{D}_{\textbf{O}}$) class of relational models that share the same observed dependencies. To support the concept of variant and invariant dependency marks between models, a new representation inspired by PAGs is defined as follows:
The following proposition provides a description of the soundness and completeness of the new representation:
\begin{proposition}
    Given a relational causal model $\mathcal{M}_\textbf{L}(\mathcal{S},\mathcal{D})$ with hop threshold $h$, and its respective latent abstract ground graph $G$, the constructed MAAGG probabilistically and causally represents $G$ and thus the underlying relational causal model. Furthermore, assuming a sound and complete procedure to construct the PAAGG $G'$, it correctly represents the Markov equivalence class of the produced MAAGG
    % \cite{ZHANG20081873}
    and, therefore, of $G$ and the underlying model $\mathcal{M}_\textbf{L}$. 
\end{proposition}
The equivalence class of MAAGGs represented by the PAAGG corresponds to multiple LRCMs that share the same set of dependencies over $\mathcal{D}_\textbf{O}$. Thus, it is possible to define a new model from the PAAGG, which represents the equivalence class \textit{O-Equiv}($\mathcal{D}_{\textbf{O}}$):

% \begin{definition}[Partial Ancestral Relational Model (PARM)]
% Given a $PAAGG_{\mathbfcal{M}\mathcal{B}_h'}$ for a LRCM $\mathcal{M}_\textbf{L}(\mathcal{S},\mathcal{D})$, a partial ancestral model $\mathbfcal{M}(\mathcal{S}_\textbf{O},\mathcal{D}')$ is the relational model corresponding to the $PAAGG$. The PARM is defined over a relational schema containing only observed attribute classes $\mathcal{S}_\textbf{O}=(\mathcal{E}, \mathcal{R}, \mathcal{A}_\textbf{O}, \textit{card})$ and a set $\mathcal{D}'$ of dependencies, which are used to represent the causality information for all models in the equivalence class \textit{O-Equiv}($\mathcal{D}_{\textbf{O}}$).
% \end{definition}

\begin{definition}[Partial Ancestral Relational Model (PARM)]
Given a LRCM $\mathcal{M}_\textbf{L}(\mathcal{S},\mathcal{D})$ and its respective $PAAGG_{\mathbfcal{M}\mathcal{B}_{h'}}$ for the equivalence class \textit{O-Equiv}($\mathcal{D}_{\textbf{O}}$), a partial ancestral model $\mathbfcal{M}(\mathcal{S}_\textbf{O},\mathcal{D}')$ is the relational causal model abstracted by the PAAGG that represents \textit{O-Equiv}($\mathcal{D}_{\textbf{O}}$). The PARM is defined over a relational schema containing only observed attribute classes $\mathcal{S}_\textbf{O}=(\mathcal{E}, \mathcal{R}, \mathcal{A}_\textbf{O}, \textit{card})$ and a set $\mathcal{D}'$ of dependencies, which are used to represent the causality information for all models in \textit{O-Equiv}($\mathcal{D}_{\textbf{O}}$).
\end{definition}
The definitions of $MAAGG$ and $PAAGG$ allow a limit on the hop threshold higher than that of the underlying equivalence class of models. This is because the set of possible underlying dependencies with at most the same hop threshold would not capture paths that, in addition to the allowed threshold for observed variables, include unobserved variables. 
The higher hop threshold implied by definition 2 for $PAAGG$s is required to obtain an abstraction that correctly represents the presence of latent confounders in the underlying model. This is due to the presence of latent confounders (Definition 3) in a relational causal model and to the absence of latent parents and children for latent variables.
\begin{proposition} \label{prop:hop}
Given a latent relational causal model $\mathcal{M}_\textbf{L}(\mathcal{S},\mathcal{D})$ with hop threshold $h$ and its corresponding PARM $\mathbfcal{M}$, the hop threshold $h'$ of the $PAAGG_{\mathbfcal{M}\mathcal{B}}$ for any perspective $\mathcal{B}$ can be at most $2h$.
\end{proposition}
 
\subsection{The RelFCI Algorithm}
In this section, we present the Relational Fast Causal Inference (RelFCI) algorithm, a sound and complete procedure for determining causal relationships from relational data when unobserved variables are present. 
RelFCI follows a three-step approach similar to the FCI algorithm for Bayesian networks (Spirtes 2013). RelFCI adapts the FCI procedure to relational causal models, similar to how RCD \citep{maier2013sound} does with the PC algorithm \citep{spirtes2000causation}. Given that FCI is an extension of PC, RelFCI follows the same orientation rules as RCD and also assumes a prior relational skeleton. However, it differs from RCD in two ways: [1] RelFCI uses partial ancestral abstract ground graphs, one for each perspective, as the underlying representation; [2] RelFCI applies seven additional rules from FCI to ensure soundness and completeness with latent relational data.

Algorithm \ref{alg:main_alg} shows the high-level pseudocode for RelFCI, and Appendix \ref{algo} contains the complete algorithm pseudocode. RelFCI, like RCD, computes the set of potential dependencies in canonical form, limited by the $h'=2h$ threshold. Starting from these dependencies, the algorithm constructs PAAGGs, one for each perspective, all with ◦ edge marks. The first step is to remove potential dependencies using conditional independence tests with conditioning sets of increasing size drawn from the collection of adjacencies of the two nodes considered. After deleting all possible edges, a set of unshielded triples is obtained. The second phase detects colliders while finding potential additional independence relationships between the triples' variables and potentially eliminating the respective edges. Even though RelFCI operates on different graphical models compared to FCI and RCD, it is straightforward to adapt their rules for RelFCI. The third step thus performs edge orientation by applying RBO, KNC, CA, and MK3 rules from RCD first, then rules R4 through R10 from FCI. A detailed description of these rules is provided in Appendix \ref{rules}. In contrast to the first step, the latter two differ from FCI because they apply the RBO rule from RCD and propagate each edge orientation to other PAAGGs. All steps are performed on all PAAGGs to accurately identify additional separation sets for each perspective. 

Before demonstrating the soundness and completeness of RelFCI, we first clarify how the algorithm handles relational dependencies and edge orientations in PAAGGs. 
Since with ◦ marks RelFCI produces an equivalence class rather than a single causal model, certain underlying dependencies remain ambiguous. To address this, we distinguish between \textit{required dependencies}, which must be oriented in a specific direction to respect the PAAGG orientation, and \textit{possible dependencies}, which may have alternative orientations while remaining consistent with the learned PAAGG. With this new distinction, it is then possible to define the propagation of edges orientation across all PAAGGs for every perspective in a given LRCM, following a similar approach to the one described in RCD \cite{maier2013sound} for regular AGGs. A detailed explanation of these aspects is provided in Appendices \ref{deps} and \ref{edges}.

\begin{algorithm}[tb]
\caption{RelFCI algorithm}
\label{alg:main_alg}
\textbf{Input}: schema, oracle, threshold \\
\textbf{Output}: Dependencies
\begin{algorithmic}[1] %[1] enables line numbers
\STATE $PDs \gets$ get potential dependencies from the base schema with 2*threshold
\STATE $PAAGGs \gets$ construct PAAGGs from set of potential dependencies $PDs$
\STATE $S \gets \{\}$ \\
% \verb|// Find independent variables and| \\
% \verb|unshielded triples|
\STATE $PAAGGs, S, U \gets$ find all independent variables in the graphs, storing separating sets and unshielded triples
% \verb|obtainInitialSkeleton|(PAAGGs, S)$ \\
% \verb|// Find and orient v-structures|
\STATE $PAAGGS, S \gets$ orient v-structures using CD, starting from unshielded triples in $U$
% \verb|orientVStructures|(PAAGGs, S, U)$ 
% \verb|// Apply orientations rules from RCD, FCI| \\
\STATE $PAAGGs, S \gets$ orient PAAGGs edges using RCD and FCI rules
% \verb|performEdgeOrientation|(PAAGGs, S)$
\STATE $Deps \gets$ retrieve underlying dependencies from the edges of oriented PAAGGs
% \verb|retrieveDepsFromPAAGGs|(PAAGGs)$
\STATE \textbf{return} Deps
\end{algorithmic}
\end{algorithm}

\subsection{Soundness}
\citet{maier2013sound} prove the soundness of CD, KNC, CA, MR3, and the new RBO rule using a proof derived from the soundness definition presented in \citet{meek1995causal}. Thus, we will focus on the soundness of the remaining rules R4-R10 adapted from FCI \citep{ZHANG20081873}.
\begin{theorem} \label{theo:sound}
Let G be the partially oriented PAAGG from perspective B, with the correct adjacencies, unshielded colliders correctly orientated through CD and RBO, and as many edges as possible oriented through KNC, CA, MR3, and the purely common cause of RBO. Then, FCI's rules R4-R10 and the orientation propagations are sound.
\end{theorem}
The proof is an extension of those presented by \citet{spirtes2000causation} for rule R4 and \citet{ZHANG20081873} for rules R5-R10.
\subsection{Completeness}
A set of orientation rules is called complete if it generates a maximally informative graph. In PAAGGs, each circle corresponds to a variation mark in the equivalence class \textit{O-Equiv}($\mathcal{D}_{\textbf{O}}$) (modified from \citet{ZHANG20081873}). The rules employed in FCI can be divided into two groups based on their function: those used to identify arrowhead invariants (CD, KNC, CA, MR3, and R4) and those used to identify tail invariants (R5-R10). According to \citet{ali2012towards}, the first set of rules covers all invariant arrowheads. Lemma \ref{lemma:arrow} shows that PAAGGs have similar arrowhead completeness, which can be used to prove overall rule completeness.
\begin{lemma} \label{lemma:arrow}
Let G be a partially oriented PAAGG with correct adjacencies. Then, exhaustively applying CD, RBO, KNC, CA, and MR3, all with orientation propagation of edges, produces a graph G' in which for every circle mark, there exists a MAAGG in the \textit{O-Equiv}($\mathcal{D}_{\textbf{O}}$) class with a corresponding tail mark.
\end{lemma}
Following \citet{ali2012towards}'s proofs for MAG, we apply the same reasoning for MAAGG and expand it with the RBO rule. The orientation propagation proof is identical to the one offered in \citet{maier2013sound}.
We now provide tail completeness of the remaining set of rules.
\begin{lemma} \label{lemma:tail}
Let G' be the partially oriented PAAGG with correct adjacencies and unshielded colliders, and as many edges orientated with KNC, CA, and MR3, consistently applying edge propagation. Then, applying rules R5-R10, along with orientation propagation, provides a graph G'' such that for every circle mark, there exists a MAAGG in \textit{O-Equiv}($\mathcal{D}_{\textbf{O}}$) in which the associated mark is an arrowhead.
\end{lemma}
The proof comes from \cite{ZHANG20081873} tail completeness, establishing that every PAAGG edge ◦\textemdash, ◦\textemdash◦, ◦$\rightarrow$, the circle mark corresponds to an arrowhead in an MAAGG belonging to the equivalence class. With lemmas \ref{lemma:arrow} and \ref{lemma:tail} in place, completeness follows:
\begin{figure*}[ht]
    \centering
    \includegraphics[width=\textwidth]{imgs/combined_figure.png}
    \caption{RelFCI and RCD Precision and Recall comparison. Results are combined for both 1 and 2 latent variables. Intervals represent $\pm1$ standard deviation.}
    \label{fig:pre-rec-tot}
\end{figure*}
\begin{figure*}[ht]
    \centering
    \includegraphics[width=\textwidth]{imgs/plot_latvar.png}
    \caption{RelFCI and RCD Precision and Recall performance with 1 and 2 latent variables.}
    \label{fig:pre-rec-lat}
\end{figure*}

\begin{theorem} \label{theo:compl}
    Given a partially oriented PAAGG G with the appropriate set of adjacencies, applying rules CD, KNC, CA, MR3, and RBO extensively, followed by orienting any possible edges with rules R4-R10, all with orientation propagation, yields a maximally informative graph G. 
\end{theorem}
\begin{proof}
    Lemmas \ref{lemma:arrow} and \ref{lemma:tail} prove that in the output graph $G$ produced by applying all the rules, every remaining circle mark corresponds to both tail and arrowhead variant marks in the \textit{O-Equiv} ($\mathcal{D}_{\textbf{O}}$). As such, the circle mark is considered a variation mark. Thus, by definition, the graph $G$ is maximally informative.
\end{proof}
 
We are now ready to establish the soundness and completeness of RelFCI:
\begin{theorem}
Given a schema and a probability distribution P(\textbf{V}) with $\textbf{V}=\textbf{O}\cup\textbf{L}\cup\textbf{S}$, the output of RelFCI is a correct maximally informative PAAGG, and thus a maximally informative PARM $\mathbfcal{M}$, assuming perfect conditional independence tests and sufficient hop threshold $h'$.
\end{theorem}
