\section{Background}
We provide an overview of relational theory, which serves as the foundation for our proposed RelFCI algorithm and its proofs of correctness. We follow the theoretical definitions provided by \cite{maier2014reasoning}. We also go over the theory underlying causal discovery using latent confounders and partial ancestral graphs for Bayesian networks as specified by \cite{spirtes2000causation} for the FCI algorithm. Finally, we provide the set of assumptions used in this work for relational causal discovery with latent variables.
Appendices \ref{rcd} and \ref{fci} contain accompanying figures that illustrate the concepts presented in this section.
\subsection{Relational Data and Relational Causal Models}
A \textit{Relational Schema} $\mathcal{S} = (\mathcal{E}, \mathcal{R}, \mathcal{A}, \textit{card})$ is a collection of a set of entity types $\mathcal{E}$; a set of relationship types $\mathcal{R}$, where $\mathcal{R}_i=\langle E_1^i,...,E_{a}^i\rangle \in \mathcal{R}$, with $E_j^i\in\mathcal{E}$ and $a$ the arity of the relation; a set of attribute classes $\mathcal{A}(I)$ for each entity or relationship and a cardinality function \textit{card}: $\mathcal{R}\times\mathcal{E} \rightarrow \{\text{ONE, MANY}\}$. As a running example, we will consider a schema with two entity types, USER (U) and POST (P), and the relationship between them, REACTS (R). USER has three attributes (U.Type, U.Activity and U.Sentiment), POST has two attributes (P.Engagement, P.Content), and REACTS has one attribute (R.Frequency). 

Given a relational schema, a \textit{Relational Variable} $[I_X...I_Y].Y$ consists of a \textit{Relational Path} $[I_X...I_Y]$, an alternating sequence of connected entities and relations, and an attribute $Y$ of the last class reached by said path. The first class $I_X$ of this relational variable is called \textit{perspective}. For example, from the described schema example, $[U, R, P].Engagement$ is a relational variable from the perspective USER, which captures the set of Engagements of all posts that a user reacts to. 

A \textit{Relational Dependency} $[I_X...I_Y].Y \rightarrow [I_X].X$ is a pair of two relational variables with a common perspective. Relational paths allow us to model causal dependencies between attributes of different entities, e.g., $[P, R, U].Sentiment \rightarrow [P].Engagement$ indicates that the engagement of a post depends on the sentiment of the user reacting to that post. The dependency is called \textit{canonical} if the path of the outcome variable (in the example, $[P].Engagement$) has a path of length 1. 
A \textit{Relational Causal Model} $\mathcal{M}_{\Theta}(\mathcal{S}, \mathcal{D})$ is a set of relational dependencies $\mathcal{D}$ defined over schema $\mathcal{S}$, with $\Theta$ denoting the set of conditional probability distributions for each attribute $\mathcal{A}(I)$ of every class $I\in\mathcal{E}\cup\mathcal{R}$ over its parents. The arrow corresponds to a relational dependency. The example relational causal model in Figure \ref{fig:model} shows that the user's sentiment and the post's content influence the engagement of the post.
A \textit{Relational Skeleton} $\sigma$, is an instantiation of the schema for all entities, relationships, and attributes which follows the cardinality requirements specified by \textit{card}. 
In other words, this is the data realization of the schema. For example, one relational skeleton could have two entities of type USER, Bob and Anna, and one entity of type POST, a food recipe, that Bob and Anna react to.
We denote the set of all possible relational skeletons for a schema $\mathcal{S}$ as $\Sigma_{\mathcal{S}}$.

Each relational causal model $\mathcal{M}_\Theta$ and relational skeleton $\sigma$ correspond to a \textit{Ground Graph} $GG_{\mathcal{M}\sigma}$.  The nodes in this graph are the attributes of all Entities and Relation instances in the skeleton $\sigma$, while the edges between instances of variables represent all dependencies in $\mathcal{M}$. Graphical examples of these representations can be seen in Appendix \ref{rcd}. An \textit{Abstract Ground Graph} $AGG_{\mathcal{M}\mathcal{B}h}$, for the relational causal model $\mathcal{M}$, perspective $\mathcal{B}$ and hop-threshold $h$, is a graph that captures dependencies between relational variables that hold for all possible ground graphs $GG_{\mathcal{M}\sigma}$, with $\sigma \in \Sigma_{\mathcal{S}}$. Abstract ground graphs are defined for each perspective $\mathcal{B}$ and relational path length fixed to $h$. 
\textcolor{red}{
They include one node for each relational variable defined over the given perspective, as well as additional \textit{intersection variables}, which represent the potential overlap in instances between two relational variables that share the same attribute class. For instance, if the AGG contains $[U,R,P].\textit{Engagement}$ and $[U,R,P,R,U,R,P].\textit{Engagement}$, two relational variables with different paths but the same attribute, an intersection node $[U,R,P] \cap [U,R,P,R,U,R,P].\textit{Engagement}$ is added to capture their shared instances. These intersection nodes inherit all incoming and outgoing edges from the original variables, as a single ground random variable may participate in multiple relational variable instances. Including intersection variables is essential for accurately modeling implied dependencies and for correctly applying d-separation, since such shared participation can affect conditional independence reasoning.

$AGG_{\mathcal{M}\mathcal{B}h}$ contains edges between relational variables if the instantiations of those relational variables contain a dependent pair in some ground graph. 
The edges are obtained using the \verb|extend| method \citep{maier2014reasoning}, which constructs relational paths from the current perspective to a dependency’s source attribute. Formally, given a relational dependency $[I_Y, \dots, I_Z].Y \rightarrow [I_X].X$, and a current perspective path $[I_B, \dots, I_X]$, the method finds all valid pivot points between the reversed path $[I_X, \dots, I_B]$ and the dependency path, and concatenates them at the pivot to generate new paths of the form $[I_B, \dots, I_X, \dots, I_Z]$. The process ensures that dependencies are appropriately lifted to the abstract ground graph, regardless of original perspective. 
Furthermore, intersection variables inherit the edges from both the variables in the pair. 

A single dependency in $\mathcal{M}$, with the extend method, may support multiple edges in $AGG_{\mathcal{M}\mathcal{B}h}$. Additionally, a single model $\mathcal{M}$ produces multiple AGGs, one for each perspective. A more complete description of AGGs components and the extend method are provided in Appendix \ref{rcd}.
\citet{maier2014reasoning} showed that $d$-separation applied to abstract ground graphs (i.e., relational $d$-separation) allows the identification of conditional independences that hold across all ground graphs. 

\begin{figure}[ht]
    \centering
    \begin{tikzpicture}
        % Entities
        \node[draw, rounded corners, rectangle, minimum width=2cm, minimum height=2cm] (A) at (0, 0) {};
        \node[above] at (A.north) {USER};
    
        \node[draw, rounded corners, rectangle, minimum width=2cm, minimum height=2cm] (B) at (5, 0) {};
        \node[above] at (B.north) {POST};
    
        % Define the four corners of the rhombus
        \coordinate (N) at (2.5, 0.75);
        \coordinate (E) at (3.55, 0);
        \coordinate (S) at (2.5, -0.75);
        \coordinate (W) at (1.45, 0);
        \node[above] at (N.north) {REACTS};
        \node[above, font=\tiny] at ([xshift=-0.16cm] W.center) {MANY};
        \node[above, font=\tiny] at ([xshift=0.16cm] E.center) {MANY};
    
        % Create attributes inside A
        \node[draw, ellipse, thick, font=\scriptsize] (A1) at ([yshift=-0.66cm, xshift=-0.09cm] A.center) {Sentiment};
        \node[draw, ellipse, thick, font=\scriptsize] (A2) at ([xshift=-0.45cm, yshift=0.65cm] A.center) {Type};
        \node[draw, ellipse, thick, font=\scriptsize] (A3) at ([xshift=0.24cm, yshift=0cm] A.center) {Activity};
    
        % Create attributes inside AB1
        \node[draw, ellipse, font=\scriptsize, thick] (AB1_1) at ([yshift=-0.75cm] N.center) {Frequency};
        
        % Draw the lines to form the rhombus
        \draw (N) -- (E) -- (S) -- (W) -- cycle;
    
        % Create attributes inside B
        \node[draw, ellipse, thick, font=\tiny]  (B1) at ([xshift=0cm, yshift=-0.45cm] B.center) {Engagement};
        \node[draw, ellipse, thick, font=\scriptsize] (B2) at ([xshift=0.1cm, yshift=0.5cm] B.center) {Content};
      
        % Lines
        \draw (A) -- (W);
        \draw (E) -- (B);
    
        % Curved edge
        \path (A1) edge[bend right, thick, ->] (B1);
        \path (B2) edge[bend left, thick, ->] (B1);
        \path (AB1_1) edge[bend left, thick, ->] (A3);
        \path (AB1_1) edge[bend right, thick, ->] (B1);
        \path (A2) edge[bend right, thick, ->] (A1);
        \path (A2) edge[bend left, thick, ->] (A3);
    \end{tikzpicture}
    \caption{Example of a Relational Causal Model}
    \label{fig:model}
\end{figure}
\subsection{Partial Ancestral Graph}
The variables of a causal graph, $V \in G$, can be divided into three categories: observed (\textbf{O}), selection (\textbf{S}), and latent (\textbf{L}) variables, denoted as G(\textbf{O},\textbf{S},\textbf{L}). In this work, we focus on latent variables and assume there are no selection ones, i.e., $\boldsymbol{S} = \emptyset$.
We denote \textbf{Cond} as the set of conditional independence relations among variables in \textbf{O}, and define the equivalence class of graphs that meets the conditional independence \textit{O-Equiv}(\textbf{Cond}) as follows: for a graph G(\textbf{O},\textbf{L}) belonging to \textit{O-Equiv}(\textbf{Cond}), given three sets of variables \textbf{X}, \textbf{Y} and \textbf{Z}, G(\textbf{O},\textbf{L}) entails that \textbf{X} $\independent$ \textbf{Y} $\mid$ \textbf{Z} if and only if \textbf{X} $\independent$ \textbf{Y} $\mid$ \textbf{Z} $\in$ \textbf{Cond}. 

An \textit{ancestral graph} \citep{ZHANG20081873} is a causal graph that can be used to represent conditional independence and causal relations of a DAG with latent variables, using only the observed variables. A path $p$ between any two vertices $X,Y\in\textbf{O}$ is called an \textit{inducing path relative to} $\langle\textbf{L}\rangle$ if every non-endpoint vertex on $p$ is either in \textbf{L} or a collider, and every collider in $p$ is an ancestor vertex of either $X$ or $Y$. A path is called primitive when \textbf{L} is empty. A \textit{Maximal Ancestral Graph} (MAG) is an ancestral graph having no primitive inducing path between any two non-adjacent vertices.

The FCI algorithm learns a Markov equivalence class of a MAG called \textit{Partial Ancestral Graph} (PAG), with edges ends having three possible marks, ◦, -, $\textgreater$, which indicate the following relationships: [1] $A \rightarrow B$ implies that $A$ causes $B$; [2] $A \leftrightarrow B$ implies a common latent confounder between the two observed variables. 
An edge has an arrowhead $\textgreater$ or tail - mark between two variables if and only if all DAGs in \textit{O-Equiv}(\textbf{Cond}) share the same arrowhead (or tail) mark for those variables, i.e. the mark is \textit{invariant}. On the other hand, if there exist two DAGs with a different edge mark between two variables, the PAG will contain a ◦ mark, i.e. the mark is \textit{variant}. 
If every circle mark corresponds to an invariant in \textit{O-Equiv}(\textbf{Cond}), the PAG is called \textit{maximally informative} for the equivalence class. Examples of MAG and PAG are available in Appendix \ref{fci}. 

\subsection{Assumptions for Relational Causal Discovery}\label{assumptions}
In this subsection, we define and discuss some key assumptions used for causal discovery in relational data, including the maximum hop threshold, d-faithfulness, acyclicity, and absence of latent descendants for latent variables.
\begin{itemize}
    \item Maximum Hop Threshold (\(h\)): The maximum hop threshold defines the largest permissible path length (or number of relational hops) between entities in a relational causal model that will be considered when constructing causal dependencies. Setting \(h\) limits the computational complexity and ensures that the discovered relationships are both interpretable and relevant. For instance, in a social network, \(h = 2\) might capture direct friendships and friends-of-friends relationships while ignoring more distant connections.
    \item D-Faithfulness: D-faithfulness (Dependency-Faithfulness) posits that any conditional independence observed in the data is also represented in the underlying causal graph, and vice versa. This ensures that the causal relationships inferred from the data align with the observed statistical dependencies in the relational causal model.
    \item Acyclicity: Acyclicity mandates that the causal graph representing the relationships among variables and entities is a directed acyclic graph (DAG). This means there are no directed cycles in the relational causal model.
    \item 
    Absence of latent descendants for latent variables: For this work, we assume that latent variables cannot be descendants of each other, i.e. all the parents and children of a latent variable are observed. This assumption is standard in constraint-based latent variable models \citep{evans2016graphs}. \citet{spirtes1995causal} note that conclusions about the equivalence class over observed variables remain valid regardless of the causal relations among latent variables.
\end{itemize}

