\section{Experimental Results} 
\subsection{Setup}\label{sec:setup}
Our RelFCI algorithm implementation\footnote{Code available at \url{https://github.com/edgeslab/RelFCI}.}
is based on the RFCI algorithm \cite{colombo2012learning} rather than FCI. 

RFCI performs significantly fewer conditional independence tests than other FCI variants. Although it has not been proven complete, experiments show that it achieves similar accuracy in edge orientation.
We generate synthetic data using a procedure similar to that of \cite{maier2013sound}, with the addition of latent variables in the schema and model. We generate 1000 random LRCMs from randomly generated schemas for each combination of the following settings: number of entities $n\in [2,4]$; $n-1$ relationships with randomly selected cardinalities; number of attributes per item drawn from $\mathrm{Pois}(\lambda=1) + 1$; and number of relational dependencies (6, 8, 10, or 12), limited by a hop threshold of 2. We additionally require the presence of one or two latent attributes, chosen at random from the attributes of relational variables in the LAGG that appear as the cause variable in at least two dependencies. This process yields a total of 22,000 synthetic models. We use an oracle to perform the conditional independence tests for RelFCI and RCD for all possible perspectives. The results are then averaged for every combination, i.e., over the 1000 different LRCMs sharing the same properties.
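
As a rough illustration of two steps in this procedure, the following Python sketch samples the number of attributes per item as $\mathrm{Pois}(1)+1$ and picks latent attributes among those that act as a cause in at least two dependencies. It is a simplification and not the generator used in our experiments: dependencies are reduced to flat (cause, effect) pairs without relational paths or the LAGG, and the helper names \texttt{sample\_num\_attributes} and \texttt{choose\_latents} are hypothetical.

```python
import math
import random
from collections import Counter


def sample_poisson(lam, rng):
    """Draw one Poisson(lam) sample using Knuth's multiplication method."""
    threshold = math.exp(-lam)
    k, p = 0, 1.0
    while True:
        p *= rng.random()
        if p <= threshold:
            return k
        k += 1


def sample_num_attributes(rng, lam=1.0):
    """Attributes per item: Pois(lam) + 1, so every item has at least one."""
    return sample_poisson(lam, rng) + 1


def choose_latents(dependencies, num_latents, rng):
    """Pick latent attributes among causes of at least two dependencies.

    `dependencies` is a list of (cause, effect) attribute names; this flat
    representation is an assumption made for illustration only.
    Returns None when there are too few candidates.
    """
    cause_counts = Counter(cause for cause, _ in dependencies)
    candidates = sorted(a for a, c in cause_counts.items() if c >= 2)
    if len(candidates) < num_latents:
        return None
    return rng.sample(candidates, num_latents)


rng = random.Random(0)
attribute_counts = [sample_num_attributes(rng) for _ in range(5)]
toy_dependencies = [("A.x", "B.y"), ("A.x", "C.z"), ("B.y", "C.z")]
latents = choose_latents(toy_dependencies, 1, rng)
```

In this toy example only \texttt{A.x} is the cause of two dependencies, so it is the only admissible latent attribute; a request for two latents returns \texttt{None}, which a generator could treat as a signal to resample the model.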

\subsection{Evaluation}
We evaluate our work by comparing the model derived from the algorithm's dependencies to the ground truth.
We define the ground truth as the latent relational causal model obtained by replacing each latent variable with double-arrowhead edges, using the same Maximal Ancestral Graph construction approach as presented in \citet{ZHANG20081873}.
We label a missing edge as a false negative, an additional edge as a false positive, and a correct edge as a true positive, and then compute precision and recall. Furthermore, to assess the necessity of new rules for relational causal discovery, we also measure the frequency with which each rule was invoked during the RelFCI runs. This last result can be found in Appendix \ref{res}.
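For reference, writing $\mathit{TP}$, $\mathit{FP}$, and $\mathit{FN}$ for the numbers of true positive, false positive, and false negative edges (notation introduced here for convenience), precision and recall follow the standard definitions:
\[
\mathrm{precision} = \frac{\mathit{TP}}{\mathit{TP} + \mathit{FP}},
\qquad
\mathrm{recall} = \frac{\mathit{TP}}{\mathit{TP} + \mathit{FN}}.
\]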
\subsection{Results}
Figure \ref{fig:pre-rec-tot} compares the precision and recall of RelFCI and RCD. The results show a clear gap between the two methods. This difference arises from the latent variables, which RCD fails to handle effectively. As previously discussed, hidden confounders violate RCD's core assumptions and significantly degrade its accuracy.
In contrast, our proposed method, RelFCI, is designed to be sound and complete in the presence of latent variables. Since our implementation builds on RFCI, which can sometimes introduce spurious edges or omit true ones, we expect RelFCI's precision and recall to be slightly below one, consistent with \citet{colombo2012learning}. Furthermore, RelFCI exhibits a smaller variance than RCD. This indicates that RelFCI produces more consistent and reliable results across different conditions, reinforcing its robustness in handling latent variables.

Figure \ref{fig:pre-rec-lat} further illustrates how performance changes, with either one or two latent variables, as the number of entities and dependencies increases. A key observation is that while RCD's performance improves slightly as the number of entities and dependencies grows, its precision and recall remain consistently lower than those of RelFCI. The improvement is most noticeable in recall, suggesting that RCD benefits marginally from increased structural complexity. RelFCI, by contrast, maintains stable and high precision and recall across all conditions. These findings highlight the robustness of RelFCI in handling relational datasets with latent variables, where RCD struggles to achieve comparable accuracy.

As an additional analysis, we evaluated the distribution of rule activations in RelFCI over all synthetic runs. Rules unique to FCI account for approximately one-third of all orientations. This suggests that latent confounders affect the entire model structure during the learning process. The distribution of rule activations is shown in Figure \ref{fig:rule-distr}.

\begin{figure}[ht]
    \centering
    \includegraphics[width=0.45\textwidth]{imgs/plot_rules_distr.png}
    \caption{Distribution of rule activations in RelFCI, split between RCD and FCI rules.}
    \label{fig:rule-distr}
\end{figure}
