\section{Case Study}

\begin{table*}[t]
\centering
\begin{tabular}{|cl|lllrrrrr|}
\hline
\multicolumn{2}{|c|}{}                                                                  & MI                                       & cRCA                                     & RUN                                      & \multicolumn{1}{l}{BARO}                 & \multicolumn{1}{l}{RCD}                  & \multicolumn{1}{c}{\name-0} & \multicolumn{1}{c}{\name-1} & \multicolumn{1}{c|}{\begin{tabular}[c]{@{}c@{}}\name-\\ Expert\end{tabular}} \\ \hline
\multicolumn{1}{|c|}{\multirow{5}{*}{top-1}} & Carts                                    & 0.79                                     & 0.80                                     & 0.00                                     & 1.00                & 0.56                                     & 0.40                                       & 0.81                                       & 0.30                                                                                        \\
\multicolumn{1}{|c|}{}                       & Catalogue                                & 0.11                                     & 0.40                                     & 0.00                                     & 1.00                                     & 0.18                                     & 0.98                                       & 0.09                                       & 0.81                                                                                        \\
\multicolumn{1}{|c|}{}                       & Orders                                   & 0.36                                     & 0.40                                     & 0.00                                     & 1.00                                     & 0.68                                     & 0.57                                       & 0.37                                       & 0.96                                                                                        \\
\multicolumn{1}{|c|}{}                       & Payment                                  & 0.27                                     & 0.40                                     & 0.00                                     & 1.00                                     & 0.65                                     & 0.76                                       & 0.24                                       & 0.93                                                                                        \\
\multicolumn{1}{|c|}{}                       & User                                     & 1.00                                     & 1.00                                     & 0.00                                     & 1.00                                     & 1.00                                     & 0.56                                       & 1.00                                       & 0.87                                                                                        \\
\multicolumn{1}{|c|}{}                       & \cellcolor{gray!20}Avg. & \cellcolor{gray!20}0.51 & \cellcolor{gray!20}0.60 & \cellcolor{gray!20}0.00 & \cellcolor{gray!20}1.00 & \cellcolor{gray!20}0.61 & \cellcolor{gray!20}0.66   & \cellcolor{gray!20}0.50   & \cellcolor{gray!20}0.77                                                    \\ \hline
\multicolumn{1}{|c|}{\multirow{5}{*}{top-3}} & Carts                                    & 1.00                                     & 0.80                                     & 0.42                                     & 1.00                                     & 0.87                                     & 0.42                                       & 1.00                                       & 1.00                                                                                        \\
\multicolumn{1}{|c|}{}                       & Catalogue                                & 0.92                                     & 0.60                                     & 0.39                                     & 1.00                                     & 0.47                                     & 1.00                                       & 0.45                                       & 1.00                                                                                        \\
\multicolumn{1}{|c|}{}                       & Orders                                   & 1.00                                     & 0.40                                     & 0.07                                     & 1.00                                     & 0.92                                     & 0.6                                       & 1.00                                       & 1.00                                                                                        \\
\multicolumn{1}{|c|}{}                       & Payment                                  & 1.00                                     & 0.40                                     & 0.14                                     & 1.00                                     & 0.88                                     & 0.76                                       & 1.00                                       & 1.00                                                                                        \\
\multicolumn{1}{|c|}{}                       & User                                     & 1.00                                     & 1.00                                     & 0.06                                     & 1.00                                     & 1.00                                     & 0.64                                       & 1.00                                       & 1.00                                                                                        \\
\multicolumn{1}{|c|}{}                       & \cellcolor{gray!20}Avg. & \cellcolor{gray!20}0.98 & \cellcolor{gray!20}0.64 & \cellcolor{gray!20}0.22 & \cellcolor{gray!20}1.00 & \cellcolor{gray!20}0.77 & \cellcolor{gray!20}0.68   & \cellcolor{gray!20}0.89   & \cellcolor{gray!20}1.00                                                    \\ \hline
\multicolumn{1}{|c|}{\multirow{5}{*}{top-5}} & Carts                                    & 1.00                                     & 1.00                                     & 0.62                                     & 1.00                                     & 0.82                                     & 0.42                                       & 1.00                                       & 1.00                                                                                        \\
\multicolumn{1}{|c|}{}                       & Catalogue                                & 0.93                                     & 0.60                                     & 0.58                                     & 1.00                                     & 0.51                                     & 1.00                                       & 0.58                                       & 1.00                                                                                        \\
\multicolumn{1}{|c|}{}                       & Orders                                   & 1.00                                     & 0.60                                     & 0.15                                     & 1.00                                     & 0.87                                     & 0.62                                       & 1.00                                       & 1.00                                                                                        \\
\multicolumn{1}{|c|}{}                       & Payment                                  & 1.00                                     & 0.40                                     & 0.20                                     & 1.00                                     & 0.86                                     & 0.76                                       & 1.00                                       & 1.00                                                                                        \\
\multicolumn{1}{|c|}{}                       & User                                     & 1.00                                     & 1.00                                     & 0.11                                     & 1.00                                     & 1.00                                     & 0.70                                       & 1.00                                       & 1.00                                                                                        \\
\multicolumn{1}{|c|}{}                       & \cellcolor{gray!20}Avg. & \cellcolor{gray!20}0.99 & \cellcolor{gray!20}0.72 & \cellcolor{gray!20}0.33 & \cellcolor{gray!20}1.00 & \cellcolor{gray!20}0.81 & \cellcolor{gray!20}0.70   & \cellcolor{gray!20}0.92   & \cellcolor{gray!20}1.00                                                    \\ \hline
\end{tabular}
\caption{The table shows the top-$l$ accuracy of different baselines on the data collected from the Sock-shop application after injecting a CPU hog into a given microservice.}
\label{tab:sockshop-cpu}
\end{table*}


% \begin{table*}[t]
% \scriptsize
% \centering
% \begin{tabular}{|cl|lllrrrrrr|}
% \hline
% \multicolumn{2}{|c|}{}                                                                  & MI                                       & cRCA                                     & RUN                                      & \multicolumn{1}{l}{BARO}                 & \multicolumn{1}{l}{RCD}                  & \multicolumn{1}{c}{\name-0} & \multicolumn{1}{c}{\name-C} & \multicolumn{1}{c}{\name-1} & \multicolumn{1}{c|}{\begin{tabular}[c]{@{}c@{}}\name-\\ Expert\end{tabular}} \\ \hline
% \multicolumn{1}{|c|}{\multirow{5}{*}{top-1}} & Carts                                    & 0.79                                     & 0.80                                     & 0.00                                     & 1.00                & 0.56                                     & 0.40                                       & 0.84                                       & 0.81                                       & 0.30                                                                                        \\
% \multicolumn{1}{|c|}{}                       & Catalogue                                & 0.11                                     & 0.40                                     & 0.00                                     & 1.00                                     & 0.18                                     & 0.98                                       & 0.20                                       & 0.09                                       & 0.81                                                                                        \\
% \multicolumn{1}{|c|}{}                       & Orders                                   & 0.36                                     & 0.40                                     & 0.00                                     & 1.00                                     & 0.68                                     & 0.57                                       & 0.51                                       & 0.37                                       & 0.96                                                                                        \\
% \multicolumn{1}{|c|}{}                       & Payment                                  & 0.27                                     & 0.40                                     & 0.00                                     & 1.00                                     & 0.65                                     & 0.76                                       & 0.28                                       & 0.24                                       & 0.93                                                                                        \\
% \multicolumn{1}{|c|}{}                       & User                                     & 1.00                                     & 1.00                                     & 0.00                                     & 1.00                                     & 1.00                                     & 0.56                                       & 1.00                                       & 1.00                                       & 0.87                                                                                        \\
% \multicolumn{1}{|c|}{}                       & \cellcolor{gray!20}Avg. & \cellcolor{gray!20}0.51 & \cellcolor{gray!20}0.60 & \cellcolor{gray!20}0.00 & \cellcolor{gray!20}1.00 & \cellcolor{gray!20}0.61 & \cellcolor{gray!20}0.66   & \cellcolor{gray!20}0.57   & \cellcolor{gray!20}0.50   & \cellcolor{gray!20}0.77                                                    \\ \hline
% \multicolumn{1}{|c|}{\multirow{5}{*}{top-3}} & Carts                                    & 1.00                                     & 0.80                                     & 0.42                                     & 1.00                                     & 0.87                                     & 0.42                                       & 1.00                                       & 1.00                                       & 1.00                                                                                        \\
% \multicolumn{1}{|c|}{}                       & Catalogue                                & 0.92                                     & 0.60                                     & 0.39                                     & 1.00                                     & 0.47                                     & 1.00                                       & 0.64                                       & 0.45                                       & 1.00                                                                                        \\
% \multicolumn{1}{|c|}{}                       & Orders                                   & 1.00                                     & 0.40                                     & 0.07                                     & 1.00                                     & 0.92                                     & 0.6                                       & 1.00                                       & 1.00                                       & 1.00                                                                                        \\
% \multicolumn{1}{|c|}{}                       & Payment                                  & 1.00                                     & 0.40                                     & 0.14                                     & 1.00                                     & 0.88                                     & 0.76                                       & 1.00                                       & 1.00                                       & 1.00                                                                                        \\
% \multicolumn{1}{|c|}{}                       & User                                     & 1.00                                     & 1.00                                     & 0.06                                     & 1.00                                     & 1.00                                     & 0.64                                       & 1.00                                       & 1.00                                       & 1.00                                                                                        \\
% \multicolumn{1}{|c|}{}                       & \cellcolor{gray!20}Avg. & \cellcolor{gray!20}0.98 & \cellcolor{gray!20}0.64 & \cellcolor{gray!20}0.22 & \cellcolor{gray!20}1.00 & \cellcolor{gray!20}0.77 & \cellcolor{gray!20}0.68   & \cellcolor{gray!20}0.93   & \cellcolor{gray!20}0.89   & \cellcolor{gray!20}1.00                                                    \\ \hline
% \multicolumn{1}{|c|}{\multirow{5}{*}{top-5}} & Carts                                    & 1.00                                     & 1.00                                     & 0.62                                     & 1.00                                     & 0.82                                     & 0.42                                       & 1.00                                       & 1.00                                       & 1.00                                                                                        \\
% \multicolumn{1}{|c|}{}                       & Catalogue                                & 0.93                                     & 0.60                                     & 0.58                                     & 1.00                                     & 0.51                                     & 1.00                                       & 0.70                                       & 0.58                                       & 1.00                                                                                        \\
% \multicolumn{1}{|c|}{}                       & Orders                                   & 1.00                                     & 0.60                                     & 0.15                                     & 1.00                                     & 0.87                                     & 0.62                                       & 1.00                                       & 1.00                                       & 1.00                                                                                        \\
% \multicolumn{1}{|c|}{}                       & Payment                                  & 1.00                                     & 0.40                                     & 0.20                                     & 1.00                                     & 0.86                                     & 0.76                                       & 1.00                                       & 1.00                                       & 1.00                                                                                        \\
% \multicolumn{1}{|c|}{}                       & User                                     & 1.00                                     & 1.00                                     & 0.11                                     & 1.00                                     & 1.00                                     & 0.70                                       & 1.00                                       & 1.00                                       & 1.00                                                                                        \\
% \multicolumn{1}{|c|}{}                       & \cellcolor{gray!20}Avg. & \cellcolor{gray!20}0.99 & \cellcolor{gray!20}0.72 & \cellcolor{gray!20}0.33 & \cellcolor{gray!20}1.00 & \cellcolor{gray!20}0.81 & \cellcolor{gray!20}0.70   & \cellcolor{gray!20}0.94   & \cellcolor{gray!20}0.92   & \cellcolor{gray!20}1.00                                                    \\ \hline
% \end{tabular}
% \caption{The table shows the top-$l$ accuracy of different baselines on the data collected from sock-shop application after injecting CPU hog to a given microservice.}
% \label{tab:sockshop-cpu}
% \end{table*}

\textbf{Sock-shop.} This section demonstrates the effectiveness of \name using the Sock-shop application, a microservice-based replica of a web application for selling socks. The system consists of 13 microservices, with 5 being the most critical and user-facing. Although Sock-shop is microservice-based, our method remains system-agnostic. We used the dataset from~\citet{ikram2022root}, which includes two failure types: CPU hog and memory leak. The dataset contains 50 instances, each running for 5 minutes in both normal and failure conditions. We repeated each experiment 50 times and report the mean top-$l$ accuracy.

For Sock-shop scenario, we considered state-of-the-art RCA baselines, including causalRCA~\citep{xin2023causalrca} (shown as cRCA), RUN~\citep{lin2024root}, BARO~\citep{pham2024baro}, and RCD. \rev{To demonstrate the utility of the extended version of Algorithm \ref{alg:sample-version} (see Algorithm \ref{alg:extended-version} in Appendix \ref{app:algorithms}), we used a recent causal discovery algorithm $k$-PC ~\cite{kocaoglu2023characterization}, which learns the graph using a restricted conditioning set. Note that our algorithm only requires a partial causal graph, and $k$-PC was only chosen for its ability to constrain the separating set size, helping to prevent faithfulness violations in real data.} We use postfixes to indicate how $k$ was chosen. For example, \name-$k$ refers to using all conditioning sets up to size $k$, where $k \in \{0,1\}$. Additionally, we construct a causal graph based on the system's call graph and use that as the input to Algorithm \ref{alg:sample-version}. We denote this algorithm by \name-Expert, which uses expert knowledge for identifying the root cause. In this case, the possible parent set in Algorithm \ref{alg:sample-version} is the parent set indicated by the DAG structure. 

Table~\ref{tab:sockshop-cpu} shows the top-$l$ accuracy of \name with different baselines for CPU hog failure. We report the results for the memory leak failure in the Appendix~\ref{app:sockshop-result}. The results align with those from our synthetic data experiments in Section~\ref{sec:experiments}. Notably, RCD and \name-0 perform similarly because, with $k=0$, $k$-PC is limited to marginal CI tests, producing a dense $k$-essential graph (see Appendix \ref{app:extension_discussion}. This leads to a larger set of possible parent nodes, forcing \name to condition on more variables, which can obscure the true root cause. However, when $k=1$, \name outperforms RCD as $k$-PC is allowed to use separating sets of size up to one. Increasing $k$ improves graph learning but demands larger sample sizes for reliable CI tests. Additionally, the system call graph shows that \name achieves high accuracy when a high-quality causal graph is learned from observational data.

Notably, the performance of BARO, which achieves high top-1 accuracy on the Sock-shop dataset for both failure types. However, this contradicts our earlier experiments in Section~\ref{sec:experiments}. As we show in the next section with real-world data, BARO performs poorly. This discrepancy suggests that BARO may be overfitting to the Sock-shop dataset.
% However, it disregards the correct causal order in the data-generating mechanism and is limited to continuous data. As shown in Figure~\ref{fig:top-l-accuracy},
% As shown in Appendix~\ref{app:linear_gaussian_model_comparison}, BARO yields suboptimal results even with continuous data compared to \name, which utilizes correct causal knowledge.



\setlength{\tabcolsep}{4pt} % Adjust the column padding here to reduce whitespace

\begin{table}
    \centering
        \begin{tabular}{c c c c c c c}
        \toprule
        Outage & \name-0 & \name-1 & \name-2 & MI & BARO & RCD\\
        \midrule
        A      & \textbf{7}       & -       & -       & -  & 9   & - \\
        B      & \textbf{1}       & 6       & -       & 9  & 6 &-   \\
        C      & \textbf{1}       & \textbf{1}       & \textbf{1}       & \textbf{1}  & 8  & \textbf{1}  \\
        D      & 5       & 5       & 6       & 3  & -  & \textbf{2}  \\
        \bottomrule
        \end{tabular}
    \caption{Rank of the root cause among the top 10 nodes for each baseline, with a rank of 1 indicating the highest-ranked node and a dash indicating the root cause was not found. \name-0 consistently outperforms BARO and performs comparably with MI, but higher values of $k$ lead to a less reliable causal graph and decreased consistency. }
    \label{tab:outage-result}
\end{table}

\begin{table}[h!]
    \centering
        \begin{tabular}{c c c c c}
        \toprule
        Outage & Nodes & \begin{tabular}[c]{@{}c@{}}Normal\\ Samples\end{tabular} & \begin{tabular}[c]{@{}c@{}}Failure\\ Samples\end{tabular} & \begin{tabular}[c]{@{}c@{}}Duration\\ (hours)\end{tabular} \\
        \midrule
        A      & 152   & 4783                       & 918                         & 15        \\
        B      & 141   & 4626                       & 1217                        & 20        \\
        C      & 149   & 3464                       & 110                         & 2        \\
        D      & 146   & 7165                       & 567                         & 5        \\
        \bottomrule
        \end{tabular}
    \caption{Summary of outages from a real-world production application.}
    \label{tab:outage-summary}
\end{table}


\textbf{Real Datasets.} We collected data from a real-world production application from January to July 2024, during which four outages were reported. For each incident, Software Reliability Engineers (SREs) documented key details, including outage duration, detection time, resolution method, and root cause (see Table~\ref{tab:outage-summary}). We presented the SREs with the top 10 ranked nodes from each baseline and asked them to confirm if the true root cause was among them. We report the rank of the root cause for each incident, where a lower rank indicates better performance by the method.

Table~\ref{tab:outage-result} compares the performance of MI, BARO, RCD, and~\name on the real-world dataset. The results show that \name-0 consistently outperforms BARO and matches closely with MI's performance in outages C and D. MI's strong performance may stem from data inequality processing, but it breaks down when confounders ext between root causes and their descendants. In outage B, \name-0 ranks the root cause first, while MI places it ninth. For outage A, \name-0 identifies the root cause within the top 10, whereas MI does not. BARO often ranked the root cause near the bottom and failed to identify it entirely in one case, highlighting the limitations of methods focused solely on detecting noticeable changes. This also underscores the drawbacks of relying on a single point of the distribution (such as the median), which may not accurately capture the shift between the two distributions.
% \kenneth{I think the limitation of BARO is its reliance on the median of the distribution, which may not capture the shift between two distributions accurately.} 
RCD performs well on outages C and D, but it took approximately 25 minutes to produce results, whereas \name completed in 10 seconds. We also find that increasing $k$ did not consistently improve accuracy by comparing \name-0 with \name-1 and \name-2. In some cases, accuracy declined due to less reliable CI tests with larger separating sets, leading to incorrect parent node conditioning and inaccurate rankings. This aligns with our insight that relying on high-order CI tests with small interventional samples is not ideal. In the additional experiment in Appendix \ref{app:rcd-higher-order-ci}, we demonstrate that RCD heavily relies on high-order CI tests after failure, which compromises its performance. In contrast, our extended version of RCG (see Algorithm \ref{alg:extended-version}) leverages a partial causal structure learned from observed data using CI tests with small conditioning sets and requires only marginal invariance tests after failure. 

For \name, we note that there is a trade-off between the time complexity of computing CMI and the granularity of the input structure provided to \name if we take the time to obtain the input into account. For example, when using the PC algorithm \cite{spirtes2000causation} to construct a CPDAG as input, the resulting graph is typically sparser than a $k$-essential graph, a graphical representation that only uses CI tests up to order $k$ to construct, leading to more efficient CMI computations due to smaller conditioning sets. In contrast, a $k$-essential graph often results in larger conditioning sets, making CMI computations more costly. This efficiency gap, however, is not obvious when CPDAGs are obtained from other discovery algorithms besides PC. 

% Balancing accuracy and informativeness in CI tests is challenging in real-world applications. To address this, we propose $RCG^{\star}$, which integrates the results across different values of $k$ rather than selecting a single one. For instance, the top nodes from \name-0 can be combined with those from \name-1 until reaching $l$ nodes. However, we leave the exploration of this approach for future work.

