\section{Challenges of Conformal Prediction on Federated Graphs} \label{sec:challenges}

Conformal Prediction on federated graphs faces several challenges that need to be addressed to ensure its applicability and effectiveness in real-world applications. In this section, we elaborate on these challenges. In Section \ref{sec:method}, we discuss how we address them.

\textit{Exchangeability:} A significant challenge in federated graph CP is the violation of the exchangeability principle, which traditional CP methods rely upon \citep{vovk2005algorithmic}. Consider a federated graph learning setting where nodes of the overall graph \( \mathcal{V} \) are partitioned into training, validation, calibration, and test sets as \( \mathcal{V}_{\text{train}}, \mathcal{V}_{\text{valid}}, \mathcal{V}_{\text{calib}}, \) and \( \mathcal{V}_{\text{test}} \). These methods presuppose that the distributions of calibration nodes \( \mathcal{V}_{\text{calib}} \) and test nodes \( \mathcal{V}_{\text{test}} \) are exchangeable during inference, meaning their joint distribution remains unchanged when samples are permuted. This assumption breaks down in federated graph settings for two primary reasons.

First, inherent dependencies among nodes due to their connectivities violate exchangeability if the test data is not present during training. Secondly, the distribution of graph data across different clients in a federated setting tends to vary, leading to non-exchangeable distributions. Specifically, the sets \( \mathcal{V}_{\text{calib}} \) and \( \mathcal{V}_{\text{test}} \) are not exchangeable, as their respective subsets \( \mathcal{V}_{\text{calib}}^{(k)} \) and \( \mathcal{V}_{\text{test}}^{(k')} \) may originate from distinct clients (\( k \neq k' \)). This variability underscores the challenges in assuming uniform data distribution across clients. For example, hospitals specializing in certain medical fields might predominantly treat patients from specific demographic groups, leading to skewed data distributions. Similarly, graph partitioning algorithms like METIS \citep{karypis1997metis}, used for simulating subgraph FL scenarios, aim to minimize edge cuts across partitions, often resulting in subgraphs that do not share the same data distribution.

\begin{table}[h]
\caption{Number of partitions (\( K \)) and its impact on missing edges (\( \Delta E \)) and average conformal prediction set sizes (\( \Delta |CP| \)) across clients. Larger CP set sizes result from both the local training of models and conformal predictors, as well as the increasing number of missing links in client subgraphs.}  

\label{tab:missing-links}
\centering
\scalebox{0.75}{
\begin{tabular}{cccccc}
\toprule
Dataset & \( |E| \) & \( K \) & \( \Delta E \) & \( \Delta E \% \) & \(\Delta |CP|\%\)\\
\midrule
\multirow{3}{*}{Cora} & \multirow{3}{*}{10,138} 
& 5 & 604 & 5.96\% & 34.7\%\\
& & 10 & 806 & 7.95\% & 43.3\%\\
& & 20 & 1,230 & 12.13\% & 48.1\%\\
\midrule
\multirow{3}{*}{CiteSeer} & \multirow{3}{*}{7,358} 
& 5 & 310 & 4.21\% & 54.0\%\\
& & 10 & 608 & 8.26\% & 57.7\%\\
& & 20 & 848 & 11.52\% & 62.3\%\\
\bottomrule
\end{tabular}
}
\end{table}

\textit{Missing Neighbor Information:} Another significant challenge in federated graph CP is the presence of missing neighbor information across client subgraphs. Consider a scenario where a patient visits multiple hospitals within the same city, maintaining separate records at each location. Due to conflicts of interest, it is impractical for hospitals to share their patient networks, leading to incomplete edge information in the overall graph. In simulations of federated learning based on graph partitioning, increasing the number of clients amplifies the number of missing links between them, as shown in Table~\ref{tab:missing-links}.

These missing edges, which carry critical neighborhood information, remain uncaptured by any single client subgraph. This absence becomes particularly problematic when CP techniques are applied to partitioned graph data, as it can impair model performance and increase the size of prediction sets due to insufficient coverage of the data's connectivity. Figure~\ref{fig:set-size} illustrates this issue, showing how the increasing number of missing links correlates with larger prediction set sizes through empirical evaluation.

Given these complexities, it is necessary to demonstrate how CP can be applied to non-exchangeable graph data and how the inefficiency caused by missing neighbor information can be mitigated within federated graph environments.

\begin{figure}[htbp]
    \centering
        \includegraphics[width=0.36\textwidth]{figures/heat_map.png}
    \caption{Effect of the number of clients on CP set size for the Cora dataset.}
    \label{fig:set-size}
\end{figure}
