\section{Results}

\begin{figure*}[t]
    \centering
    \includegraphics[width=\linewidth]{Figure/Figure3_key_gene.png}
    \caption{\textbf{Qualitative comparison for pivotal SGPP1 gene expression prediction.} Distribution of predicted SGPP1 expression over randomly selected 16~$\mu$m bins within a region of a WSI.}
    \label{fig:key_gene}
\end{figure*}

\subsection{Cross-Validation Evaluation}
We conducted four-fold cross-validation at the WSI level to benchmark MagNet against state-of-the-art (SOTA) methods on the two HD datasets. Table~\ref{table:comparison} summarizes the quantitative comparison across datasets and resolutions. MagNet outperforms existing methods on almost all metrics, and its advantage is most evident at the high-resolution HD levels. For example, on the 8~$\mu$m prediction task in our VUMC dataset, MagNet achieved MSE, MAE, and PCC values of 0.048±0.008, 0.109±0.008, and 0.278±0.042, respectively, compared with 0.063±0.006, 0.163±0.009, and 0.199±0.052 for BLEEP.
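
For reference, the three metrics can be written as follows, assuming their standard definitions. The symbols $y_i$ (measured expression), $\hat{y}_i$ (predicted expression), and $n$ (number of evaluated values) are introduced here for illustration only, and the order in which values are aggregated across genes and bins is not specified in this section.
\[
\mathrm{MSE} = \frac{1}{n}\sum_{i=1}^{n}\left(y_i-\hat{y}_i\right)^2,
\qquad
\mathrm{MAE} = \frac{1}{n}\sum_{i=1}^{n}\left|y_i-\hat{y}_i\right|,
\]
\[
\mathrm{PCC} = \frac{\sum_{i=1}^{n}\left(y_i-\bar{y}\right)\left(\hat{y}_i-\bar{\hat{y}}\right)}{\sqrt{\sum_{i=1}^{n}\left(y_i-\bar{y}\right)^2}\,\sqrt{\sum_{i=1}^{n}\left(\hat{y}_i-\bar{\hat{y}}\right)^2}},
\]
where $\bar{y}$ and $\bar{\hat{y}}$ denote the means of the measured and predicted values. Lower MSE and MAE and higher PCC indicate better prediction.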

These findings suggest that MagNet effectively addresses the information bottleneck inherent in high-resolution gene prediction tasks. By integrating multi-source and multi-level information, MagNet mitigates the performance limitations caused by constrained data and improves prediction accuracy on high-resolution HD data. Furthermore, the relatively low standard deviations observed across metrics during cross-validation indicate that the method is robust and stable, which supports its potential for practical clinical applications.

\begin{table}[bht]
\centering
\caption{\textbf{Ablation study for functional blocks in MagNet.} The benefits of the individual blocks are complementary, and MagNet achieves its best results when all modules are integrated.}
\resizebox{\textwidth}{!}{
\begin{tblr}{
  row{1} = {c},
  row{2} = {c},
  cell{1}{1} = {r=2}{},
  cell{1}{2} = {c=3}{},
  cell{1}{5} = {c=3}{},
  cell{3}{2} = {c},
  cell{3}{3} = {c},
  cell{3}{4} = {c},
  cell{3}{5} = {c},
  cell{3}{6} = {c},
  cell{3}{7} = {c},
  cell{4}{2} = {c},
  cell{4}{3} = {c},
  cell{4}{4} = {c},
  cell{4}{5} = {c},
  cell{4}{6} = {c},
  cell{4}{7} = {c},
  cell{5}{2} = {c},
  cell{5}{3} = {c},
  cell{5}{4} = {c},
  cell{5}{5} = {c},
  cell{5}{6} = {c},
  cell{5}{7} = {c},
  cell{6}{2} = {c},
  cell{6}{3} = {c},
  cell{6}{4} = {c},
  cell{6}{5} = {c},
  cell{6}{6} = {c},
  cell{6}{7} = {c},
  cell{7}{2} = {c},
  cell{7}{3} = {c},
  cell{7}{4} = {c},
  cell{7}{5} = {c},
  cell{7}{6} = {c},
  cell{7}{7} = {c},
  hline{1,3,8} = {-}{},
  hline{2} = {2-7}{},
}
Functional Blocks                 & VUMC (in-house dataset)/16~$\mu$m    &             &             & CRC~\cite{oliveira2024characterization}/16~$\mu$m    &             &             \\
                                  & MSE         & MAE         & PCC         & MSE         & MAE         & PCC         \\
w.o. GAT \&  Multi-resolution & 0.148±0.042 & 0.281±0.069 & 0.299±0.028 & 0.799±0.259 & 0.709±0.146 & 0.548±0.146 \\
w.o. GAT block      & 0.135±0.030 & 0.266±0.048 & 0.306±0.043 & 0.632±0.170 & 0.624±0.096 & 0.550±0.147 \\
w.o. Multi-resolution             & 0.133±0.030 & 0.260±0.051 & 0.323±0.044 & 0.634±0.175 & 0.628±0.111 & 0.563±0.152 \\
w.o. Consistency Loss             & 0.130±0.023 & 0.235±0.040 & 0.369±0.054 & 0.624±0.187 & 0.619±0.117 & 0.559±0.146 \\
w. All blocks       &  \textbf{0.127±0.024} & \textbf{0.228±0.034} & \textbf{0.378±0.057} & \textbf{0.564±0.184} & \textbf{0.581±0.114} & \textbf{0.574±0.154}
\end{tblr}   }
        \label{table:ablation}
\end{table}

\subsection{Pivotal Gene Expression Prediction}
We evaluated the clinical applicability of various baselines by analyzing the predictive performance for the key biomarker SGPP1 and the tubule-related gene DPEP1 in our kidney dataset at the 16~$\mu$m level. SGPP1 and DPEP1, together with their associated pathways, play critical roles in kidney health and disease, with direct implications for conditions such as acute kidney injury and fibrotic kidney diseases~\cite{drexler2021sphingosine, keller2024factors, lovric2017mutations}.

Figure~\ref{fig:key_gene} illustrates the predictive performance of different models for the SGPP1 gene. Among all compared models, MagNet achieved the lowest MSE of 0.051.
Additionally, we analyzed DPEP1 and SGPP1 predictions on WSIs from two samples in our VUMC dataset. MagNet achieved MSEs of 0.0544 / 0.0493 for SGPP1 / DPEP1 at the WSI level, substantially lower than those of other methods such as EGN (0.1605 / 0.1855) and BLEEP (0.1530 / 0.1126), further supporting its advantage in HD-level gene expression prediction. By integrating multi-level information, MagNet captures the spatial distribution of key gene expression in pathological tissues at higher resolution.
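
To put these WSI-level values in relative terms, the MSE reduction of MagNet with respect to a baseline can be computed as $1 - \mathrm{MSE}_{\mathrm{MagNet}} / \mathrm{MSE}_{\mathrm{baseline}}$. This ratio is an illustrative summary derived from the MSEs reported above, not an additional experimental result:
\[
\mathrm{SGPP1:}\quad 1 - \frac{0.0544}{0.1605} \approx 66.1\% \ (\mathrm{EGN}),
\qquad
1 - \frac{0.0544}{0.1530} \approx 64.4\% \ (\mathrm{BLEEP}),
\]
\[
\mathrm{DPEP1:}\quad 1 - \frac{0.0493}{0.1855} \approx 73.4\% \ (\mathrm{EGN}),
\qquad
1 - \frac{0.0493}{0.1126} \approx 56.2\% \ (\mathrm{BLEEP}).
\]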



\begin{table}[bht]
\centering
\caption{\textbf{Ablation study on high-resolution-level-only baseline.}}
\resizebox{\textwidth}{!}{
\begin{tblr}{
  cells = {},
  row{1} = {c},
  row{2} = {c},
  cell{1}{1} = {r=2}{},
  cell{1}{2} = {c=3}{},
  cell{1}{5} = {c=3}{},
  cell{3}{2} = {c},
  cell{3}{3} = {c},
  cell{3}{4} = {c},
  cell{3}{5} = {c},
  cell{3}{6} = {c},
  cell{3}{7} = {c},
  cell{4}{2} = {c},
  cell{4}{3} = {c},
  cell{4}{4} = {c},
  cell{4}{5} = {c},
  cell{4}{6} = {c},
  cell{4}{7} = {c},
  cell{5}{2} = {c},
  cell{5}{3} = {c},
  cell{5}{4} = {c},
  cell{5}{5} = {c},
  cell{5}{6} = {c},
  cell{5}{7} = {c},
  hline{1,3,6} = {-}{},
  hline{2} = {2-7}{},
}
Functional Blocks                 & VUMC (in-house dataset)/8~$\mu$m   &                      &                      & CRC~\cite{oliveira2024characterization}/8~$\mu$m              &                      &                      \\
                                  & MSE                           & MAE                  & PCC                  & MSE                  & MAE                  & PCC                  \\
w.o. GAT \& Multi-resolution      & 0.052±0.023                   & 0.146±0.059          & 0.180±0.039          & 0.281±0.084          & 0.395±0.079          & 0.512±0.156          \\
w.o. Multi-resolution             & 0.048±0.013                   & 0.137±0.030          & 0.159±0.025          & 0.276±0.075          & 0.387±0.076          & 0.540±0.162          \\
w. All blocks                     & \textbf{0.048±0.008}          & \textbf{0.109±0.008} & \textbf{0.278±0.042} & \textbf{0.271±0.054} & \textbf{0.375±0.053} & \textbf{0.541±0.167} 
\end{tblr}  }
\label{table:ablation_08}
\end{table}

\subsection{Ablation Study}
We conducted a detailed ablation study to evaluate the effectiveness of each functional block, as summarized in Table~\ref{table:ablation}, Table~\ref{table:ablation_08}, and Table~\ref{table:ablation_backbone}. The results in Table~\ref{table:ablation} and Table~\ref{table:ablation_08} show that incorporating the GAT-Transformer blocks and multi-resolution information compensates for the limited detail in the original bin-level data: relative to the variant without both components, the full model improves PCC by 0.079 on our VUMC dataset and 0.026 on the CRC dataset at 16~$\mu$m bins, and by 0.098 and 0.029, respectively, at 8~$\mu$m bins. Additionally, the consistency loss enhances the synergy of multi-resolution information, facilitating more effective learning of high-resolution features and further improving performance.
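
The PCC gains quoted above follow from the first and last rows of Table~\ref{table:ablation} and Table~\ref{table:ablation_08}, that is, the full model minus the variant without GAT blocks and multi-resolution information. The notation $\Delta\mathrm{PCC}$ is introduced here only for this check:
\[
\Delta\mathrm{PCC}_{16\,\mu\mathrm{m}}^{\mathrm{VUMC}} = 0.378 - 0.299 = 0.079,
\qquad
\Delta\mathrm{PCC}_{16\,\mu\mathrm{m}}^{\mathrm{CRC}} = 0.574 - 0.548 = 0.026,
\]
\[
\Delta\mathrm{PCC}_{8\,\mu\mathrm{m}}^{\mathrm{VUMC}} = 0.278 - 0.180 = 0.098,
\qquad
\Delta\mathrm{PCC}_{8\,\mu\mathrm{m}}^{\mathrm{CRC}} = 0.541 - 0.512 = 0.029.
\]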

We also investigated the pathology-specific foundation model UNI~\cite{chen2024uni} as the encoder for MagNet, with results summarized in Table~\ref{table:ablation_backbone}. Replacing ResNet50 with UNI led to a slight decline in performance on most metrics. One possible explanation is that UNI's larger model size constrained the subgraph size during training. To optimize computational efficiency, we process bin-level subgraphs iteratively, where the batch size determines the graph size. Under the same experimental conditions (one NVIDIA RTX A6000 GPU with 48~GB memory), UNI's larger parameter count reduced the batch size to 64, compared with 256 for ResNet50. This reduction in subgraph size limited the model's ability to capture sufficient contextual information from neighboring bins, which likely led to the observed performance degradation.



\begin{table}[bht]
\centering
\caption{\textbf{Ablation study on backbone selection for MagNet.} }
\resizebox{\textwidth}{!}{
\begin{tblr}{
  cells = {c},
  cell{1}{1} = {r=2}{},
  cell{1}{2} = {r=2}{},
  cell{1}{3} = {c=3}{},
  cell{1}{6} = {c=3}{},
  cell{3}{1} = {r=2}{},
  cell{5}{1} = {r=2}{},
  cell{7}{1} = {r=2}{},
  hline{1,3,5,7,9} = {-}{},
  hline{2} = {3-8}{},
}
Resolution  & Backbone  & VUMC (in-house dataset)   &                      &                      & CRC~\cite{oliveira2024characterization}                  &                      &                      \\
            &           & MSE                       & MAE                  & PCC                  & MSE                  & MAE                  & PCC                  \\
8$\mu$m/112px   & ResNet50  & 0.048±0.008      & \textbf{0.109±0.008} & \textbf{0.278±0.042} & \textbf{0.271±0.054} & \textbf{0.375±0.053} & \textbf{0.541±0.167} \\
            & UNI       & \textbf{0.047±0.006}      & 0.112±0.003          & 0.266±0.049          & 0.331±0.088          & 0.422±0.079          & 0.505±0.142          \\
16$\mu$m/112px  & ResNet50  & \textbf{0.127±0.024}      & \textbf{0.228±0.034} & \textbf{0.378±0.057} & \textbf{0.564±0.184} & \textbf{0.581±0.114} & \textbf{0.574±0.154} \\
            & UNI       & 0.131±0.022               & 0.239±0.037          & 0.364±0.054          & 0.638±0.181          & 0.617±0.116          & 0.556±0.112          \\
55$\mu$m/224px  & ResNet50  & \textbf{0.324±0.044}      & \textbf{0.458±0.030} & \textbf{0.611±0.082} & \textbf{0.688±0.149} & \textbf{0.612±0.069} & \textbf{0.670±0.059} \\
            & UNI       & 0.339±0.038               & 0.469±0.024          & 0.582±0.044          & 0.735±0.231          & 0.638±0.084          & 0.643±0.111          
\end{tblr}  }
        \label{table:ablation_backbone}
\end{table}