\section{Data and Experiment}
\textbf{Dataset.} We benchmarked MagNet and the baseline models on a privately collected kidney pathology dataset (VUMC) and a publicly available colorectal cancer (CRC) dataset~\cite{oliveira2024characterization}, using four-fold cross-validation at the WSI level. Our in-house dataset contains 12 HD ST samples at three resolutions: 2~$\mu$m, 8~$\mu$m, and 16~$\mu$m, where 1~px in the WSI corresponds to 0.25~$\mu$m of real tissue. The CRC dataset consists of four samples with a single-layer section, including two CRC tissues and two adjacent normal tissues. The process has been approved by the Institutional Review Board (IRB).
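As a quick check of scale, and assuming the stated 0.25~$\mu$m per pixel holds uniformly across the in-house WSIs, a bin of size $b$ (in $\mu$m) spans $b/0.25 = 4b$ pixels per side:
\[
2\,\mu\mathrm{m} \mapsto 8\,\mathrm{px}, \qquad 8\,\mu\mathrm{m} \mapsto 32\,\mathrm{px}, \qquad 16\,\mu\mathrm{m} \mapsto 64\,\mathrm{px}.
\]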


\noindent\textbf{Data Preprocessing.}
For each WSI, 6,000 bins were randomly selected, and 112$\times$112 pixel patches centered at the 8~$\mu$m and 16~$\mu$m bins were cropped. At the spot and region levels, patches with diameters of 224 and 512 pixels, respectively, were extracted across the WSI, with their gene expressions aggregated from bin-level data. For each WSI, 2,500 spot-level patches were selected for training and testing. Patches were paired across levels based on the distance between their coordinates at the different resolutions. Following ST-Net~\cite{he2020integrating}, we selected the 250 genes with the highest average expression levels from more than 20,000 original genes as prediction targets. Gene expression values were normalized using the approach introduced in TRIPLEX~\cite{chung2024accurate}, which applies proportional normalization followed by a log transformation.
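To make the normalization step concrete, one common form of proportional normalization followed by a log transformation is sketched here. The scale factor $s$ and the pseudo-count of 1 are assumptions made for illustration; the exact constants are those of TRIPLEX~\cite{chung2024accurate} and are not restated in this section. Let $c_{ig}$ denote the raw count of gene $g$ in patch $i$:
\[
x_{ig} = \log\!\left(1 + s \cdot \frac{c_{ig}}{\sum_{g'} c_{ig'}}\right).
\]
Under this form, the counts of each patch are first rescaled to a common total $s$, which removes differences in overall capture depth between patches, and the logarithm then compresses the dynamic range of highly expressed genes. As a side note on patch scale, if the 0.25~$\mu$m per pixel of the in-house WSIs is assumed, patches of 112, 224, and 512 pixels cover roughly 28, 56, and 128~$\mu$m of tissue, respectively.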


\begin{table*}[t]
\centering
\caption{\textbf{Quantitative comparisons across different datasets.} 
The best performance is highlighted in \textbf{bold}. \texttt{MagNet} outperforms the state-of-the-art methods at multiple resolutions.}
\resizebox{\textwidth}{!}{
\centering
\begin{tblr}{
  cells = {c},
  row{8} = {},
  row{15} = {},
  row{22} = {},
  cell{1}{1} = {r=2}{},
  cell{1}{2} = {r=2}{},
  cell{1}{3} = {c=3}{},
  cell{1}{6} = {c=3}{},
  cell{3}{1} = {r=7}{},
  cell{10}{1} = {r=7}{},
  cell{17}{1} = {r=7}{},
  vline{2-3} = {3-9,10-16,17-23}{},
  vline{3} = {3-9,10-16,17-23}{},
  hline{1,3,10,17,24} = {-}{},
  hline{2} = {3-8}{},
}
Resolution & Model         & VUMC (in-house dataset) &                      &                      & CRC~\cite{oliveira2024characterization}                  &                      &                      \\
           &               & MSE                     & MAE                  & PCC                  & MSE                  & MAE                  & PCC                  \\
8~$\mu$m/112~px & ST-Net        & 0.193±0.004             & 0.388±0.009          & 0.226±0.040          & 0.292±0.076          & 0.402±0.084          & 0.527±0.155          \\
           & EGN           & 0.048±0.011             & 0.134±0.020          & 0.157±0.024          & 0.409±0.164          & 0.508±0.139          & 0.511±0.152          \\
           & HisToGene     & 0.105±0.007             & 0.241±0.006          & 0.109±0.018          & 0.311±0.088          & 0.419±0.075          & 0.451±0.128          \\
           & BLEEP         & 0.063±0.006             & 0.163±0.009          & 0.199±0.052          & 0.348±0.041          & 0.440±0.036          & 0.475±0.138          \\
           & His2ST        & 0.140±0.019             & 0.358±0.026          & 0.175±0.033          & 0.287±0.113          & 0.404±0.109          & 0.537±0.165          \\
           & TRIPLEX       & 0.151±0.152             & 0.286±0.180          & 0.107±0.059          & 0.291±0.110          & 0.397±0.069          & 0.498±0.167          \\
           & MagNet (Ours) & \textbf{0.048±0.008}    & \textbf{0.109±0.008} & \textbf{0.278±0.042} & \textbf{0.271±0.054} & \textbf{0.375±0.053} & \textbf{0.541±0.167} \\
16~$\mu$m/112~px & ST-Net        & 0.288±0.007             & 0.420±0.027          & 0.364±0.054          & 0.661±0.239          & 0.632±0.146          & 0.560±0.151          \\
           & EGN           & 0.149±0.037             & 0.302±0.060          & 0.308±0.037          & 0.740±0.024          & 0.677±0.013          & 0.552±0.014          \\
           & HisToGene     & 0.204±0.045             & 0.380±0.052          & 0.243±0.035          & 0.660±0.176          & 0.637±0.099          & 0.522±0.136          \\
           & BLEEP         & 0.174±0.029             & 0.290±0.031          & 0.317±0.058          & 0.673±0.161          & 0.625±0.088          & 0.504±0.123          \\
           & His2ST        & 0.224±0.044             & 0.427±0.049          & 0.330±0.046          & 0.610±0.168          & 0.611±0.103          & 0.562±0.152          \\
           & TRIPLEX       & 0.211±0.079             & 0.331±0.089          & 0.310±0.079          & 0.632±0.123          & 0.618±0.080          & 0.412±0.134          \\
           & MagNet (Ours) & \textbf{0.127±0.024}    & \textbf{0.228±0.034} & \textbf{0.378±0.057} & \textbf{0.564±0.184} & \textbf{0.581±0.114} & \textbf{0.574±0.154} \\
55~$\mu$m/224~px & ST-Net        & 0.442±0.036             & 0.549±0.019          & 0.609±0.059          & 0.767±0.203          & 0.652±0.086          & 0.649±0.080          \\
           & EGN           & 0.355±0.030             & 0.471±0.010          & 0.601±0.056          & 0.778±0.229          & 0.651±0.105          & 0.674±0.071          \\
           & HisToGene     & 0.403±0.028             & 0.517±0.017          & 0.596±0.058          & 0.702±0.173          & 0.622±0.074          & 0.663±0.067          \\
           & BLEEP         & 0.339±0.026             & 0.467±0.017          & 0.576±0.049          & 0.717±0.112          & 0.623±0.044          & 0.667±0.043          \\
           & His2ST        & 0.327±0.021             & 0.459±0.013          & 0.601±0.058          & 0.813±0.199          & 0.673±0.089          & 0.673±0.065          \\
           & TRIPLEX       & 0.442±0.200             & 0.525±0.119          & 0.579±0.075          & 0.828±0.148          & 0.688±0.048          & \textbf{0.677±0.059} \\
           & MagNet (Ours) & \textbf{0.324±0.044}    & \textbf{0.458±0.030} & \textbf{0.611±0.082} & \textbf{0.688±0.149} & \textbf{0.612±0.069} & 0.670±0.059          
       
\end{tblr}
    }
        \label{table:comparison}
\end{table*}


\noindent\textbf{Compared Methods and Evaluation Metrics.} MagNet was benchmarked against existing ST prediction methods, including the multi-resolution network TRIPLEX~\cite{chung2024accurate}, the spatial-aware methods HisToGene~\cite{pang2021leveraging} and His2ST~\cite{zeng2022spatial}, the similarity-based strategy BLEEP~\cite{xie2024spatially}, EGN~\cite{yang2023exemplar}, and the classic approach ST-Net~\cite{he2020integrating}. For all methods, we used the officially released code. The Pearson correlation coefficient (PCC), mean squared error (MSE), and mean absolute error (MAE) were used to evaluate model performance.
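For reference, the three metrics can be written as follows. This is a sketch under the assumption that errors are averaged over all patches and genes and that PCC is computed per gene across patches and then averaged over genes; the exact aggregation is not specified above. Let $y_{ig}$ and $\hat{y}_{ig}$ denote the normalized ground-truth and predicted expression of gene $g$ in patch $i$, with $N$ patches and $G$ genes:
\[
\mathrm{MSE} = \frac{1}{NG}\sum_{i=1}^{N}\sum_{g=1}^{G}\left(\hat{y}_{ig} - y_{ig}\right)^{2}, \qquad
\mathrm{MAE} = \frac{1}{NG}\sum_{i=1}^{N}\sum_{g=1}^{G}\left|\hat{y}_{ig} - y_{ig}\right|,
\]
\[
\mathrm{PCC} = \frac{1}{G}\sum_{g=1}^{G}
\frac{\sum_{i=1}^{N}\left(y_{ig} - \bar{y}_{g}\right)\left(\hat{y}_{ig} - \bar{\hat{y}}_{g}\right)}
{\sqrt{\sum_{i=1}^{N}\left(y_{ig} - \bar{y}_{g}\right)^{2}}\,\sqrt{\sum_{i=1}^{N}\left(\hat{y}_{ig} - \bar{\hat{y}}_{g}\right)^{2}}},
\]
where $\bar{y}_{g}$ and $\bar{\hat{y}}_{g}$ are the means of $y_{ig}$ and $\hat{y}_{ig}$ over patches. Lower MSE and MAE and higher PCC indicate better predictions.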

\noindent\textbf{Experiment Setting and Implementation.} 
Experiments were conducted on NVIDIA RTX A6000 GPUs. We used the SGD optimizer with a momentum of 0.9 and a weight decay of $10^{-4}$. The learning rate was initialized at $10^{-4}$ and followed a cosine decay schedule, decreasing progressively to 1\% of its initial value during training. All models were trained to convergence. We used a batch size of 256 and tuned the hyperparameters $\lambda_1$, $\lambda_2$, $\lambda_b$, $\lambda_s$, and $\lambda_r$ of our hybrid loss function to 0.1, 0.1, 0.8, 0.25, and 0.25, respectively. For graph construction, the top-$k$ value was fixed at 8. We selected 8~$\mu$m and 16~$\mu$m bins as the target HD resolutions because gene expression counts in 2~$\mu$m bins are extremely low. During spot-level experiments, we froze the encoder parameters of the bin and region levels and updated only those of the spot level.
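The cosine decay described above can be written as the following sketch. The step index $t$ and the total number of steps $T$ are notation introduced here for illustration, and whether the schedule is stepped per iteration or per epoch is left as an assumption:
\[
\eta_t = \eta_{\min} + \tfrac{1}{2}\left(\eta_0 - \eta_{\min}\right)\left(1 + \cos\frac{\pi t}{T}\right), \qquad \eta_0 = 10^{-4}, \quad \eta_{\min} = 0.01\,\eta_0 = 10^{-6}.
\]
At $t = 0$ this gives $\eta_0$, and at $t = T$ it reaches $\eta_{\min}$, i.e., 1\% of the initial value.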
