
The deployment of machine learning models---including deep neural networks---has become increasingly widespread, yet their application in costly or safety-critical settings remains hindered by two key challenges \citep{makridakis2016forecasting,quinonero2022dataset}. Firstly, many models continue to produce point-wise predictions without uncertainty estimation, inherently limiting the robustness of obtained information for decision-making \citep{begoli2019need, padilla2021uncertain}. Yet even when uncertainty is incorporated, such as through some form of uncertainty scoring or probabilistic modelling \citep{gawlikowski2023survey}, estimates can be misleading or overconfident \citep{kompa2021empirical, xiong2023can}. A popularized uncertainty framework that partially addresses such issues is \emph{conformal prediction} (CP), which extends point-wise predictions to prediction set or interval estimation \citep{vovk2005algorithmic, angelopoulos2023conformal}. Importantly, a notion of reliability is obtained via a probabilistic coverage guarantee for new, unseen test samples (see \autoref{subsec:background-cp}). Unlike traditional prediction set methods \citep{khosravi2011comprehensive}, CP is fully data-driven, distribution-free, and compatible with `black-box' models.

Secondly, it is well-known that distribution shifts at test time can severely degrade model performance \citep{koh2021wilds, ovadia2019can}. Among types of shifts, \emph{geometric} data shifts---where test samples undergo geometric transformations such as rotations or flips---pose a significant challenge, in particular for pretrained models lacking integrated equivariance or invariance properties \citep{bronstein2021geometric}. As such symmetry-awareness can be sometimes challenging to scale and is thus overlooked \citep{brehmer2024does}, large models trained on vast datasets may nonetheless struggle when faced with pose variations, as exemplified in \autoref{tab:segm-robustness} for segmentation under rotations. Other practical failures may include proper recognition for medical images due to scan variations \citep{fu2023guest} or 3D objects due to axis-misaligned point clouds \citep{vadgama2025utilityequivariancesymmetrybreaking}. For conformal prediction, such geometric shifts can violate \emph{exchangeability} assumptions on the data (\hyperref[def:exch]{Def.~\ref{def:exch}}), leading to potentially unreliable or uninformative prediction sets \citep{barber2023conformal}. Unreliable in the sense that statistical coverage guarantees may no longer hold, and uninformative as prediction sets may grow excessively large. 

To address this, we propose robustifying the conformal procedure by incorporating geometric information on occuring shifts, while preserving CP's advantageous flexibility by avoiding to modify the underlying model. This is practically achieved via \emph{canonicalization} \citep{mondal2023equivariant, kaba23equivariance}, a framework that learns to map data into a canonical form, and decouples the geometric task from the underlying predictor. Leveraging this approach, we explore how obtained geometric information can be effectively combined with CP in multiple different ways. In summary, our contributions include:
\begin{itemize}
    \item Introducing a novel geometric perspective on the topic of distribution shifts in conformal prediction, and motivating how geometric information can ensure core conditions of CP such as exchangeability are met (\autoref{sec:method});
    \item Leveraging canonicalization as a suitable geometric information extractor that is both \emph{post-hoc} and light-weight, in line with practical principles underlying CP;
    \item Investigating its integration with CP in several ways, including mitigating performance drops (\autoref{subsec:exp-robust}), as an information tool for conditional coverage (\autoref{subsec:exp-condcover}), and as a weighting mechanism in multi-shift settings (\autoref{subsec:exp-weightcp}).
\end{itemize}

\begin{table}[t]
  \caption{
    Zero-shot Mask-RCNN segmentation performance (mAP) on regular and $C4$-rotated COCO data without and with invariance (via canonicalization \citep{mondal2023equivariant}). Missing symmetry-awareness leads to failed generalization.
  }
  \centering
  \begin{tabularx}{\linewidth}{X|c|c}
    \toprule
     \textbf{Model} & \textbf{mAP} & \textbf{$C4$-mAP} \\
     \midrule
    Mask-RCNN without Invariance  & 47.81 & 12.79 \\
    Mask-RCNN with Invariance & 43.47 & 43.47 \\
    \bottomrule
  \end{tabularx}
  \label{tab:segm-robustness}
\end{table}


% \putri{While equivariant and invariant models are theoretically appealing for their robustness to geometric transformations, they can be challenging to scale \citep{mondal2023equivariant, brehmer2024does}. Foundation models, often trained on vast and varied datasets, may lack built-in equivariance, leading to performance degradation when faced with unseen geometric variations during deployment. This is examplified in \autoref{tab:segm-robustness} and \citep{mondal2023equivariant}, which demonstrate that pre-trained segmentation models may not be inherently robust to such geometric shifts. Other practical examples include the failure of 3D object recognition models struggling with non-axis-aligned point clouds \citep{vadgama2025utilityequivariancesymmetrybreaking}, or medical imaging datasets, which may be aggregated from multiple sources with differing equipment and protocols [refs]. Additionally, some works have studied the nature of learned invariance and inherently invariant architectures, and demonstrated that models trained with data augmentation, though somewhat robust, may fail under distribution shifts \citep{vadgama2025utilityequivariancesymmetrybreaking, moskalev2023genuine}.}

% \item \putri{realizing such a procedure via the \textit{canonicalization prior} \citep{mondal2023equivariant}, which is a suitable candidate due to its light-weight and post-hoc nature, fully in line with both practical and theoretical principles underlying conformal prediction.}