



\section{Introduction}
\label{sec:intro}
Quantum machine learning (QML) aims to integrate quantum algorithms into broader machine learning pipelines, seeking performance advantages over classical methods on specialised tasks \citep{nature_QML}. QML techniques have already proven effective across supervised, unsupervised, and reinforcement learning settings \citep{qml_supervised_example, qml_unsupervised_example, qml_reinforcement_example}. 

Classical machine learning models are increasingly deployed in domains with significant societal implications, for example, in justice, healthcare, transportation, and defence \citep{justice_example, healthcare_example, driving_example, military_example}. In such high-stakes settings, erroneous decisions can carry severe consequences, making rigorous uncertainty quantification essential to model evaluation and deployment.

For QML models to become trustworthy tools, they must also incorporate reliable uncertainty quantification. This challenge is heightened by the hardware noise in current noisy intermediate-scale quantum (NISQ) devices, which may vary in both character and severity over time \citep{proctor2020detecting}.

Conformal prediction stands out as a technique for uncertainty quantification, offering distribution-free, finite-sample guarantees on predictive performance \citep{gentleintrocp}. Its principal function is to post-process predictive model outputs to produce prediction sets that encompass the true outcome with a user-specified probability. Conformal prediction is appealing within the machine learning community for its ability to function as a wrapper around black-box models \citep{caprio2025joyscategoricalconformalprediction, caprio2025conformalized}.

A commonly used variant, known as split conformal prediction, was recently extended to quantum models in \cite{QuantumCP}. Their seminal work introduces the quantum conformal prediction (QCP) framework, which is designed to use measurements from parametrised quantum circuits (PQCs). When tested on quantum hardware, QCP consistently maintained the target coverage level \citep{QuantumCP}. However, the theoretical guarantees of QCP require a simplifying assumption: that the noise processes within the quantum hardware remain stationary over time.

In this work, we demonstrate that, without this assumption, one can no longer assert that conformity scores are exchangeable, a property that is necessary for standard conformal guarantees. To address this, we extend Adaptive Conformal Inference \citep{ACI} to the quantum setting, yielding the Adaptive Quantum Conformal Prediction (AQCP) algorithm. This algorithm incorporates online recalibration to explicitly handle non-stationary hardware noise and exhibits greater stability than QCP in experiments using an IBM quantum processor. AQCP is straightforward to implement and provides a reliable representation of uncertainty for QML models under arbitrary hardware noise conditions. Alongside demonstrating AQCP's effectiveness, we provide an experimental analysis of various score functions, comparing the average set size they induce when used with AQCP.

\subsection{Related Work}

Beyond \cite{QuantumCP}, which we introduce above, \cite{QuantumCP_Tasar} explored an alternative use of conformal prediction for PQCs. In their method, conformal prediction is applied to classical models trained to emulate PQC output distributions using features derived from circuit architecture and gate frequencies. Additionally, \cite{tasar2025conformal} investigates conformal prediction in a wide variety of quantum settings, including whether conformal prediction can be used to detect entanglement, the impact of context-conditional exchangeability, and an application to anomaly detection. While these works demonstrate that conformal prediction can be meaningfully applied in quantum contexts, none directly examine how non-stationary model noise affects the exchangeability of conformity scores.

In the classical setting, a growing body of literature has sought to relax the exchangeability requirement between calibration and test data to handle settings such as time series and covariate shift. These methods generally address either specific, known distributional shifts or more general deviations from exchangeability \citep{CPundercovariateshift, beyondexchangeability, gibbs2025conformal, ACI}. We take inspiration from this literature in our approach.

The design of score functions for quantum conformal prediction closely aligns with that of probabilistic conformal prediction \citep{PCP}. \cite{QuantumCP} employ a k-nearest neighbour (k-NN) score function, building upon the approach in \cite{PCP}, who propose a sampling strategy combined with a $1$-nearest neighbour (1-NN) score function. \cite{PCP} have also influenced methods for other probabilistic models, including the Conformal-Predict-Then-Optimise (CPO) framework \citep{patel2024conformal}, which similarly employs a k-NN approach. Sample-based strategies have also appeared in the context of conformal risk control \citep{zecchin2023forkinguncertaintiesreliableprediction}, where samples from the model distribution are used to generate prototypical sequences, to which a Euclidean distance score function is then applied.

\subsection{Contributions}

    This paper presents a rigorous framework for reliable uncertainty quantification in quantum machine learning in the presence of realistic, non-stationary hardware noise. Our primary contributions are:

\begin{itemize}
    \item \textbf{Formalisation of time dependence:} We develop a theoretical framework demonstrating how non-stationary noise invalidates the exchangeability of conformity scores, even when calibration and test data are exchangeable.
    \item \textbf{Adaptive algorithm for quantum conformal prediction:} We adapt the Adaptive Conformal Inference method \citep{ACI} to the quantum machine learning setting. This approach, which we term Adaptive Quantum Conformal Prediction (AQCP), utilises score functions specifically designed to operate on samples from an implicit probability distribution.
    \item \textbf{Comprehensive hardware evaluation:} We conduct a thorough experimental evaluation of AQCP's validity and efficiency on IBM quantum hardware, analyse the performance of various sample-based score functions, and show that AQCP maintains the target coverage level with greater stability than the QCP framework.
\end{itemize}
