\section{Appendix: Conformal Prediction Beyond Exchangeability}
\label{app:BeyondEX}
We follow the procedure in \cite{beyondexchangeability}, making the necessary adaptations to our setting. Denote $Z_i=(X_i,Y_i;\mathcal A_{X_i,T_i})$. A weight $w_i\in[0,1]$ is assigned to each data point $i=1,\ldots,n$ to quantify its similarity to a given test point. These weights can be derived from various metrics, such as the temporal gap between the observation and the test instance. The underlying intuition is to assign higher weights to data points presumed to share the same distribution as the test point, $(X_{n+1},Y_{n+1};\mathcal A_{X_{n+1},T_{n+1}})$, and lower weights to those from different distributions. The weights are subsequently normalised,
\[  
    \Tilde{w}_i = \frac{w_i}{\sum_{j=1}^n w_j+1}, \quad\text{for}\quad i=1,\ldots,n, \qquad \Tilde{w}_{n+1}=\frac{1}{\sum_{j=1}^n w_j+1}.
\]
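
As an illustration of the normalisation, consider a geometric decay in time, $w_i=\rho^{\,n+1-i}$ with $\rho\in(0,1)$. This choice is an assumption made here purely for the example, with the data taken to be ordered in time; it is not prescribed by our setting. Since $\sum_{j=1}^n w_j=\rho(1-\rho^n)/(1-\rho)$, the normalised weights are
\[
    \Tilde{w}_i=\frac{(1-\rho)\,\rho^{\,n+1-i}}{1-\rho^{\,n+1}}, \quad\text{for}\quad i=1,\ldots,n, \qquad \Tilde{w}_{n+1}=\frac{1-\rho}{1-\rho^{\,n+1}}.
\]
For instance, with $n=3$ and $\rho=1/2$ we get $(w_1,w_2,w_3)=(1/8,1/4,1/2)$, hence $(\Tilde{w}_1,\Tilde{w}_2,\Tilde{w}_3,\Tilde{w}_4)=(1/15,\,2/15,\,4/15,\,8/15)$, which sums to one as required.
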
\cite{beyondexchangeability} provide equivalent theoretical bounds for both the non-symmetric full conformal and the split conformal algorithms. We consider only the split conformal case. The non-exchangeable split conformal set is given by
\[ 
    C(X_{n+1}) = \left\{y\in\mathcal{Y} :\hat S(X_{n+1},y\,; \mathcal A_{X_{n+1},T_{n+1}})\leq \text{Quantile}\left(1-\alpha, \sum_{i=1}^n\Tilde{w}_i\cdot\delta_{\hat{S}(Z_i)}+\Tilde w_{n+1}\cdot\delta_{+\infty} \right)\right\}.
\]
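
To make the weighted quantile concrete, we read $\text{Quantile}(1-\alpha,\cdot)$ as the usual generalised inverse of the weighted empirical distribution, which here (with the convention $\inf\emptyset=+\infty$) amounts to
\[
    \text{Quantile}\left(1-\alpha, \sum_{i=1}^n\Tilde{w}_i\cdot\delta_{\hat{S}(Z_i)}+\Tilde w_{n+1}\cdot\delta_{+\infty} \right)
    = \inf\left\{q\in\mathbb{R} : \sum_{i=1}^n \Tilde{w}_i\,\mathbf{1}\{\hat S(Z_i)\leq q\}\geq 1-\alpha\right\}.
\]
Two consequences follow directly. First, if $\Tilde w_{n+1}>\alpha$, the mass on finite scores is below $1-\alpha$, the quantile equals $+\infty$, and $C(X_{n+1})=\mathcal{Y}$. Second, under the illustrative assumption of equal weights $w_i=1$, we have $\Tilde w_i=1/(n+1)$ and the quantile is the $\lceil(1-\alpha)(n+1)\rceil$-th smallest of $\hat S(Z_1),\ldots,\hat S(Z_n)$ whenever this index is at most $n$, which is the standard split conformal threshold. For example, $n=99$ and $\alpha=0.1$ give the $90^\text{th}$ smallest calibration score.
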
With this prediction set, we obtain the finite-sample guarantee of \cite{beyondexchangeability},
\[  
    \mathbb{P}(Y_{n+1}\in C(X_{n+1}))\geq 1-\alpha-\sum_{i=1}^n\Tilde w_i\cdot\text{d}_\text{TV}(S(Z),S(Z^i)),
\]
where $Z = (Z_1,\ldots, Z_{n+1})$ represents the original data sequence, $Z^i=(Z_1,\ldots ,Z_{i-1},Z_{n+1},Z_{i+1},\ldots ,Z_{n},Z_i)$ represents the same sequence but with the $i^\text{th}$ and $(n+1)^\text{th}$ observations swapped, $S(z)\in \mathbb{R}^{n+1}$ is the vector of scores with entries $(S(z))_i=\hat S(x_i,y_i; \mathcal A_{x_i,T_i})$, and $\text{d}_\text{TV}$ denotes the total variation distance. The method thus protects against shifts in the distributions of the $Z_i$'s and, as a consequence, against shifts in the scores.

The method provides a coverage guarantee of at least $1-\alpha$ minus a specific correction term. This correction reflects how much the data deviate from exchangeability, weighted by the importance given to each observation. If the data are truly exchangeable, this correction becomes zero, restoring the standard $1-\alpha$ guarantee. However, in QML, the size of this correction is in practice unknown. This limits the ability to adjust the weights to counteract large distributional shifts, meaning no practical lower bound on coverage can be given without making further assumptions.
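
To illustrate what such a further assumption could look like, consider a hypothetical changepoint scenario, introduced here only as an example: suppose that for some $m<n$ the joint distribution of $Z$ is unchanged when the $i^\text{th}$ and $(n+1)^\text{th}$ observations are swapped, for every $i>m$. Then $\text{d}_\text{TV}(S(Z),S(Z^i))=0$ for $i>m$, and since the total variation distance is at most one,
\[
    \mathbb{P}(Y_{n+1}\in C(X_{n+1}))\geq 1-\alpha-\sum_{i=1}^{m}\Tilde w_i\cdot\text{d}_\text{TV}(S(Z),S(Z^i))\geq 1-\alpha-\sum_{i=1}^{m}\Tilde w_i.
\]
Under this assumption the coverage gap is controlled by the total normalised weight placed on the first $m$ observations, and it vanishes if $w_i=0$ for all $i\leq m$. In practice $m$ is not known, which is precisely the difficulty described above.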
