\section{Introduction}\label{sec:intro}



\begin{figure*}[ht]
    \centering
    \begin{subfigure}[b]{0.36\textwidth}
        \includegraphics[width=\textwidth]{figs/schema/time_warp.pdf}
        \caption{Distributional Shift}
        \label{fig:distributional_shift}
    \end{subfigure}
    \hspace{-1em}
    \begin{subfigure}[b]{0.36\textwidth}
        \includegraphics[width=\textwidth]{figs/schema/temporal_smoothing.pdf}
        \caption{Temporal Smoothing}
        \label{fig:temporal_smoothing}
    \end{subfigure}
    \hspace{-1em}
    \begin{subfigure}[b]{0.3\textwidth}
        \includegraphics[width=\textwidth]{figs/schema/conformal_prediction.pdf}
        \caption{Confidence Sets}
        \label{fig:conformal_prediction}
    \end{subfigure}
    
    \caption{Visual representations showcasing different stages of a Temporal Smooth Conformal Predictor (TSCP) with a sample from the UCR-TS dataset~\citep{UCRArchive}. 
    In (a) we display the distributional shift where the original signal $x$ can be manipulated via time warping $\tilde{x}$. 
    In (b) we represent the smoothing versions of $x^\prime$ derived from a temporal shifts around $\tilde{x}$. 
    While in (c) we illustrate non-conformity scores, highlighting that TSCP obtains more precise confidence sets with respect than the Randomly Smoothed Conformal Predictor (RSCP)~\cite{gendler2021adversarially}.
    }
    \label{fig:framework}
\end{figure*}


Deep neural networks have shown their remarkable ability to learn intricate patterns from vast amounts of data, marking them as a preferred choice for complex challenges~\citep{lecun2015deep, han2022survey, wen2022transformers}. In order to be able to employ these models in applications where high-stake decisions are based on their outputs, such as in all safety-critical systems, it is imperative not only to ensure their accuracy but also to understand and quantify the confidence attached to their predictions and the robustness of the model to input space perturbations and domain shifts.
Addressing the former, the Conformal Prediction (CP)~\citep{vovk1999machine, vovk2005algorithmic, shafer2008tutorial} framework has emerged as a tool to construct set classifiers with guaranteed confidence that can be adjusted by the user.

The importance of \textit{domain generalization} becomes especially crucial when the costs of mispredictions are high, as in medical diagnosis or autonomous driving~\citep{zhou2022domain, wang2022generalizing}. 
Notably, research by \citet{park2019pac, park2020pac, park2022pac, CPgeneralisation2022} demonstrated how CP can be utilized to bound performance on unseen domains, if there is an adequate amount of training domains for calibration during the training process. 
In addition, Randomized Smoothing (RS)~\citep{cohen2019certified, salman2019provably} has emerged as method to robustify machine learning models against worst case, i.e., adversarial, attacks. 
While undeniably successful as defense mechanism, RS is not equally useful in an assurance argumentation, where it is necessary to demonstrate robustness against disturbances inherent to the specific data domain being targeted.

Often, real life perturbations come with large norms which renders certification with respect to some $\ell_p$-norm - as typically done in RS - prohibitively conservative, as huge volumes would need to be certified. 
Consider the example in \autoref{fig:distributional_shift}, where a time series signal is characterized by a few peaks.
When a time warping augmentation is applied, it may result in a large magnitude for a conventional $\ell_p$-norm.
However, this transformation would remain concentrated around the original peak's location.
As a result, in order to establish certified robustness, we do not need to consider other transformations with equal norm but amplitudes at a different position of the signal.
These considerations motivate the new technique developed in this paper.


\paragraph{Contribution}

In this paper, we generalize RS to arbitrary perturbations following an automotive use case build around a binary classification of time series input. The input time series are subjected to different, practically relevant transformations such as time warping, window warping, scaling or jitter. 
Since different domains can be modeled, i.e., experiment parameters which introduce domain shifts are known, several data sets are recorded. 
First, we show how CP naturally generalizes RS and use the resulting algorithm to show robustness on the use case tailored perturbations. 
Second, we utilize the available domains during training to bound the performance of the algorithm on unseen test domains under loose assumptions \citet{park2019pac, park2020pac, park2022pac, CPgeneralisation2022}.
In \autoref{fig:framework}, we show a visual representation of our framework. 
By applying temporal transformations to smooth the sample, we enhance the certainty of the predicted confidence set. 
As observed, the output demonstrates increased confidence for class 0, leading to a reduction in the size of the predicted confidence set.

In summary, our core contributions are:
\begin{itemize}
    \item Generalizes RS to handle various transformations in time series classification, like time, window and magnitude warping.
    \item Introduce a \textit{Temporal Smooth Conformal Predictor} (TSCP) that employees temporal-shift transformations to smooth the input and demonstrates CP guarantees robustness against adversarial attacks and perturbations.
    \item Experimental evaluation of TSCP on a open-source time series dataset and on a real-world application, demonstrating the effectiveness in terms of robustness certification and empirical risk minimization.
\end{itemize}