\label{section:preliminaries}

\subsection{Neural ODE and ResNet models}
\label{subsect:models}

We consider the following neural ODE:
\begin{equation}
\dot x(t)=\frac{dx(t)}{dt}=f(x(t)),
\label{eq:nODE}
\end{equation}
with state $x\in\R^n$, initial state $x(0)=u$, and vector field $f:\R^n\rightarrow\R^n$ defined as a finite sequence of classical neural network layers (such as fully connected layers, convolutional layers, activation functions, batch normalization).
The state trajectories of \eqref{eq:nODE} are defined based on the solution $\Phi:\R\times\R^n\rightarrow\R^n$ of the corresponding initial value problem:
$$x(t)=\Phi(t,x(0))=\Phi(t,u).$$

In~\cite{chen2018neural}, such a neural ODE is described as a continuous-depth generalization of a residual neural network constituted of a single residual block.
Conversely, this ResNet can be seen as the Euler discretization of the neural ODE \eqref{eq:nODE}:
\begin{equation}
y=u+f(u),
\label{eq:ResNet}
\end{equation}
where $u\in\R^n$ is the input, $y\in\R^n$ is the output, and the residual function $f:\R^n\rightarrow\R^n$ is identical to the vector field of the neural ODE~\eqref{eq:nODE}.

Since the approach proposed in this paper relies on the Taylor expansion of the trajectories of~\eqref{eq:nODE} up to the second order, we assume here for simplicity that the neural network described by the vector field $f$ is continuously differentiable.

\begin{remark}
The case where $f$ contains piecewise-affine activation functions such as ReLU can theoretically be handled as well, since our approach only really requires their derivatives to be bounded (but not necessarily continuous).
But for the sake of clarity of presentation (to avoid the case decompositions of each ReLU activation), this case is kept out of the scope of the present paper.
\end{remark}


\subsection{Problem definition}
\label{subsect:Prob def}

As mentioned above and in~\cite{chen2018neural}, both the neural ODE and ResNet models describe a very similar behavior, and either model could be seen as an approximation of the other.
Our goal in this paper is to provide a formal comparison of these models in the context of safety verification, by evaluating the approximation error between them.
For such comparison to be meaningful, we consider the outputs $y$ of the ResNet~\eqref{eq:ResNet} on one side, and the outputs $\Phi(1,u)$ of the neural ODE~\eqref{eq:nODE} at continuous depth $t=1$ on the other side, since other values $t\neq 1$ of this continuous depth have no elements of comparison in the discrete architecture of the ResNet. 

Given an initial set $\mathcal{X}_{in}\subseteq\R^n$ for the neural ODE (or equivalently referred to as \emph{input set} for the ResNet), we first define the sets of reachable outputs for either model:
\[\mathcal{R}_{\text{neural ODE}}(\mathcal{X}_{in})=\{y\in \mathbb{R}^n \mid y= \Phi(1,u), \ u\in \mathcal{X}_{in}\},\]

\[\mathcal{R}_{\text{ResNet}}(\mathcal{X}_{in})=\{y\in \mathbb{R}^n \mid y= u + f(u), \ u\in \mathcal{X}_{in}\}.\]

Since we usually cannot compute these output reachable sets exactly, we will often rely on computing an over-approximation denoted as $\Omega(\mathcal{X}_{in})$ such that $\mathcal{R}(\mathcal{X}_{in})\subseteq \Omega(\mathcal{X}_{in})$.

Our first objective is to bound the approximation error between the two models, as formalized below.

\begin{problem}[Error Bounding]
\label{def:error bound}
Given an input set $\mathcal{X}_{in}\subseteq\R^n$, we want to over-approximate the set $\mathcal{R}_\varepsilon(\mathcal{X}_{in})$ of errors between the ResNet~\eqref{eq:ResNet} and neural ODE~\eqref{eq:nODE} models, defined as:
$$\mathcal{R}_\varepsilon(\mathcal{X}_{in})=\left\{\Phi(1, u)-(u+f(u))~|~u\in\mathcal{X}_{in}\right\}.$$
\end{problem}

Our second problem of interest is to use one of our models as a verification proxy for the other.
In other words, we want to combine this error bound with the reachable set of one model to verify the satisfaction of a safety property on the other model, without having to compute the reachable output set of this second model.

\begin{problem}[Verification Proxy]
\label{def:safety verification}
Given an input-output safety property defined by an input set $\mathcal{X}_{in} \subseteq \mathbb{R}^n$ and a safe output set $\mathcal{X}_{s} \subseteq \mathbb{R}^n$, the verification problem consists in checking whether the reachable output set of a model is fully contained in the targeted safe set: $\mathcal{R}(\mathcal{X}_{in})\subseteq\mathcal{X}_{s}$.
In this paper, we want to verify this safety property on one model by relying only on the error set $\mathcal{R}_\varepsilon(\mathcal{X}_{in})$ from Problem~\ref{def:error bound} and the reachability analysis of the other model.
\end{problem}

