% \documentclass{uai2022} % for initial submission
\documentclass[accepted]{uai2022} % after acceptance, for a revised
                                    % version; also before submission to
                                    % see how the non-anonymous paper
                                    % would look like
%% There is a class option to choose the math font
% \documentclass[mathfont=ptmx]{uai2022} % ptmx math instead of Computer
                                         % Modern (has noticable issues)
% \documentclass[mathfont=newtx]{uai2022} % newtx fonts (improves upon
                                          % ptmx; less tested, no support)
% NOTE: Only keep *one* line above as appropriate, as it will be replaced
%       automatically for papers to be published. Do not make any other
%       change above this note for an accepted version.


\newif\iffull
\fullfalse
%\fulltrue

\usepackage{mfirstuc} % For capitalising words on figures and tables.

%% Choose your variant of English; be consistent
\usepackage[american]{babel}
% \usepackage[british]{babel}



%% Some suggested packages, as needed:
\usepackage{natbib} % has a nice set of citation styles and commands
    \bibliographystyle{plainnat}
    \renewcommand{\bibsection}{\subsubsection*{References}}
\usepackage{mathtools} % amsmath with fixes and additions
% \usepackage{siunitx} % for proper typesetting of numbers and units
\usepackage{booktabs} % commands to create good-looking tables
\usepackage{enumitem}
\usepackage{tikz} % nice language for creating drawings and diagrams

%% Provided macros
% \smaller: Because the class footnote size is essentially LaTeX's \small,
%           redefining \footnotesize, we provide the original \footnotesize
%           using this macro.
%           (Use only sparingly, e.g., in drawings, as it is quite small.)

%% Self-defined macros
\newcommand{\swap}[3][-]{#3#1#2} % just an example











\usepackage{tikz-cd}
\usepackage{wrapfig}

% Packages from journal version:
\usepackage{diagbox}
\usepackage{placeins} 

\usepackage[utf8]{inputenc}
\usepackage{amsmath}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{xcolor}


\makeatletter
\newcommand*{\inlineequation}[2][]{%
  \begingroup
    % Put \refstepcounter at the beginning, because
    % package `hyperref' sets the anchor here.
    \refstepcounter{equation}%
    \ifx\\#1\\%
    \else
      \label{#1}%
    \fi
    % prevent line breaks inside equation
    \relpenalty=10000 %
    \binoppenalty=10000 %
    \ensuremath{%
      % \displaystyle % larger fractions, ...
      #2%
    }%
    ~\@eqnnum
  \endgroup
}
\makeatother


\usepackage{hyperref}
\hypersetup{
    colorlinks,
    linkcolor={cyan!50!black},
    citecolor={green!50!black},
    urlcolor={blue!80!black}
}
\usepackage{bbm}
\usepackage{bbold}
\usepackage{amsthm}
% \usepackage[nameinlink,capitalise]{cleveref}
\usepackage{graphicx}
\usepackage[thinc]{esdiff}
\usepackage{soul}
\usepackage{mathrsfs}
% \usepackage{eufrak}
% \usepackage{natbib}
% \setcitestyle{numbers}
% \usepackage[switch]{lineno}
\usepackage{subcaption}

\definecolor{darkred}{rgb}{0.5,0,0}
\definecolor{lightblue}{rgb}{0,0.4,0.8}
\definecolor{darkgreen}{rgb}{0,0.5,0}



\newtheorem{conjecture}{Conjecture}
\newtheorem{theorem}{Theorem}
\newtheorem{lemma}{Lemma}
\newtheorem{proposition}{Proposition}
\usepackage[nameinlink,capitalise]{cleveref}
\usepackage{crossreftools}
% NOTE: The package enumitem is loaded in the cls file. To it I added the option shortlabels so that the list in the final conjecture does not give problems. However I don't know if we are allowed to do that.
\definecolor{cricolor}{HTML}{ED4863}
\newcommand{\pink}{\color{cricolor}}
%%%%%%%%%%%%%%%%%%%%%%%%


\newtheorem{result}{Simulation Result}
\newcommand{\Rinf}{R_\infty}
\newcommand{\rinf}{r_\infty}

\newtheorem*{ttheorem}{Theorem}

\newcommand{\disc}{Discrete SIR Model}
\newcommand{\clique}{Clique Model (Network Model)}
\newcommand{\super}{Supermarket Model}
\newcommand{\G}{\mathcal{G}}
\newcommand{\B}{\mathcal{B}}
\newcommand*{\QEDB}{\null\nobreak\hfill\ensuremath{\square}}
\newcommand{\ind}[1]{\,\mathbb{1}_{\{#1\}}}


\newcommand{\Rzero}{\mathcal{R}_0}
\newcommand{\Rnought}{\mathfrak{R}}


\newcommand{\eethres}{\frac{\beta + \gamma}{|\gamma-\beta|} \sqrt{N}}

\newcommand{\open}[1]{{\color{blue} Open question: #1}}
\newcommand{\fnote}[1]{\ifdraft{\color{teal} F: #1}\fi}
\newcommand{\dnote}[1]{\ifdraft{\color{brown} D: #1}\fi}
\newcommand{\Cnote}[1]{\ifdraft{\color{cricolor} Cr: #1}\fi}

\renewcommand{\Pr}[1]{\mathbf{Pr}\left[#1\right]}
\newcommand{\E}[1]{\mathbf{E}\left[#1\right]}
\newcommand{\Var}[1]{\mathbf{Var}\left[#1\right]}

\newcommand{\brac}[1]{\left(#1\right)}
\newcommand{\V}{\mathbf{V}}
\newcommand{\eps}{\varepsilon}
\newcommand{\ptr}{p_{travel}}

\renewcommand{\P}{\mathbf{Pr}}
\newcommand{\red}{\color{red}}
\newcommand{\blue}{\color{blue}}
\renewcommand{\epsilon}{\varepsilon}

% \renewcommand{\pink}{}


\title{On Early Extinction and the Effect of Travelling in the SIR Model}

% The standard author block has changed for UAI 2022 to provide
% more space for long author lists and allow for complex affiliations
%
% All author information is automatically removed by the class for the
% anonymous submission version of your paper, so you can already add your
% information below.
%
% Add authors
\author[1]{\href{mailto:petra.berenbrink@uni-hamburg.de}{Petra Berenbrink}{}}
\author[2]{\href{mailto:colin.cooper@kcl.ac.uk}{Colin Cooper}{}}
\author[2]{\href{mailto:cristina.gava@kcl.ac.uk}{Cristina Gava}{}}
\author[3]{\href{mailto:david.kohan@eng.ox.ac.uk}{David Kohan Marzagão}{}}
\author[2]{\href{mailto:frederik.mallmann-trenn@kcl.ac.uk}{Frederik Mallmann-Trenn}{}}
\author[2]{\href{mailto:tomasz.radzik@kcl.ac.uk}{Tomasz Radzik}{}}
% \author[3,1]{Further~Coauthor}
% Add affiliations after the authors
\affil[1]{%
    Department of Informatics\\
    Universit\"at Hamburg, Germany%\\
    % Pittsburgh, Pennsylvania, USA
}
\affil[2]{%
    Department of Informatics\\
    King's College London
}
\affil[3]{%
    Department of Engineering Science\\
    University of Oxford
  }
  
  \begin{document}
\maketitle

\begin{abstract}
We consider a population protocol version of the SIR model. In every round, an individual is chosen uniformly at random. If the individual is susceptible, then it becomes infected w.p. $\beta I_t/N$, where $I_t$ is the number of infections at time $t$ and $N$ is the total number of individuals. If the individual is infected, then it recovers w.p. $\gamma$, whereas, if the individual is already recovered, nothing happens.

We prove sharp bounds on the probability of the disease becoming pandemic vs extinguishing early (dying out quickly). The probability of extinguishing early, $\Pr{\mathcal{E}_{ext}}$, is typically neglected in prior work since most use (deterministic) differential equations.
Building on this bound on $\Pr{\mathcal{E}_{ext}}$, we then bound the expected size of the population that contracts the disease, $\mathbf{E}\left[R_\infty\right]$. Prior work only calculated $\mathbf{E}\left[R_\infty~|~\overline{\mathcal{E}_{ext}}\right]$, or obtained non-closed form solutions.


We then study a two-country model, again accounting for the role of $\Pr{\mathcal{E}_{ext}}$.
We assume that the two countries may have different infection rates $\beta^{(i)}$, but share the same recovery rate $\gamma$. In this model, each round has two steps: first, an individual is chosen u.a.r. and travels w.p. $p_{travel}$ to the other country; afterwards, the process continues as before with the respective infection rates.

Finally, using simulations, we characterise the influence of $p_{travel}$ on the total number of infections. Our simulations show that, depending on the $\beta^{(i)}$, increasing $p_{travel}$ can decrease or increase the expected total number of infections $\mathbf{E}\left[R_\infty\right]$.
\end{abstract}

\section{Introduction}\label{sec:intro}
In this paper we consider the well-known SIR process which is used to study the spread  of a contagious disease. %in a population of size $N$.
The model was introduced in the early 20th century (\cite{ross,kermack1927contribution}). 
The population is split into three compartments (or states):  susceptible (S), infected (I) and recovered (R). At the beginning of an epidemic, a small number of individuals are infected and the rest of the population is susceptible. 
Susceptible individuals can  be infected and infected individuals can recover and become permanently 
immune to the disease.
The model is often used to study the spread of diseases like 
COVID-19, measles, mumps and rubella. 

When an epidemic starts, the number of infected individuals increases rapidly.
% , since a large number of them becomes infected.
At some point, the proportions of still susceptible and already recovered individuals are such that the spread of the infection begins to slow down, until it stops completely.
The model is characterised by the transition rates between the compartments (states), which depend on the infection rate $\beta$ and the recovery rate $\gamma$. 

% \medskip
The so-called reproduction number of an SIR process is defined as $\Rnought_0=\beta/\gamma$. This number is equal to the  expected number of secondary cases following the introduction of one infected individual into a fully susceptible population.
This reproduction number determines how the process evolves over time and how many individuals will get infected until the infection dies out. In general, SIR processes behave as follows.
If $\Rnought_0<1$, the disease extinguishes early and the total number of infections follows an exponential distribution. 
If $\Rnought_0>1$, the number of infected individuals first grows exponentially (in expectation). Once the infected and recovered individuals together make up a fraction $1-1/\Rnought_0$ of the population, the number of infected individuals decreases, in expectation, until it reaches zero. The fraction $1-1/\Rnought_0$ is also called the {\em herd immunity threshold}. 


Looking closer at the case $\Rnought_0>1$, one can see that
the total number of infections follows a bi-modal distribution. 
The first peak represents {\em early extinction}, meaning the process terminates early (see \cref{sec:model} for a precise definition). If this unlikely event does not happen, then the process is likely  to reach the herd immunity threshold and a much larger number of individuals gets infected.% (see \cref{thm:cat}) \fnote{add citation}.

% \medskip
In this paper we study the SIR process as a population protocol.
We assume that the population has a fixed size $N$ and that each individual is modelled as a finite-state machine. 
The state space of our protocol is simple: each individual is in one of the three states $S$, $I$, or $R$. We denote by $S_t$, $I_t$ and $R_t$ the numbers of susceptible, infected and recovered individuals, respectively, at the beginning of step $t$.
%  We denote by  $S_t$ the number of individuals susceptible to a disease at the beginning of step $t$. We use $I_t$ to denote the number of infected individuals at the beginning of step $t$ and $R_t$ to denote the number of recovered individuals at the beginning of step $t$.
At the beginning of the process ($t=0$), one individual (or a small subset of individuals) is infected and all other individuals are susceptible. No individual is recovered yet. 
The process is now defined as follows. 
In each step, a pair of individuals $i,j$ is chosen uniformly at random, and $i$ is allowed to change its state.
If the individual $i$ is currently infected, then it recovers with  probability $\gamma$. If $i$ is susceptible  and $j$ is infected, then $i$ becomes infected with probability $\beta$.
% Note that one  could also assume that a susceptible individuals meets another individual and gets infected if that individual is infected. 



As mentioned before, it may happen that the disease dies out quickly  without ever evolving into a pandemic, i.e., without reaching the herd immunity threshold. 
In this paper we calculate the probability of early extinction as a function of 
$\Rnought_0=\beta/\gamma$, and we present a fully rigorous analysis of this probability using coupling arguments. We show that, if we start with $s$ infected individuals, the probability of early extinction $\Pr{\mathcal{E}_{ext}}$ is asymptotically $\left(1/\Rnought_0\right)^s$, assuming $\beta > \gamma +\eps$ for some arbitrarily small constant $\eps$. If $\beta < \gamma - \eps$, then $\Pr{\mathcal{E}_{ext}}=1-o(1)$.
We then calculate the expected size of the population that gets infected throughout the process, $\E{\Rinf}$.
% \footnote{
Note 
that $\Rinf$ denotes the number of recovered individuals at the end of the process. Since the recovered state is the only terminal state, $\Rinf$ equals the number of individuals that were infected at some point during the process. 
% }
We first show that $\E{\rinf \ |\ \mathcal{E}_{ext}}$ is close to zero, where $\rinf = \Rinf/N$.
Then we show, conditioning on $\overline{\mathcal{E}_{ext}}$, that
$\E{\rinf ~|~\overline{\mathcal{E}_{ext}}} = \left(1+{W \left(-\Rnought_0 \cdot e^{-\Rnought_0} \right) }/{\Rnought_0} \right) \pm o(1),$
where $W$ denotes the Lambert $W$ function. Combining our results, we show that, without any conditioning, \[\E{\rinf }=\E{\rinf ~|~\overline{\mathcal{E}_{ext}}}\cdot (1-\Pr{\mathcal{E}_{ext}})  \pm o(1) .\]
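As a purely illustrative calculation (the parameter values are assumptions chosen for this example, not results of the paper), take $\Rnought_0 = 2$ and $s=1$. The principal branch of the Lambert $W$ function gives $W(-2e^{-2}) \approx -0.406$, so that, ignoring the $o(1)$ terms,
\[
\E{\rinf ~|~\overline{\mathcal{E}_{ext}}} \approx 1 - \frac{0.406}{2} \approx 0.797
\qquad\text{and}\qquad
\Pr{\mathcal{E}_{ext}} \approx \frac{1}{\Rnought_0} = \frac{1}{2}.
\]
The unconditional expectation is then $\E{\rinf} \approx 0.797 \cdot (1 - 1/2) \approx 0.40$, roughly half of the value obtained when early extinction is ignored.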

Many of the results found in the literature are based on first-order methods, i.e., mean-field approaches (e.g., \cite{kroger2020analytical,bicher2013agent}), and they give an
expression similar to our term for 
$\E{\rinf \ |\ \overline{\mathcal{E}_{ext}}} $ as an estimate of $\E{\rinf} $.
This is because a mean-field approach regards the process as deterministic, which means that early extinction cannot happen for $\Rnought_0>1$. Furthermore, such an approach neglects the variance of the process, which can be crucial in the analysis of random processes
(see \cite{berenbrink2017ignore} for a majority-type process where the expected change is the same, but the variance of the process determines the convergence time).
Nevertheless, it has been observed through simulations that the deterministic and stochastic processes differ in terms of the total number of infections (e.g., Fig.~13 of \cite{allen2000comparison}). 



% \]


% We then calculate the expected number of total infections conditioning .

% \begin{enumerate}
% \item $\E{ \rinf~|~\mathcal{E}_{ext}} \in [0, O(1/\sqrt{N}]$ (close to it)
% \item  $\E{ \rinf~|~\overline{\mathcal{E}_{ext}}}  = \left(1+\frac{W \left(-\Rnought_0 \cdot e^{-\Rnought_0} \right) }{\Rnought_0} \right) \pm O(1)$
% \item  $\E{ \rinf}  = \left(1+\frac{W \left(-\Rnought_0 \cdot e^{-\Rnought_0} \right) }{\Rnought_0} \right) \left(1-\left(\frac{1}{\Rnought_0}\right)^s \right) \pm O(1)$
% \end{enumerate}
% Other papers claim: (first order methods)
% \[ \E{ \rinf}  =  \left(1+\frac{W \left(-\Rnought_0 \cdot e^{-\Rnought_0} \right) }{\Rnought_0} \right)
% \]
% But this is wrong.

%without applying the  common mean-field approach. {\bf bit more text}
%Most of the research on the SIR model has focused on studying the differential equations of the expected values
% $\frac{\partial}{\partial t}\E{S_{t}},\frac{\partial}{\partial t}\E{I_{t}},$ and $\frac{\partial}{\partial t}\E{R_{t}}$. We show via simulations that, in contrast to common belief, this does give the correct bound on
% $\E{I_{t}}$ and does therefore also not give the correct bound on the total number of infections $\rinf $ throughout the process ($\rinf = \lim_{t\rightarrow \infty} R_t$).
% The bounds, are however, correct when the number of infections has spread to significant number of nodes. To make this rigorous, we show formally a sharp bound on $\E{\rinf~|~I_{t_0} = \sqrt{N}}$.
% To round it up, we prove for a large range of $\beta$ and $\gamma$  almost-tight bounds on the probability of an early extinction. Formally we define this as reaching $I_t = 0$ before reaching $I_t=\sqrt{N}$. The proof proceeds by a series of interesting couplings.
% Combining all our results, yields an almost-tight bound on $\rinf$, by simply using law of total probability
%
% \begin{equation}\label{eq:mastereq}
% \E{\rinf} = \E{\rinf~|~I_{t_0} = \sqrt{N}}\Pr{I_{t_0}=\sqrt{N}} +\E{\rinf~|~I_{t_0} = 0}\Pr{I_{t_0}=0} .
% \end{equation}
% To prove our bounds, we have to overcome multiple challenges, the biggest being that we cannot simply analyse the process as a biased random walk on $I_t$ (nor as a Galton-Watson process). This is because the change in the number of infected individuals depends on $S_t$ as well.  We overcome the difficulties by using  a series of couplings, which might be of independent interest.
% {\bf Petra: So nicht verstaendlich}


% David's edits.


% individuals are born in the susceptible class. Upon contact with an infected individual, susceptible individuals may get infected and move into the infected class. Once the immune system clears the infected individuals, infected individuals become immune and move to the recovered class. We assume that in each round one of the population members are picked uniformly at random. if the individual is infected it recovers with a probability $\gamma$. If it is susceptible the individual will be infected with a probability of $\beta I/N$, where $\beta$ is the infection rate. Hence, the probability depends on the size of the infected population
% and it equals $\beta$ time the probability that a randomly chosen individual is infected. This is a standard assumption  for air-born diseases. Note that one  could also assume that a susceptible individuals meets another individual and gets infected if that individual is infected. Finally,  it is assumed recovered individuals never get susceptible again. Hence, the number of recovered individuals grows over time. 


% \medskip


In the second part of our paper we consider a two-country setting: we show through simulations how travelling affects the spread of the disease.
% {\pink combined with the impact of $\Pr{\mathcal{E}_{ext}}$ in such a case}. 
A ban, or restriction, on travelling has been considered and applied by governments in some cases, especially during the COVID-19 pandemic. Such a measure falls under the label of non-pharmaceutical interventions (NPIs).
% , actions taken to mitigate the spread of a disease which are not of the medicine related type, such as travel bans and social distancing.
We introduce an extension of the model where each individual resides in one of two countries.  We assume that the two countries can have different infection rates, $\beta^{(1)}\neq \beta^{(2)}$, and population sizes $N^{(1)}$ and $N^{(2)}$, with $N=N^{(1)} + N^{(2)}$.  The process is similar to our original process. First, an individual $i$ is chosen uniformly at random among the whole population of both countries (i.e., each individual is chosen with probability $1/N$). The chosen individual then travels to the other country with probability $\ptr$ and remains in its current country w.p. $1 - \ptr$.
After this, one of the two countries is chosen with probability proportional to its population, and one step of the single-country process is performed in that country. We study, via simulations, the connection between $\ptr$ and the total number of infections over time. We obtain interesting and surprising insights. 
For example, one would expect that increasing $\ptr$ increases the total number of infections. While this is generally true for small values of $\ptr$ (though for some values of $\beta$ and $\gamma$ not even this holds), it turns out that it fails if one country has $\Rnought_0^{(1)}=\beta^{(1)}/\gamma>1$ and the other country has a sufficiently small $\Rnought_0^{(2)}<1$. In that case, for large enough values of $\ptr$, the total number of infections decreases. Our simulations suggest that early extinction affects the size of the epidemic, reducing its average and increasing its variance.

 %This surprising effect  should be consider when creating travel policies for the COVID-19 outbreak; at least if the parameters are suitable. Of course, other factors also have to be taken into account such as the increased contact when travelling.


% can be explained by the ``effective averaging'' of the $\beta$s. If the effective $\Rnought_0$, which is a function of the $\beta$s, $\gamma$ and $\ptr$ dips below $1$ in both countries, this causes an early extinction in all countries. Therefore, a case can



\subsection{Related Work}
%\begin{enumerate}
%    \item Frog Model
%    \url{https://arxiv.org/pdf/1610.04301.pdf}
%    \item Hayk rumour spreading \url{https://dl.acm.org/doi/10.1145/3293611.3331622}
%    \item Epidemics on envolving graphs \url{https://arxiv.org/abs/2003.08534}
%\end{enumerate}
The SIR model is a very basic mathematical model for the spread of diseases (e.g., it does not model the birth and death of individuals, the latter resulting from the illness or other reasons), and a very large number of generalizations have been suggested in the literature. Due to the vast literature, we only discuss results on the SIR model that are comparable to ours, and we concentrate on analytical results. 
There are also many compartmental models similar to the SIR model.
% , for example the SIS model in which agents can become susceptible again. 
See \cite{Li18, D99} for an overview of different mathematical models.


% \medskip

The SIR model was introduced in 1927 by \cite{kermack1927contribution}, building on work by \cite{ross}. The authors also introduced  the well-known set of non-linear ordinary differential equations  (which we introduce in this paper too) to describe the spread of the disease.  
%Due to the growing importance of theoretical models for epidemics, the articles were republished in 1991 \cite{KM91}. 
The authors proved the so-called threshold theorem, which predicts the critical fraction of susceptible individuals in the population that must be exceeded if an epidemic is to occur. They also show that 
$
    s_{\infty}=1-r_{\infty}=s_0 e^{-\Rnought_0(r_{\infty}-r_0)}
$, where 
$s_{\infty}=|S_{\infty}|/N$ (fraction of susceptible individuals as time goes to infinity), $r_{\infty}=|R_{\infty}|/N$, $s_{t}=|S_{t}|/N$, and  $r_t=|R_{t}|/N$.
This transcendental equation has a solution in terms of the Lambert $W$ function: $s_{\infty}=-\Rnought_0^{-1} W(-s_0 \Rnought_0 e^{-\Rnought_0(1-r_0)})$.
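
The Lambert-$W$ form can be checked directly; the following short derivation is a sketch that only assumes that no infected individuals remain at the end, so that $r_{\infty}=1-s_{\infty}$. Substituting this into the final-size relation gives
\[
s_{\infty} = s_0\, e^{-\Rnought_0(1-r_0)}\, e^{\Rnought_0 s_{\infty}} .
\]
Multiplying both sides by $-\Rnought_0 e^{-\Rnought_0 s_{\infty}}$ yields
\[
\brac{-\Rnought_0 s_{\infty}}\, e^{-\Rnought_0 s_{\infty}} = -s_0 \Rnought_0\, e^{-\Rnought_0(1-r_0)} ,
\]
which has the form $x e^{x} = y$ with $x=-\Rnought_0 s_{\infty}$. Hence $x = W(y)$, which is the stated expression for $s_{\infty}$. In the limiting case $s_0 \to 1$ and $r_0 = 0$, this becomes $r_{\infty} = 1 + W\brac{-\Rnought_0 e^{-\Rnought_0}}/\Rnought_0$.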

% \cite{KMII91} consider variants of their SIR model: For example they show that an increase of the birth rate (new individuals which enter the system as susceptible ones)  results in an increase in the rate of infection.

% \medskip 
Most of the publications investigating the SIR model use numerical methods, employing a wide range of techniques (see \cite{B06,RDG07,RGD07} and the references therein). There is a huge amount of literature studying very diverse effects in the SIR compartmental model. Much of it approximates the random process with a deterministic process defined via ordinary differential equations, in which early extinction of the process is not possible. 
% 
In \cite{HLM14} the authors derived an exact analytical solution to the SIR model in parametric form. A similar result was shown by \cite{M12, M17}. In \cite{SKS11} an exact analytical solution to the  SIR and SIS models with constant population is obtained with the help of direct integration tools. 
\cite{lefevre2020sir} propose a block-structured Markov process to describe the spread of epidemics of 
% Susceptible-Infected-Removed 
SIR type and they determine the distribution of the final state of the process. In \cite{BR15} the authors present a new method for the recursive computation of the epidemic size distributions. 
%The method is applicable to the SIR model and other homogeneously-mixing Markovian models. Their method is both exact, numerically stable and computationally efficient.
The authors do not estimate the expected size of the epidemic nor do they give a closed form of its distribution.

% In \cite{lefevre2020sir} the authors  propose a Markov process to describe the spread of epidemics of Susceptible-Infected-Removed (SIR) type. The authors calculate the distribution of the final state of the process. 



\cite{PCMV15} give a comprehensive overview of epidemic processes in networks, where the individuals are connected by a network modelling social contacts, such that the infection spreads from a node to its neighbours. 
%The authors also comment on analytical approaches for these models and on the close relation to models from statistical physics. 
The authors of \cite{janson2014law} consider random graphs with a given degree sequence and prove that there is a threshold that is a function of $\gamma$, $\beta$, and the given vertex degrees. Below the threshold, only a small number of infections occurs, while above it most of the graph gets infected. \cite{kempe2003maximizing} consider influence maximisation in the independent cascade model introduced in \cite{goldenberg2001talk, goldenberg2001using}. The process works in parallel rounds and starts with a set of active (infected) nodes. Every active node infects every non-infected neighbour with probability $p$. Then the active nodes become inactive and the newly infected neighbours become active.  The process runs until no more activations are possible. The optimization problem of selecting the most influential nodes to be activated in the beginning is NP-hard, and the authors provide the first provable approximation guarantees for efficient algorithms.
For an overview of results in this model, see \cite{shakarian2015independent}.

%In \cite{BB11} the authors analyse a model where each individual is connected to only two other individuals and the probability to become infected depends on the number of infected individuals. The authors calculate exactly the time-dependent density of infected, recovered and susceptible populations for random initial conditions. 
%Authors in \cite{allen2000comparison} study a stochastic SIR version and compare it with the deterministic one. Unlike our model, they assume a positive probability of an individual dying, and, for a constant total population, add a (susceptible) newborn to the model.
% {\color{cricolor}{Another analysis on stochastic models is done in \cite{bartlett2020deterministic}, in which the inadequacy of deterministic models for epidemics is discussed, especially in the case of small population sizes. The work translates the deterministic equations to non-linear stochastic equations, later approximated to  a linear version defined as \emph{orthodox bivariate linear time-series}. The analysis does not take into account early extinction and
% % The analysis is however brief and 
% only observations on the variance of the number of infections over time are given. A similar attempt is presented in \cite{ball1983threshold}. However, the author shows how epidemic processes can be turned into stochastic birth-death processes: both in the continuous and discrete time processes the analysis requires the population size $N \rightarrow \infty$, leading to a different behavior than our SIR model with fixed population size. Further, our bounds on $\Pr{\mathcal{E}_{ext}}$ are more specific and directly connected to $\gamma$ and $\beta$. In the work there is also a focus on a multi population scenario, although there is no mention to travelling between populations.}}
% \medskip

% In \cite{Whittle55} the authors approximate  the extinction probability in the  stochastic SIR model  with $1-1/\Rnought_0$. \fnote{What does this mean?}. They analyse the process via branching process and assume that  infected individual infects  susceptible individuals independent from each other.  \dnote{I couldn't identify which of Whittle's 1995 paper is this yet}

\cite{bohman2012sir} consider an SIR process on random graphs with a given degree sequence in a continuous-time model. Each infected node sends infections to each of its neighbors at times determined by independent exponential random variables with parameter $\lambda$. An infected node recovers at a random time given by an independent exponential random variable with parameter $\rho$. The authors assume that the infection spreads from a single infected node and show that either the disease halts after infecting only a small number of nodes, or an epidemic spreads to infect a linear number of nodes. The authors also show that, conditioned on the event that more than a small number of nodes are infected, the epidemic is likely to follow a trajectory given by the solution of an associated system of ordinary differential equations. Their approach gives bounds on the total number of infected nodes. 
%In \cite{Luczak} the authors show that there is a threshold depending on the degree distribution and the rate of infection  below which only a small number of infections occur. Above the threshold a large outbreak occurs with probability bounded away from zero.

There is also related work on agent-based modelling of the SIR model (e.g. \cite{bicher2013agent}). These works investigate some form of non-homogeneous population (e.g. distances between agents given by a graph) and analyse results empirically, using either geographic information systems \cite{perez2009agent} or social relationships \cite{alzu2021simulation}. 

Many works model how different societal factors play a role in the evolution of an epidemic. In the case of the COVID-19 pandemic, many of them focus on accounting for different factors, like social distancing measures and testing regimens, in order to build a reliable model. In \cite{levesque2021model} the authors create a Crump-Mode-Jagers continuous-time branching process modelling COVID-19 propagation in order to decide which mitigation strategies are more effective. Similar works developed mathematical and data-driven models in order to establish the efficacy of such measures \cite{sun2020lift, choi2021optimal, liu2021predicting}. However, there is still a lack of work studying the role that travelling plays in an epidemic. 
In \cite{arino2003basic}, the authors incorporate the concept of travel into the ordinary differential equations for a Susceptible-Exposed-Infected-Removed-Susceptible (SEIRS) model. They derive bounds on $\Rnought_0$ and show that a disease-free equilibrium is reached; however, they only hint at its uniqueness through simulations, and they do not discuss the final expected size of the population. Their model is complex and detailed, with the central assumption that individuals return to their country of origin before leaving again for any other country. This is different from our work, where we do not require individuals to follow travel patterns. 
%The work in \cite{arino2003basic} stems from \cite{sattenspiel1995structured, sattenspiel1998structured}, where SIR compartmental models are employed for the 1984 Measles epidemic in the island of Dominica and for the 1918-1919 Spanish flu pandemic. No new mathematical analysis is described in them. 
In \cite{zakary2017new}, a multi-region discrete-time model describes the spatio-temporal spread of an epidemic. Starting from one region, the epidemic enters regions that are connected to their neighbors by any means favouring movement. As in our case, the authors consider homogeneous SIR populations. 
%The authors of \cite{an2021dynamic} apply reinforcement learning and develop an online optimization model for the trade-off between epidemic control and the negative impacts on regional economy.



    
    
%Not using differential stuff:

%Ganesh, Massoulié, Towsley: The Effect of Network Topology on the Spread of Epidemics (2005)
%Relating graph properties and epidemic lifetime\\

%Zhu, Ying: Information Source Detection in the SIR Model (2013)
%MLE of patient zero in SIR model after t time steps on a graph\\





%Gandolfi, Cecconi: SIR epidemics on a scale-free spatial nested modular network (2016) epidemic threshold for a hypercubes?\\


%Book from David \\








\section{Models and Results}\label{sec:model}
In this section we formally introduce the two models considered in this paper. 
Both models work in sequential rounds and are population protocols \cite{AADFP06}.  
At every round, two individuals $i$ and $j$
are picked uniformly at random. 
Individual $i$ can either become infected, recover, or stay susceptible, depending on the current state of $i$ and $j$. In our second model, individuals can also travel.
We study discrete-time models, where an action occurs in each integer step. One major advantage of discrete-time models over continuous-time models is that they allow us to study the early extinction phenomenon. In contrast, continuous systems and, in particular, differential equations fail to capture early extinctions.
% The main reason is the focus on the probability of early extinction. Indeed, non-integer variations on the number of infections do not allow for a clear trend towards extinction, rather they perpetuate the infection process, not allowing for a distinction between an early extinction and a pandemic-like evolution of the process.


\paragraph{Single Country Model.} 
Let $S_t,I_t,R_t$ be the number of susceptible, infected and recovered individuals at time $t$, with  $S_t+I_t+R_t=N$, where $N$ is the total number of individuals. Our SIR process is defined as follows. 
In every round two individuals  $i,j$ are picked uniformly at random.   
\vspace{-2ex}
\begin{enumerate}\itemsep0pt
\item If individual $i$ is infected, then it recovers w.p. $\gamma$. 
\item If individual $i$ is susceptible and $j$ is infected, then $i$ becomes infected w.p. $\beta$. 
\end{enumerate}
\vspace{-2ex}
Note that $\beta\cdot I_t/N$ is the probability that the selected susceptible individual $i$ becomes infected. We assume that $\beta$ and $\gamma$ are both constants. In expectation, the system evolves as follows.
%The expected change of the quantities matches the differential equations used for SIR model, scaled by a factor of $1/N$. This scaling is naturally occurring in asynchronous models since at any time at most one individual is active.  
%If one were to group $N$ rounds into one super-round, then the expected changes match exactly those of the differential equations.
\begin{eqnarray}
\E{I_{t+1}}&=&I_t+\beta(I_t/N) (S_t/N)-\gamma (I_t/N) \label{eq:Ichange}\\
\E{S_{t+1}}&=&S_t-\beta (I_t/N)(S_t/N)
\label{St}\\
\E{R_{t+1}}&=&R_t+\gamma  (I_t/N)\label{ItR}
\end{eqnarray}
The reproduction number is defined as $\Rnought_0=\beta/\gamma$. This number equals the expected number of infections caused by an infected individual, assuming that $S=N$. Over time, $S_t$ decreases, so that every newly infected individual causes fewer and fewer infections. The herd immunity threshold is defined as $1-1/\Rnought_0$. This is the fraction $1-S_t/N$ of non-susceptible individuals at which $\Rnought_0\cdot S_t/N=1$.
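
As an illustration of these definitions, consider the hypothetical values $\beta=0.5$ and $\gamma=0.2$ (chosen here for concreteness only). By \eqref{eq:Ichange}, the expected change in the number of infections is
\[
\E{I_{t+1}} - I_t = \frac{I_t}{N}\left(\beta\,\frac{S_t}{N}-\gamma\right),
\]
which, for $I_t>0$, is positive exactly when $S_t/N > \gamma/\beta = 1/\Rnought_0$. For the values above,
\[
\Rnought_0 = \frac{0.5}{0.2} = 2.5, \qquad \frac{1}{\Rnought_0} = 0.4, \qquad 1-\frac{1}{\Rnought_0} = 0.6,
\]
so in expectation the number of infections grows as long as more than $40\%$ of the population is susceptible, and shrinks once more than $60\%$ of the population is no longer susceptible.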


In \cite{JM01}, a pandemic is defined as ``an epidemic occurring worldwide, or over a very wide area, crossing international boundaries and usually affecting a large number of individuals''.  In this work we
assume a simplified notion of pandemic, depending only on the total number of infected individuals.  
We say a process results in a \emph{pandemic} if at one point of time the number of active infections ($I_t$) reaches $\sqrt{N}$.\footnote{
% The value of $\sqrt{N}$ is arbitrary; 
Our results suggest that any value $> \log N$ scaled with $\beta$ and $\gamma$ will work, as reaching it will ensure that eventually a constant fraction of $N$ will become infected with very high probability. The reason for $\sqrt{N}$ is it avoids using terms of $\gamma$ and $\beta$ in the definition, assuming they are both constants (independent of $N$). 
% We define extinction as never reaching $\sqrt{N}$ infected individuals at any point of time. Note that any threshold in $[\omega(1), o(N)]$ would do. 
There is virtually no difference between a threshold of, say, $C \log N$ and any value of the form $N^{1-\varepsilon}$ for constants $C=C(\beta,\gamma)$ and $\varepsilon<1$, since, if an infection reaches size $C\log N$, then it will also reach $N^{1-\varepsilon}$ with high probability.}
Moreover, if the process does not result in a pandemic, then we say it \emph{goes extinct early}. We denote the corresponding events by $\mathcal{E}_{pan}$ and  $\mathcal{E}_{ext}$. Theoretically, over time a constant fraction of the population might become infected, even though at any point in time the number of current infections stays below $\sqrt{N}$. However, with the same techniques as we use in this paper, one can show that this event happens with exponentially small probability; it is akin to a biased random walk on a line remaining in the interval $(1,\sqrt{N})$ for a linear number of rounds.
% As examples, according to the Center for Disease Control and Prevention, it is assumed that the Spanish flu infected one third of the total world population, and 
% the black death killed 20 \% of London's inhabitants. 
Finally, we  define $\Rinf$ as the \emph{epidemic} final size; since all individuals eventually recover this corresponds to $\Rinf = \lim_{t\rightarrow \infty} R_t$. We define $\rinf = \Rinf/N$. Let $W$ denote the 
Lambert W-function, 
which is the inverse of function $f(W)=W e^W$
considered for $W\in [-1,\infty)$ (where function $f$ 
increases monotonically from $-1/e$ to infinity).




\paragraph{Model for two countries.} \label{sec:MultipleCountry}
In this case we assume that the $N$ individuals are distributed over two countries. Let $N_t^{(1)}$ and $N_t^{(2)}$ denote the number of individuals staying in country $1$ and $2$ at time $t$. We assume $N_0^{(1)}=N_0^{(2)}=N/2$.
The number of susceptible, infected and recovered individuals at time $t$ in country $i$ is denoted by $S_t^{(i)},I_t^{(i)}$ and $R_t^{(i)}$ for $i \in \{1,2\}$.  Initially we have $I_0^{(1)}=I_0^{(2)}=s$ for $s\ge 1$. 
We assume that every country has its own infection rate: $\beta^{(1)}$ and 
$\beta^{(2)}$. The recovery rate in both countries is $\gamma$. 
The process has two selection steps and step $t$ works as follows.
\begin{enumerate}\itemsep0pt
\item Pick an individual $i$ uniformly at random (with prob. $1/N$). 
 With probability $\ptr$ individual $i$ travels from its current country to the other country.   Adjust  $N_t^{(1)}$ and $N_t^{(2)}$ accordingly. 
 \item Pick country $\ell$ as follows: $\ell = 1$ w.p. $N_t^{(1)}/N$ and $2$ otherwise. 

 
 An SIR step as described above is applied
 on the chosen country.
 \begin{enumerate}\itemsep0pt
 \item Pick a pair of individuals $i$ and $j$ uniformly at random from country $\ell$.
\item If individual $i$ is infected, then it recovers w.p. $\gamma$. 
\item If individual $i$ is susceptible and $j$ is infected, then $i$ will become infected w.p. $\beta^{(\ell)} $.
\end{enumerate}
 \end{enumerate}
 
Compared to existing works, this model is simpler and poses minimal constraints on the movement of individuals. This makes it easily adaptable, for example to a change in the number of countries involved. Further, a simpler model might facilitate future analytical results regarding the expected size of the epidemic.
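
The travel step can be summarised in expectation as follows; this is a sketch derived from step~1 of the process and is not stated in this form elsewhere in the paper. The selected individual is in country $1$ with probability $N_t^{(1)}/N$ and in country $2$ with probability $N_t^{(2)}/N$, and it moves with probability $\ptr$, so
\[
\E{N_{t+1}^{(1)}} = N_t^{(1)} + \ptr\cdot\frac{N_t^{(2)}-N_t^{(1)}}{N}.
\]
Hence the expected drift of $N_t^{(1)}$ vanishes exactly when both countries have size $N/2$, and otherwise it points towards the balanced configuration.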



%\begin{enumerate}
%    \item  $ \Pr{ i \in I^{(j)}_{t+1} ~|~ i \in S^{(j)}_t} = \beta^{(j)}\cdot  %\frac{I^{(j)}_t}{N^{(j)}} $
%    with the remaining probability $i$ stays susceptible, i.e., remains in $S^{(j)}_t$
%   \item  $ \Pr{ i \in R^{(j)}_{t+1} ~|~ i \in I^{(j)}_t} = \gamma $
%   with the remaining probability $i$ stays in infected, i.e., remains in $I^{(j)}_t$
%    \item  $ \Pr{ i \in R^{(j)}_{t+1} ~|~ i \in R^{(j)}_t} = 1 $
%\end{enumerate}







\subsection{Results}
Our main result, \cref{thm:Pext}, gives a very tight characterisation of the early extinction probability. Recall that a process goes extinct early if the infection dies out before the number of infected individuals ever reaches $\sqrt{N}$, where $N$ is the size of the population. Recall that $\Rnought_0=\beta/\gamma$ is the reproduction number, $\Rinf$ is the total number of infected individuals and $\rinf=\Rinf/N$.
First we present a simplified version of \cref{thm:Pext}.

\begin{theorem}[Simplified version of \cref{thm:Pext}]\label{thm:zero}
Let $\beta + \gamma \leq 1$.
\label{thm:earlyextinction} Consider the single country model starting with  $I_0 = s\le \frac{\sqrt{N}}{2\log{N}\log\log{N}}$ infections.
 Let $\epsilon = 5\log N/\sqrt{N}$ and let $\Pr{\mathcal{E}_{ext}}$ be the probability of early extinction. 
  \begin{enumerate}
    \item  If $\beta < \gamma - \epsilon$, then 
    $\Pr{\mathcal{E}_{ext}}\ge 1-o(1)$. 
    \item  If $\beta > \gamma+ \epsilon$, then  
    $\Pr{\mathcal{E}_{ext}} = \left(1/\Rnought_0\right)^s\pm o(1)$.
\end{enumerate}
\end{theorem}
For the case  where $\beta>\gamma$ and $s=1$  the probability of early extinction is essentially $1/\Rnought_0=\gamma/\beta$. If $s>1$   the probability decreases exponentially in $s$.
If $\beta<\gamma$, then the process is very likely to reach an early extinction.
We leave the case $\beta=\gamma$ open but note that $\epsilon$ becomes arbitrarily small as $N$ grows. The challenging part in the proof of \cref{thm:Pext} is the non-linear update rule of our process. 
% \footnote{
Note that rewriting \eqref{eq:Ichange}-\eqref{ItR} by replacing $R_t=N-S_t - I_t$ reveals the non-linearity.
% }  
To overcome these challenges, we use a series of  couplings, allowing us to relate our SIR process to a biased random walk. 
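
To illustrate the second case of \cref{thm:zero}, take the illustrative values $\beta=0.5$ and $\gamma=0.2$ (assumed here for concreteness; they satisfy $\beta+\gamma\le 1$), so that $1/\Rnought_0=0.4$. Up to the $o(1)$ error term, the theorem then gives
\begin{align*}
s=1:&\quad \Pr{\mathcal{E}_{ext}} \approx 0.4,\\
s=2:&\quad \Pr{\mathcal{E}_{ext}} \approx 0.4^2 = 0.16,\\
s=5:&\quad \Pr{\mathcal{E}_{ext}} \approx 0.4^5 \approx 0.01.
\end{align*}
Thus a single initial infection dies out early in roughly two out of five runs, whereas five initial infections do so in about one run out of a hundred.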



Note that in the above result $\mathcal{E}_{ext}$ is defined as the event of never having $\sqrt{N}$ infected individuals at the same time. Hence, in principle it is still possible that $\Rinf=\Omega(N)$, i.e., that at the end of the process the number of recovered individuals (which equals the total number of infections) is linear in $N$. In  
\cref{pro:coupling_n1} (see \cref{sec:couplings}) we show that this is not the case: 
in the event of $\mathcal{E}_{ext}$ the number of recovered individuals never exceeds $\sqrt{N }\log N$, w.h.p.
This means that our bounds on the early extinction probability still hold even if the definition were changed to require $\Rinf \leq \sqrt{N }\log N$.
%



% \medskip



% \begin{wrapfigure}{R}{0.4\textwidth}
% \centering
% \includegraphics[width=0.4\textwidth]{R0.png}
% \caption{\label{fig:plotofW}The plot shows $\rinf$ for different values of $\Rnought_0=\beta/\gamma$ and $s=1$.}
% \end{wrapfigure}

The next theorem shows a bound on the expected number of total infections $\E{\Rinf} $, expressed in terms of the Lambert function $W$. Note that $W(x) < 0$ for $x\in [-1/e, 0)$.  As mentioned before, many approaches, such as first-order methods, mean-field approaches and ordinary differential equations (ODEs) (see, e.g., \cite{kroger2020analytical,bicher2013agent}), cannot account for early extinction due to the underlying determinism. Instead, they obtain bounds of the kind
$\E{\rinf \ |\ \overline{\mathcal{E}_{ext}}} $.  
There has also been some work based on stochastic differential equations (SDEs) (e.g., \cite{allen2008,williams2012stochastic}), complemented with simulation results. We are not aware of results based on SDEs that obtain closed-form results for $\E{\rinf}$.

\begin{theorem}\label{thmn:combined}
Assume that $|\beta -\gamma| \geq \epsilon$ 
for an arbitrarily small constant $\epsilon$.
Assume $N=N(\epsilon)$ is large enough.
Let $W:[-1/e,\infty)\longrightarrow [-1,\infty)$ be the Lambert function.
Consider the single country model starting with  $I_0 = s\le \frac{\sqrt{N}}{2\log{N}\log\log{N}}$ infections.
% Let $\epsilon = 5\log N/\sqrt{N}$. 
  \begin{enumerate}
    \item  If $\beta < \gamma - \epsilon$, then the expected total number of infections is sublinear, i.e.,
    $\E{ \rinf} = o(1)$. 
    \item  If $\beta > \gamma + \epsilon$, then\\ 
    $\quad \E{ \rinf} \sim  \left(1+\frac{W \left(-\Rnought_0 \cdot e^{-\Rnought_0} \right) }{\Rnought_0} \right) \left(1-\left(\frac{1}{\Rnought_0}\right)^s \right)$. 
 \end{enumerate}
 See \cref{fig:plotofW} for a depiction.
\end{theorem}
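
As a numerical illustration of the second case (the values are chosen here for concreteness and do not come from the simulations), let $\Rnought_0=2$ and $s=1$. Then $-\Rnought_0 e^{-\Rnought_0}=-2e^{-2}\approx -0.2707$ and $W(-0.2707)\approx -0.4064$, so
\[
\E{\rinf} \sim \left(1-\frac{0.4064}{2}\right)\left(1-\frac{1}{2}\right) \approx 0.797\cdot 0.5 \approx 0.398.
\]
The first factor is the fraction infected when no early extinction occurs, and the second factor is the probability of avoiding early extinction from \cref{thm:zero}.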

\begin{figure}
    \centering
    \includegraphics[width=0.4\textwidth]{figures/R0.png}
    \caption{The Plot Shows $\rinf$ for Different Values of $\Rnought_0=\beta/\gamma$ and $s=1$.}
    \label{fig:plotofW}
\end{figure}

The following theorem estimates the expected number of infections conditioned on not having early extinction. It is used to show \cref{thmn:combined}.
It yields a result similar to the one obtained through first-order methods used by
\cite{allen2008,williams2012stochastic}, but adapted to our population-based model.
In \cref{thm:SIRonecountry} (in \cref{sec:totalinfection}) we provide a generalisation of this theorem.








\begin{theorem}\label{thm:cat}
Consider the single country model.
Assume that $\beta > \gamma$ are constants.
  Let 
 $W:[-1/e,\infty)\longrightarrow [-1,\infty)$ be the Lambert function. Then, for large $N$,
 \begin{equation}\label{sadnasdjnaskdhksa}
 \E{ \rinf~|~\overline{\mathcal{E}_{ext}}} \sim  \left(1+\frac{W \left(-\Rnought_0 \cdot e^{-\Rnought_0} \right) }{\Rnought_0}\right).
 \end{equation}  
 Furthermore,
 $\rinf$ is concentrated around its expected value.
\end{theorem}
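
The right-hand side of \eqref{sadnasdjnaskdhksa} can be recognised as the non-trivial solution of the classical final-size equation $r = 1-e^{-\Rnought_0 r}$. The following short verification is added for the reader's convenience and is not part of the proof. Writing $u=-\Rnought_0(1-r)$ and using $1-r=e^{-\Rnought_0 r}$, we get
\begin{align*}
u e^{u} &= -\Rnought_0 (1-r)\, e^{-\Rnought_0(1-r)} \\
&= -\Rnought_0\, e^{-\Rnought_0 r}\, e^{-\Rnought_0}\, e^{\Rnought_0 r} \\
&= -\Rnought_0\, e^{-\Rnought_0},
\end{align*}
so $u=W(-\Rnought_0 e^{-\Rnought_0})$ and therefore $r = 1 + W(-\Rnought_0 e^{-\Rnought_0})/\Rnought_0$. The trivial solution $r=0$ corresponds to $u=-\Rnought_0<-1$, which lies outside the range $[-1,\infty)$ of $W$ when $\Rnought_0>1$, so the branch considered here selects the non-trivial solution.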


\section{ Early Extinction }\label{sec:Pext}
 
The SIR model described in \cref{sec:model} often results in herd immunity, where such a large fraction of the population has been infected that only very few susceptible individuals remain. From then on, the number of infections decreases slowly until it finally reaches zero. 
However, in some cases, even when $\beta > \gamma$, the number of infections remains low and the virus vanishes very suddenly. This is what we refer to as an {\it early extinction}, and it is the focal point of this section. We will derive bounds on the probability of an early extinction in terms of the parameters $\gamma$ and $\beta$. The formal statement is given in the next theorem and implies \cref{thm:earlyextinction}.


\begin{theorem} \label{thm:Pext}
    Consider the single country SIR model as described in \cref{sec:model}. Define $\tau = \sqrt{N}\log N$, 
    % $\epsilon$ be an arbitrarily small positive constant,
    $\epsilon = 5\log N/\sqrt{N}$, and $\epsilon' = \eps (N-\tau)/N$, and assume $I_0=s$ for any $s\le \frac{\sqrt{N}}{2\log{N}\log\log{N}}$.
      Then we have for $\beta > \gamma + \eps$

    
    $\left(\frac{\gamma}{\beta}\right)^s- \frac3N ~\leq~ \Pr{\mathcal{E}_{ext}}
         ~\leq \left(1 + \frac{s\tau}{N-s\tau}\right) \left(\frac{\gamma}{\beta }\right)^s + \frac2N$
    
    \mbox{and for } $\beta < \gamma - \eps$
    
    $1- \left(\frac{\gamma}{\beta}\right)^{s-\sqrt{N}}-\frac2N ~\leq~ \Pr{\mathcal{E}_{ext}} ~\leq~ 1$
    
\end{theorem}






To show \cref{thm:Pext} our approach is as follows. 
Ideally, we would like to bound the number of infections as a (biased) random walk over the number of infected individuals. The probability that such a random walk reaches either of its end points
\iffull
can be limited by a bound like the one given in \cref{pro:CaminataAleatoriaParcial}.
\else
is well understood.
\fi
Unfortunately, here the probability to ``walk'' from $I_t$ to $I_t-1 $ or $I_t+1$ is a function of both $I_t$ and $S_t$.
We avoid the dependency on $I_t$ by considering {\em active} steps only, i.e., steps in which the number of infected individuals either increases by one or decreases by one. 
 This allows us to drop the terms $I_t/N$  in the transition probabilities.
We circumvent the dependency on $S_t$ by defining a family of processes and coupling them with each other
(see \cref{sec:couplings}). The processes will be defined and motivated later in this section. 
Once both problems are solved, we are left with four biased random walks (upper vs lower bounds and $\beta < \gamma$ vs $\beta > \gamma$). 
%that we analyse using  \cref{pro:CaminataAleatoriaParcial} (see \cref{main}). 
This allows us in \cref{thm:Pext} to derive almost tight bounds on $ \Pr{\mathcal{E}_{ext}}$.

%\medskip







\subsection{Couplings}\label{sec:couplings}
In this section our goal is to show \cref{pro:alltogether}, which 
bounds the probability of early extinction of our process from above and below by the extinction probabilities of two biased random walk processes.
\cref{pro:alltogether} shows that the approximation error is tiny and has only a second-order impact on the extinction probability.
In order to prove \cref{pro:alltogether}, we will define two intermediate processes and provide three pairwise couplings. 

In this section, unless stated otherwise, we only consider active steps, i.e., steps where $I_{t+1} \neq I_t$. The subscript $t$ now counts only 
the active steps.
Recall that $\mathcal{E}_{ext}$ (early extinction) is the event that the number of infections reaches zero before ever  having  $\sqrt{N}$ infected individuals at the same time.
% 
Define the stopping time
$T = \min\left\{ t : I_t = 0 \text{ or } I_t = \sqrt{N} \right\}$.
Hence, $T$ refers to the number of active steps until there are either zero or $\sqrt{N}$ many infected individuals, and 
$\mathcal{E}_{ext}$ is the event that $I_{T} = 0$. 
For any SIR process $P$, we denote by $\xi_t^{(P)}$ the configuration of $P$ at time $t$, which is given by the triplet $\xi_t^{(P)}=\left(S_t^{(P)}, I_t^{(P)}, R_t^{(P)}\right)$. We define
    $\tau =\sqrt{N}\log{N}$ and 
a second stopping time $T_\tau = \min\{ t : I_t = 0 \text{ or } I_t = \sqrt{N} \text{ or } t = \tau\} = \min\{T,\tau\}$.

Compared to the stopping time $T$, here the process stops not only if the number of infected individuals reaches zero or $\sqrt{N}$, but also if neither happens during the first $\tau$ (active) time steps. All our processes are stopped at time either $T$ or $T_\tau$. We assume that for every $t$ larger than the stopping time we have $\xi_t^{(P)}=\xi_{T}^{(P)}$ or $\xi_t^{(P)}=\xi_{T_{\tau}}^{(P)}$, respectively.
%
Now we are ready to define our  processes, which we call $P, P_{\tau,*}, P_{*,S}, P_{\tau,S}$.

\begin{enumerate}\itemsep0pt
\item $P$ is our original process with the stopping time $T$.
\item $P_{\tau,*}$  is our original process with the stopping time $T_{\tau}$.
\item Fixing $S_t$ over time to a constant $S\in \{N-\tau, N\}$, we define the following biased random walk process $P_{*,S}$. 
At each round the following is done.
\begin{enumerate}\itemsep0pt
\item Draw random numbers $X_I, X_S, X_a$ i.i.d. and u.a.r. from $[0,1]$
\item If $X_I \in [0, I^{(P_{*,S})}_t/N)$, then 
\begin{enumerate}
\item if $X_a \in [0,\beta)$ and $X_S \in [0,S/N]$, increase $I_t^{(P_{*,S})}$;
\item if $X_a \in [\beta,\beta+\gamma)$,  decrease $I_t^{(P_{*,S})}$.
\end{enumerate}
\end{enumerate}
The nice property of process $P_{*,S}$ is that (since we only consider active time steps) it behaves exactly like a biased random walk.  The
increase probability of $I_t$ is $\beta S/N$ and the decrease probability is $\gamma$. For this process we apply the stopping time $T$.

\item For $S\in\{N-\tau, N\}$, we define the process $P_{\tau,S}$.  $P_{\tau,S}$ behaves like $P_{*,S}$ but  we apply the  stopping time $T_\tau$. 




\end{enumerate}
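
The random-walk view of $P_{*,S}$ can be made explicit with the standard gambler's-ruin formula. The following is a sketch under the definitions above, assuming that $\sqrt{N}$ is an integer, and it may differ in form from the bounds used in the proofs. Conditioned on a step being active, $I_t$ increases with probability $p=\frac{\beta S/N}{\beta S/N+\gamma}$ and decreases with probability $q=1-p$. Writing $\rho=q/p=\frac{\gamma N}{\beta S}$ and $M=\sqrt{N}$, for $\rho\neq 1$ a walk started at $s\le M$ satisfies
\[
\Pr{I_T^{(P_{*,S})}=0} = \frac{\rho^{s}-\rho^{M}}{1-\rho^{M}}.
\]
For $S=N$ and $\beta>\gamma$ we have $\rho=\gamma/\beta<1$, and the right-hand side differs from $(\gamma/\beta)^s$ by at most $\rho^{M}$. For $S=N-\tau$ we obtain $\rho^s=(\gamma/\beta)^s(1-\tau/N)^{-s}$, and Bernoulli's inequality gives
\[
\left(1-\frac{\tau}{N}\right)^{-s}\le \frac{N}{N-s\tau}=1+\frac{s\tau}{N-s\tau}
\]
whenever $s\tau<N$, which is consistent with the shape of the upper bound in \cref{thm:Pext}.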




\begin{figure}
\centering
\begin{minipage}[t]{0.49\textwidth}
    \centering
    \begin{tabular}{l|ll}
  & no time limit & time limit $\tau$ \\
    \hline
  $S$ not fixed & 
  $P$  & $P_{\tau,*}$  \\    $S$ fixed & 
    $P_{*,S}$ & $P_{\tau,S}$\\
    \end{tabular}
  \captionsetup{hypcap=false}
    \captionof{table}{Description of the processes.}
  \label{tab:four_processes}
    \vspace{0.3cm}
  \end{minipage}
  \centering
  \begin{minipage}[t]{0.49\textwidth}
    \centering
    \begin{tikzcd}
        P \arrow[leftrightarrow]{r}{\text{Prop }1}  & P_{\tau,*} \arrow[leftrightarrow]{r}{\text{Prop }2} &%
        P_{\tau,S} \arrow[leftrightarrow]{r}{\text{Prop }3}& P_{*,S}
    \end{tikzcd}
    \captionsetup{hypcap=false}
     \captionof{figure}{Diagram of the couplings }
     \label{fig:relashes}
     \vspace{-0.2cm}
  \end{minipage}
\end{figure}

\cref{tab:four_processes} sums up the four processes, while \cref{fig:relashes} shows the couplings we are going to prove in the following sections.
In this section we will show the following result.






\begin{theorem}
\label{pro:alltogether}
Consider the processes $P$ and $P_{*,S}$. 
We have
\[
\begin{split}
&\Pr{I_{T}^{(P_{*,N-\tau})}= \sqrt{N}} - 2/N \leq  \Pr{I_{T}^{(P)} = \sqrt{N}} \\
  &\quad  \leq \Pr{I_{T}^{(P_{*,N})}= \sqrt{N}}  +2/N, \quad  \text{and}
\end{split}
\]
\[
\begin{split}
&\Pr{I_{T}^{(P_{*,N})}= 0}  -2/N  \leq \Pr{I_{T}^{(P)} = 0} \\
  &\quad \leq \Pr{I_{T}^{(P_{*,N-\tau})}= 0 } +2/N.
\end{split}
\]
\end{theorem}


From this it follows that we can analyse the early extinction probability of $P_{*,S}$ instead of that of $P$. To show 
\cref{pro:alltogether} we use three couplings:
\cref{pro:coupling_n1} couples processes $P$ and $P_{\tau,*}$, \cref{pro:coupling_n2} couples  $P_{\tau,*}$ and $P_{\tau,S}$, and \cref{pro:coupling_n3} connects $P_{\tau,S}$ and $P_{*,S}$. 












\begin{proposition}\label{pro:coupling_n1}

With probability $1-1/N$ we have 
$
I_{T}^{(P)}=
I_{T_\tau}^{(P_{\tau,*})}.$

\end{proposition}




\begin{proposition}\label{pro:coupling_n2}
Consider the processes $P_{\tau,*}$ and $P_{\tau,S}$ for $S \in \{N,N-\tau\}$. Let 
 $p=\Pr{I_{T_\tau}^{(P_{\tau,*})} = \sqrt{N}} $. We have $\Pr{I_{T_\tau}^{(P_{\tau,N-\tau})} = \sqrt{N}} \leq p \leq \Pr{I_{T_\tau}^{(P_{\tau,N})} = \sqrt{N}}. $

\end{proposition}
% The proof of both propositions can be found in \cref{sec:proofOfCoupling_n1} and \cref{sec:proofOfCoupling_n2}.
\iffull
Proofs can be found in \cref{sec:proofOfCoupling_n1} and \cref{sec:proofOfCoupling_n2}, resp.
\fi

\begin{proposition}\label{pro:coupling_n3}
Consider the processes $P_{\tau,S}$ and $P_{*,S}$
for $S\in \{N-\tau,N\}$. Then  we have 
\[ \Pr{I_{T_\tau}^{(P_{\tau,N})}= \sqrt{N}}  \leq \Pr{I_{T}^{(P_{*,N})}= \sqrt{N}} + 1/N, \quad \text{and} \]
\[ \Pr{I_{T}^{(P_{*,N-\tau})}= \sqrt{N}} - 1/N \leq  \Pr{I_{T_\tau}^{(P_{\tau,N-\tau})}= \sqrt{N}}.  \]


\end{proposition}
\begin{proof}
This proof is similar to the proof of \cref{pro:coupling_n1}.
\end{proof}


Putting all three couplings together yields \cref{pro:alltogether}.
From this we are finally able to conclude \cref{thm:Pext}, since now we can analyse a biased random walk instead. The proof distinguishes between two cases, depending on whether $\beta$ or $\gamma$ is larger and builds on 
\iffull
\cref{pro:CaminataAleatoriaParcial}.
\else
known results for biased random walks.
\fi
It can be found in the supplementary material.

\newcommand{\probincrease}[2]{p_{#1}^{(#2)}}
\newcommand{\C}{\mathcal{C}}























% %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
% %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
% %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
% %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
% %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
% %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
% %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
% %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
% %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
% %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
% %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
% %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
% %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
% %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
% %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
% %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
% %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%







\section{Total Number of Infections}\label{sec:totalinfection}



In this section we analyse the total number of infections for the SIR model for one country. 
\iffull
The proof of the next result can be found in \cref{thm:proofOfSIRonecountry}. 
\else
The proof can be found in the full version.
\fi


\begin{theorem}\label{thm:SIRonecountry}
Consider the single country model.
Assume that $\beta -\gamma \geq \epsilon$ 
for an arbitrarily small constant $\epsilon$.
Assume $N=N(\epsilon)$ is large enough.
  Let 
 $W:[-1/e,\infty)\longrightarrow [-1,\infty)$ be the Lambert function.
Consider a time step $t^*=\omega(1)$.
Let $\mathcal{E}'$ be the event $I_{t^*} \in [\omega(1),o(N)]$ and $R_{t^*} = o(N)$.
 Then, for large $N$,
 \begin{equation}\label{jk90ws4r}
 \E{ \rinf~|~\mathcal{E}'} \sim  \left(1+\frac{W \left(-\Rnought_0 \cdot e^{-\Rnought_0} \right) }{\Rnought_0}\right).
 \end{equation}  
 Furthermore,
 $\rinf$ is concentrated around its expected value.
  Moreover,  $\E{ \rinf~|~\mathcal{E}'}$ is concave in $\Rnought_0$.
\end{theorem}






\section{Simulations for Two Countries}

In this section, we study the impact of travelling and its relationship to early extinction on the total number of infections.
Recall that $\ptr$ is defined as the travelling rate. 
Without loss of generality we set $\gamma=0.2$  in all our simulations.\footnote{By changing $\beta$ one can get arbitrary
reproduction numbers $\Rnought_0$. Similarly, one can vary $\ptr$ to obtain arbitrary ratios of $\ptr/\gamma$. } We set the initial size of each country to be $N^{(1)}_0=N^{(2)}_0=2000$. Note that these sizes vary to a small extent throughout the process due to travelling individuals. The total population is $N=N^{(1)}_0+N^{(2)}_0 = 4000$. 
Initially, there is one infection per country ($I_0^{(1)}=I_0^{(2)}=1$) and the rest of the population is susceptible.
In our simulations we vary the travelling rate $\ptr$ and the infection rates $\beta^{(1)}$ and $\beta^{(2)}$. W.l.o.g. we  assume $\beta^{(1)}\leq \beta^{(2)}$.
For each value of $\ptr$, $\beta^{(1)}$, and $\beta^{(2)}$, we output the average of $10{,}000$ iterations. 
The outcome of the simulations can be found in 
\cref{fig:std} and
\cref{fig:gallery2}.
For each value of $\ptr$ we plot the average number of infections $\widehat{\Rinf^{(1)}}$ in country one 
in light green and in light blue we plot the average number of infections in country two $\widehat{\Rinf^{(2)}}$. 
Note that we write $\widehat{\Rinf}$ instead of $\E{\Rinf}$ to emphasise the difference between the average and the expected value. 
The total number of infections across both countries $\widehat{\Rinf}=\widehat{\Rinf^{(1)}}+\widehat{\Rinf^{(2)}}$ is plotted in red. 
%We do not exclude early extinction runs as it hides important effects.
We plot the smoothed version of $\widehat{\Rinf}$  in black.
%, however, excluding early extinctions hides important 
Let $\Rinf(\ptr)$ denote the total number of infections for a given travelling rate $\ptr$.
Finally, the corresponding standard deviation is plotted using vertical bars.

First, note that the standard deviation appears large: this is inherent to the process and due to the fact that early extinctions (in this case, when $ \widehat{\Rinf} \leq \sqrt{N} = \sqrt{4000} \approx 63$) occur with constant probability. Indeed, the plots are the result of averaging over all iterations, including the ones ending in early extinction. By excluding runs with an early extinction, the standard deviation becomes negligible. Another effect of this exclusion is that the values of $\widehat{\Rinf}$ turn out to be visibly larger (see \cref{fig:std}). Another interesting detail in this respect is shown in \cref{fig:a}. There, the error bars representing the standard deviation decrease significantly with increasing $\ptr$. This tells us that $\widehat{\Rinf}$ becomes small because the percentage of runs that terminate with early extinction increases significantly, compared to the case where little or no travel is present.
In fact, these runs are then no longer a source of variance but the most likely outcome, hence the reduction in the variance.
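
The size of the error bars can be understood through a simple two-outcome heuristic, which is an idealisation introduced here and not a statement proved in the paper. Suppose a run ends in early extinction with probability $p$, in which case $\Rinf\approx 0$, and otherwise infects about $rN$ individuals. Then
\[
\E{\Rinf}\approx (1-p)\,rN, \qquad \mathrm{sd}(\Rinf) \approx rN\sqrt{p(1-p)},
\]
so the standard deviation is linear in $N$ whenever $p$ is a constant in $(0,1)$, and it vanishes as $p\to 0$ or $p\to 1$. The latter case matches the shrinking error bars observed when almost all runs end in early extinction.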

It is perhaps natural to assume that increasing $\ptr$ also increases $\widehat{\Rinf}$. While this is true for some values of the $\beta$s, there are also values for which 
 $\widehat{\Rinf}$ first increases monotonically and then decreases monotonically. In addition, for large $\ptr$ the number of total infections can drop below the value where no travelling occurs, i.e., $\widehat{\Rinf}(1)<\widehat{\Rinf}(0)$; see
\cref{tableDavid}, \cref{fig:std} and \cref{fig:gallery2}.





\begin{figure}[ht!]
\centering
\begin{subfigure}{.49\textwidth}
  \centering
  \includegraphics[width=1\linewidth]{figures/Plots_eeIncluding_beta2070.png}
  \caption{Including Early Extinction.}
  \label{fig:sub1}
\end{subfigure}\vspace*{\fill}
\begin{subfigure}{.49\textwidth}
  \centering
  \includegraphics[width=\linewidth]{figures/Plots_eeExcluding_beta2070.png}
  \caption{Excluding Early Extinction.}
  \label{fig:sub2}
\end{subfigure}
\caption{The Plots Show $\rinf$ as a Function of $\ptr$ Averaged Across $1000$ Iterations. 
We Consider Two Different Settings.
On the l.h.s. we Plot $\widehat{\Rinf}$. On the r.h.s. we Plot $\widehat{\Rinf}$ Conditioned on $\overline{\mathcal{E}_{ext}}$.
%
The Plots Show that $\widehat{\Rinf}$ Conditioned on $\overline{\mathcal{E}_{ext}}$
has Only very Little Standard Deviation whereas the Standard Deviation of $\widehat{\Rinf}$ is Considerably Larger. 
However, this is Unavoidable since there is a Positive Probability of an Early Extinction, in which Case the Total Number of Infections is Close to $0$. 
Therefore, the Standard Deviation is Naturally of Linear Size.
Note that in All Other Experiments we have Ten Times More Iterations, Suggesting an Adequate Number of Iterations in our Experiments.
}
\label{fig:std}
\end{figure}

\begin{figure}[ht!] % "[t!]" placement specifier just for this example
\begin{subfigure}{0.47\textwidth}
\centering
\includegraphics[width=1\linewidth]{figures/Plots_eeIncluding_beta0030}
\caption{$\beta^{(1)} < \gamma < \beta^{(2)}$ and $\overline{\beta} < \gamma$. The Total Number of Infections $\widehat{\Rinf}(\ptr)$ Appears to be non-Monotone.}\label{fig:a}
\end{subfigure}\,\,\,\vspace*{\fill}
\begin{subfigure}{0.47\textwidth}
\centering
\includegraphics[width=\linewidth]
{figures/Plots_eeIncluding_beta0050}
\caption{$\beta^{(1)} < \gamma < \beta^{(2)}$ and $\overline{\beta} > \gamma$. The Total Number of Infections $\widehat{\Rinf}(\ptr)$ Appears to be non-Monotone.} \label{fig:b}
\end{subfigure}\,\,\,


\begin{subfigure}{0.47\textwidth}
\centering
\includegraphics[width=1\linewidth]{figures/Plots_eeIncluding_beta5080}
\caption{$\beta^{(1)} , \beta^{(2)} > \gamma$. The Total Number of Infections $\widehat{\Rinf}(\ptr)$ Appears to be Monotonically Increasing. See also \cref{tableDavid}.} \label{fig:e}
\end{subfigure}%\vspace*{\fill}





\caption{Simulation results for $\widehat{\Rinf}$ varying $\ptr$.} \label{fig:gallery2}

\end{figure}

\begin{table}[]
    \centering
    \caption{\capitalisewords{The Table Shows our Simulation Results} for $\widehat{\Rinf}(\ptr)$ Given Different Values of $\beta$s over $10{,}000$ Iterations, with the Exception of $(0.2, 0.7)$ -- Where Early Extinction (EE) Runs are Excluded (*) -- that is Averaged Over $1000$ Iterations. We Round to the Nearest Integer.} 
    \scalebox{0.8}{\begin{tabular}{|r||c|c|c|c|c|c|}
    \hline
          \diagbox{$\beta^{(1)}$,  $\beta^{(2)}$}{$\ptr$} & $0$ &	$0.05$	& $0.1$	& $0.2$	& $0.5$ 	& $0.95$ \\ \hline
          \hline
         $0$, $0.3$ & $395$ & $293$ & $121$ & $32$ & $12$ &  $10$ \\ \hline
         $0$, $0.5$ & $1078$ & $1514$ & $1491$ & $1352$ & $1025$ & $857$ \\ \hline
         $0$, $0.8$ & $1470$ & $2266$ & $2416$ & $2497$ & $2499$ & $2500$ \\ \hline
         $0.2$, $0.7$ & $1399$ & $2489$ & $2655$ & $2695$ & $2679$ & $2798$ \\ \hline
         (*) $0.2$, $0.7$ & $1899$ & $3181$ & $3373$ & $3454$ & $3460$ & $3445$ \\ \hline
         $0.3$, $0.3$ & $763$ & $1276$ & $1301$ & $1282$ & $1305$ & $1294$ \\ \hline
         $0.3$, $0.5$ & $1467$ & $2357$ & $2433$ & $2464$ & $2419$ & $2416$ \\ \hline
         $0.3$, $0.8$ &  $1868$ & $2926$ & $3085$ & $3132$ & $3189$ & $3185$ \\ \hline
         $0.5$, $0.8$ & $2543$ & $3367$ & $3407$ & $3428$ & $3443$ & $3465$ \\ \hline
         $0.8$, $0.8$ & $2953$ & $3673$ & $3687$ & $3688$ & $3679$ & $3682$ \\
         \hline
    \end{tabular}}
 

     \label{tableDavid}
\end{table}

\begin{result}
Assume that $\gamma=0.2$.

\begin{enumerate}
\item
 $\beta^{(1)}=0$ and $\beta^{(2)}=0.3$
 results in  $\Rinf(\ptr)$ being non-monotone. Moreover,
$\Rinf(1/2)=O(1)$ (\cref{fig:a}).
\item  $\beta^{(1)}=0$ and $\beta^{(2)}=0.5$
results in $\Rinf(\ptr)$ being non-monotone
(\cref{fig:b}).
\item  $\beta^{(1)}=0.5$ and $\beta^{(2)}=0.8$
results in  $\Rinf(\ptr)$  increasing monotonically
 (\cref{tableDavid}, and also \cref{fig:e}).
 \iffull
 See further examples in \cref{sec:plots}.
 \fi
\end{enumerate}
\end{result}


Based on this, we believe that the simulation results can be generalised  to the following conjecture.

\begin{conjecture}
For all $\beta^{(1)}, \beta^{(2)}$ and $\gamma$ we have
\begin{enumerate}[label=(\roman*)]
\item $\beta^{(1)}, \beta^{(2)} < \gamma$: \quad $\E{\Rinf(\ptr)}=O(1)$ for all $\ptr\in [0,1]$. 
\item  $\beta^{(1)} < \gamma < \beta^{(2)}$:\quad $\E{\Rinf(\ptr)}$ is non-monotone.
\item $\beta^{(1)},\beta^{(2)} > \gamma$:\quad
 $\E{\Rinf(\ptr)}$ increases monotonically.
  Note that \cref{fig:e} suggests that it is important for the $\beta$s to be strictly larger than $\gamma$ in order for  $\E{\Rinf(\ptr)}$ to increase monotonically.
\end{enumerate}
\end{conjecture}






At the core of the conjecture is the following observation.
As $\ptr$ increases, $\beta^{(1)}$ and $\beta^{(2)}$ are blended together, resulting in a linear combination of both values. Consider the following thought experiment, in which the $\beta$s fully blend until the ``effective'' $\beta$ of both cities is the average  $\overline{\beta}=(\beta^{(1)} + \beta^{(2)})/2$.
If we ignore the effect of travelling, all statements of the conjecture follow from \cref{thmn:combined}. 
%
On the other hand, for high values of $\ptr$ we can look at $\overline{\beta}$ and see in \cref{fig:a} that if $\overline{\beta}$ is below $\gamma$, then the total number of infections $\Rinf$ is close to $0$. We give the following explanation. When travel becomes more likely, infected agents may be selected to travel from one country to the other. The country they leave then has fewer infected agents, making the infection step even less likely than the recovery step, which further reduces the number of infected agents. On top of this, recovered or susceptible agents can be selected to travel too, lowering the fraction of infected agents in their destination country. Since the process starts with a small number of infected agents, travelling undermines their influence significantly more than the influence of susceptible or recovered agents.
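
As a quick illustration of the thought experiment, the fully blended rate $\overline{\beta}$ can be evaluated for the three settings of the result above. This is plain arithmetic on the stated parameters with $\gamma=0.2$; it is a sanity check and does not by itself describe intermediate values of $\ptr$.
\begin{align*}
(\beta^{(1)},\beta^{(2)})=(0,\,0.3): &\quad \overline{\beta}=\tfrac{0+0.3}{2}=0.15<\gamma,\\
(\beta^{(1)},\beta^{(2)})=(0,\,0.5): &\quad \overline{\beta}=\tfrac{0+0.5}{2}=0.25>\gamma,\\
(\beta^{(1)},\beta^{(2)})=(0.5,\,0.8): &\quad \overline{\beta}=\tfrac{0.5+0.8}{2}=0.65>\gamma.
\end{align*}
Only the first setting has $\overline{\beta}<\gamma$, which is the regime in which the explanation above predicts that $\Rinf$ is close to $0$ for large $\ptr$.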


For $\overline{\beta}$ larger than $\gamma$,  $\Rinf$ is concave (see \cref{thm:SIRonecountry}). The concavity is the key to the understanding.
For example, when both original $\beta$s
were above $\gamma$, then having $\overline{\beta}$ in both countries increases $\Rinf$, due to the concavity. 
Let us consider the cases of the conjecture one by one.
%
Part (i)  follows since $\overline{\beta}<\gamma$ and any blending of the $\beta$s will yield ``effective'' $\beta$s less than $\gamma$. 
%
For part (ii) note that when $\ptr=0$ and no blending of the $\beta$s occurs, we have in one city $\Rnought_0 > 1$ (expected pandemic) and in the other $\Rnought_0 < 1$ (expected early extinction). When $\ptr$ is slightly above $0$, the blending of the $\beta$s effectively decreases $\beta^{(2)}$ a little bit, but the number of infectable individuals also doubles once $\ptr >0$. This becomes clear when one considers the  setting where $\beta^{(1)}=0$
 and $\beta^{(2)}=1$. Here $\E{\Rinf}>N$ and thus clearly some individuals of country 1 contribute.   
%
Part (iii) follows immediately from the concavity.
% 
The simulations presented here are only a small fraction of the simulations we ran, which cover a wide range of values for $\beta^{(1)}, \beta^{(2)}$ and $\gamma$.
All our simulations confirmed our conjecture.
Nonetheless, we were not able to turn the arguments above into a rigorous proof. On the one hand, considering one country only is already immensely complicated, and a second country increases the number of variables and their dependencies even further. On the other hand, $\Pr{\mathcal{E}_{ext}}$ and further considerations have to be accounted for.
Indeed, one  cannot simply ignore $\ptr$ and assume that both countries have  $\overline{\beta}$.
 The simulations suggest that we have essentially two values for $\Rinf$; one being $\Rinf(0)$ and one being $\Rinf(p)$, for any $p>0$. 
 \iffull
 (See \cref{sec:plots}.)
\fi
%  appears to be same. 
 The reason for this is that if $\ptr=0$, then it can happen that one city has an early extinction. When $\ptr >0$, we believe that whenever only one of the countries has an early extinction, then  the other country will eventually send infected individuals over and rekindle the infection until it succeeds. 
%  \item \cref{fig:b} shows that $\widehat{\Rinf}(0.1)$ maximise the number of total infections. We believe 
%  \end{enumerate}
While the above shows the influence of effects when  $\ptr$ is small, we believe that for large values of $\ptr$ our arguments capture the behaviour of the process.
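
The concavity step used for part (iii) can be sketched as follows. The notation here is hypothetical and the assumptions are ours: write $f(\beta)$ for the expected total number of infections in a single isolated country with infection rate $\beta$, and suppose that $f$ is concave on $(\gamma,\infty)$. Then, for $\beta^{(1)},\beta^{(2)}>\gamma$, midpoint concavity gives
\[
f\bigl(\beta^{(1)}\bigr)+f\bigl(\beta^{(2)}\bigr)\;\le\; 2\,f\bigl(\overline{\beta}\bigr),
\qquad \overline{\beta}=\frac{\beta^{(1)}+\beta^{(2)}}{2},
\]
that is, two countries running at the blended rate produce at least as many infections in total as the two isolated countries. More generally, with $d=(\beta^{(2)}-\beta^{(1)})/2\ge 0$, the function
\[
h(t)=f\bigl(\overline{\beta}-t\bigr)+f\bigl(\overline{\beta}+t\bigr),\qquad t\in[-d,d],
\]
is concave and even, and hence non-increasing on $[0,d]$. So if increasing $\ptr$ only shrinks the gap $t$ between the two effective rates (an assumption of this sketch, not something shown above), the total does not decrease. The sketch ignores the travel dynamics themselves and early extinction, so it is a heuristic and not a proof.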
% 
% We conclude by relating our results to the  current discussion on open borders  and travelling in face of the COVID-19 pandemic.
% First, our results simplify in many ways, for example neglecting the fact that individuals who travel have an increased contact and might therefore spread the virus faster. In addition, the SIR model does not capture COVID-19 completely.
% That being said, behaviours like in \cref{fig:a}, where the number of total cases decreases as a result of open borders are conceivable even for COVID-19. In particular if one country already has herd-immunity.  We strongly believe that effect is important to study in great depth in order to inform policy decisions concerning the opening of boarders.  



\section{Future Work}

In this work we adopted one of the many standard models for disease spreading, which assumes pairwise interactions. Incorporating super spreaders is a natural direction for further work. One way to incorporate super spreaders is to assume an underlying social network with high-degree nodes.  Another is to first activate a random individual and then to choose a random number of interaction partners using a suitable distribution. Both models require analysis methods different from those used in this paper, and we believe it would be very interesting to study them in future work.

\acknowledgements
F. Mallmann-Trenn was in part supported by the EPSRC grant EP/W005573/1.



\clearpage











% \printbibliography

\bibliography{uai2022-template}

\appendix

















% \include{extra_stuff.tex} 

\end{document}

