Lecture 06 - Stochastic aspects in the spread of epidemics

6 April 2022

Julien Arino

Department of Mathematics & Data Science Nexus
University of Manitoba*

Canadian Centre for Disease Modelling
Canadian COVID-19 Mathematical Modelling Task Force
NSERC-PHAC EID Modelling Consortium (CANMOD, MfPH, OMNI/RÉUNIS)

* The University of Manitoba campuses are located on original lands of Anishinaabeg, Cree, Oji-Cree, Dakota and Dene peoples, and on the homeland of the Métis Nation.

Outline

  • Why stochastic systems?
  • More on sojourn times
  • What types of stochastic systems?

Why stochastic systems?

Why stochastic systems?

  • Life results from the interaction of countless processes

  • Each process happens with some degree of imprecision. E.g., when a virus replicates, it misses a base here and there; when a cell undergoes mitosis, material is not split 50-50; when you meet another human, you forget to shake their hand. Etc.

  • Because of repetition, this gives rise to uncertainty

  • Stochastic systems incorporate uncertainty

More on sojourn times

Deterministic models incorporate stochasticity, they just hide it

  • Some (simple) probability theory
  • The exponential distribution
  • The Dirac distribution
  • A cohort model
  • Sojourn times in an SIS disease transmission model
  • An COVID-19 model : "making Erlangs"
  • Contrary to common claim, ODEs and the like do incorporate stochasticity, they simply do so in a way that makes it not apparent
  • Most models considered in this course (ODE or CTMC) assume exponential sojourn times in compartments

Some (simple) probability theory

Some probability theory

Suppose that a system can be in two states, and

  • At time , system is in state
  • An event happens at some time , which triggers the switch from state to state

A random variable is a variable that takes random values, that is, a mapping from random experiments to numbers

Let us call the random variable

time spent in state before switching into state

States can be anything:

  • : working, : broken;
  • : infected, : recovered;
  • : alive, : dead;

We take a collection of objects or individuals in state and want some law for the distribution of the times spent in , i.e., a law for

For example, we make light bulbs and would like to tell our customers that on average, our light bulbs last 200 years..

For this, we conduct an infinite number of experiments, and observe the time that it takes, in every experiment, to switch between and

From this, deduce a model, which in this context is called a probability distribution

Discrete vs continuous random variables

We assume that is a continuous random variable, that is, takes continuous values. Examples of continuous r.v.:

  • height or age of a person (if measured very precisely)
  • distance
  • time

Another type of random variables are discrete random variables, which take values in a denumerable set. Examples of discrete r.v.:

  • heads or tails on a coin toss
  • the number rolled on a dice
  • height of a person, if expressed rounded without subunits, age of a person in years (without subunits)

Probability

A probability is a function , from a probability space to

Formally: is a probability space, with the sample space, a -algebra of subsets of whose elements are the events, and a measure from to such that , , and

Gives the likelihood of an event occurring, among all the events that are possible, in that particular setting. For example, and

Probability density function

Assume continuous; it has a continuous probability density function

center

Cumulative distribution function

The cumulative distribution function (c.d.f.) is a function that characterizes the distribution of , and defined by

center

Properties of the c.d.f.

  • Since is a nonnegative function, is nondecreasing
  • Since is a probability density function, , and thus

center

Mean value

For a continuous random variable with probability density function , the mean value of , denoted or , is given by

Survival function

Another characterization of the distribution of the random variable is through the survival (or sojourn) function

The survival function of state is given by

This gives a description of the sojourn time of a system in a particular state (the time spent in the state)

is a nonincreasing function (since with a c.d.f.), and (since is a positive random variable)

The average sojourn time is

Since ,

Expected future lifetime

Hazard (or failure) rate

The hazard rate (or failure rate) is

Gives probability of failure between and , given survival to

We have

The exponential distribution

The exponential distribution

The random variable has an exponential distribution if its probability density function takes the form

with . Then the survival function for state is of the form , for , and the average sojourn time in state is

The Dirac distribution

The Dirac distribution

If on the other hand, for some constant ,

which means that has a Dirac delta distribution , then the average sojourn time is

A cohort model

A model for a cohort with one cause of death

We consider a population consisting of individuals born at the same time (a cohort), for example, the same year

We suppose

  • At time , there are initially individuals
  • All causes of death are compounded together
  • The time until death, for a given individual, is a random variable , with continuous probability density distribution and survival function

The model

Denote the population at time . Then

gives the proportion of , the initial population, that is still alive at time

Case where is exponentially distributed

Suppose that has an exponential distribution with mean (or parameter ), . Then the survival function is and takes the form

Now note that

with

The ODE makes the assumption that the life expectancy at birth is exponentially distributed

Case where has a Dirac delta distribution

Suppose that has a Dirac delta distribution at , giving the survival function

Then takes the form

All individuals survive until time , then they all die at time

Here, we have everywhere except at , where it is undefined

Sojourn times in an SIS disease transmission model

An SIS with tweaked recovery

Traditional ODE models assume recovery from disease at per capita rate (often denoted )

Here, assume that, of the individuals who have become infective at time , a fraction remain infective at time

Thus, considered for , the function is a survival function

Reducing the dimension of the problem

We have

is constant (equal total population at time ), so we can deduce the value of , once we know , from the equation

Model for infectious individuals

Integral equation for the number of infective individuals:

  • number of individuals who were infective at time and still are at time
    • is nonnegative, nonincreasing, and such that
  • proportion of individuals who became infective at time and
    who still are at time
  • is with , from the reduction of dimension

Expression under the integral

Integral equation for the number of infective individuals:

The term

  • is the rate at which new infectives are created, at time ,
  • multiplying by gives the proportion of those who became infectives at time and who still are at time

Summing over gives the number of infective individuals at time

Case of an exponentially distributed time to recovery

Suppose that is such that the sojourn time in the infective state has an exponential distribution with mean , i.e.,

Then the initial condition function takes the form

with the number of infective individuals at time . This is obtained by considering the cohort of initially infectious individuals, giving a model such as

Equation becomes

Taking the time derivative of yields

which is the classical logistic type ordinary differential equation (ODE) for in an SIS model without vital dynamics (no birth or death)

Case of a step function survival function

Consider case where the time spent infected has survival function

i.e., the sojourn time in the infective state is a constant

In this case becomes

Here, it is more difficult to obtain an expression for . It is however assumed that vanishes for

When differentiated, gives, for

Since vanishes for , this gives the delay differential equation (DDE)

What we know this far

  • The time of sojourn in compartments plays an important role in determining the type of model that we deal with
  • All ODE compartmental models, when they use terms of the form , make the assumption that the time of sojourn in compartments is exponentially distributed with mean
  • At the other end of the spectrum, delay differential with discrete delay make the assumption of a constant sojourn time , equal for all individuals
  • Both can be true sometimes.. but reality is often somewhere in between

Survival function, , for an exponential distribution with mean 80 years

center

The problems with the exponential distribution

  • Survival drops quickly: in previous graph, 20% mortality of a cohort at age 20 years
  • Survival extends way past the mean: in previous graph, almost 25% survival to age 120 years
  • Acceptable if what matters is mean duration of sojourn over long time period
  • Less so if interested in short term dynamics
  • Exponential distribution with parameter has mean and variance , i.e., one parameter controls both the mean and dispersion

An COVID-19 model : "making Erlangs"

JA & Portet. A simple model for COVID-19. Infectious Disease Modelling 5:309-315 (2020)

Simple way to "fix" sojourn times: sums of exponential distributions

  • Exponential distribution of sojourn times is acceptable if what matters is mean duration of sojourn over long time period
  • For COVID-19, were trying to give "predictions" over 2-4 weeks period, so we need more than the mean

Use a property of exponential distributions, namely, that the sum of i.i.d. (independent and identically distributed) exponential distributions is Erlang distributed

Sum of exponential distributions

and independent exponential r.v. with rate parameters and . Then the p.d.f. of is the convolution

The Erlang distribution

P.d.f. of the Erlang distribution

shape parameter, rate parameter (sometimes use scale parameter )

So, if , has distribution

i.e., an Erlang distribution with shape parameter and rate parameter

Continuing..

, , be exponential i.i.d. random variables with parameter

Then Erlang distributed with rate parameter and shape parameter

Properties of the Erlang distribution

An Erlang is a Gamma with shape parameter . P.d.f. of the Erlang distribution

shape parameter, rate parameter

Mean , variance , c.d.f.

and thus survival function

center

What types of stochastic systems?

Chain binomial models

  • Reed-Frost model from circa 1920
    • Infection spread from I individuals to others after adequate contact
    • Any S individual, after contact with an I individual in a given period, develops the infection and is infectious to others only within the following time period; in subsequent time periods, become R
    • Each individual has a fixed probability of coming into adequate contact with any other specified individual in the group within one time interval, and this probability is the same for every member of the group
    • Closed population
  • Many variations since

Discrete time Markov chains (DTMC)

  • Will discuss later
  • Equivalent of discrete time system but includes stochasticity
  • Jump to next state depends only on current state(memoryless property)
  • Easy to study using linear algebra

Continuous time Markov chains (CTMC)

  • Almost exact stochastic equivalent to ODE
  • Conversion from ODE to CTMC and vice-versa is very easy for compartmental models
  • Harder to study than DTMC but still quite amenable to analysis

Branching processes

  • Special case of CTMC
  • Simplest type is Galton-Watson process: evolves following and

where are i.i.d. nonnegative integer-valued r.v.

  • Can also be continuous-time (not all "reproductions" happen at the same time)

Stochastic differential equations

  • Typically, something of the form

where is a Wiener process (a Brownian motion)