9

Branching processes

Summary The branching process is a fundamental model for the random growth of populations. The method of generating functions, and in particular the random sum formula, provides the key to the study of this process. The criterion for the ultimate extinction of a branching process is stated and proved.

9.1 Random processes

Until now, we have been developing the basic terminology and results of probability theory, next, we turn our attention to simple applications. The passing of time plays an essential part in the world which we inhabit, and consequently many applications of probability involve quantities which develop randomly as time passes. Such randomly evolving processes are called random processes or stochastic processes, and there are many different types of these. Most real processes in nature, such as the pollen count in Phoenix or the position of Swansea City in the football league, develop according to rules which are too complicated to describe exactly, and good probabilistic models for these processes can be very complicated indeed. We shall stick to some of the simplest random processes, and the specific processes which we shall consider in some depth are

(a)  branching processes: modelling the growth of a self-reproducing population (such as mankind),

(b)  random walks: modelling the movement of a particle which moves erratically within a medium (a dust particle in the atmosphere, say),

(c)  Poisson processes and related processes: modelling processes such as the emission of radioactive particles from a slowly decaying source, or the length of the queue at the supermarket cash register.

There is a fairly complete theory of each of these three types of process, of which the main features are described in Chapters 911, respectively. In contrast, the general theory of stochastic processes is much more challenging and is outside the range of this book. At one extreme, probabilists study ‘concrete’ processes such as those above, often designed to meet the needs of a particular application area, and at the other extreme there is an abstract theory of ‘general’ stochastic processes. Any tension between these two extremes is resolved through the identification of key properties which are shared by large families of processes and yet are sufficiently specific to allow the development of a useful theory. Probably the most important such property is the so-called ‘Markov property’. We do not discuss this here, but refer the reader to Chapter 12 for an account of Markov processes in discrete time.

9.2 A model for population growth

We define the term nomad to be a type of hypothetical object which is able to reproduce itself according to the following rules. At time image, there exists a single nomad. This nomad lives for a unit of time and then, at time image, it dies in the act of childbirth and is replaced by a family of offspring nomads. These nomads have similar biographies, each surviving only until time image and then each dying and being replaced by a family of offspring. This death–birth process continues at all subsequent time points image. If we know the sizes of all the individual nomad families, then we know everything about the development of the nomad population, and we might represent this in the usual way as a family tree (see Figure 9.1). The problem is that different nomads may have different numbers of offspring, and these numbers may not be entirely predictable in advance. We shall assume here that the family sizes are random variables which satisfy the following two conditions:

I. the family sizes are independent random variables each taking values in image,

II. the family sizes are identically distributed random variables with known mass function p, so that the number C of children of a typical nomad has mass function image for image.

Such a process is called a branching process and may be used as a simple model for bacterial growth or the spread of a family name (to give but two examples). Having established the model, the basic problem is to say something about how the development of the process depends on the family-size mass function p. In order to avoid trivialities, we shall suppose throughout this chapter that

image

(9.1)

Fig. 9.1 A typical nomad family tree, with generation sizes 1, 3, 1, 4, 7, image.

We introduce some notation. The set of nomads born at time n is called the nth generation of the branching process, and we write Zn for the number of such nomads. The evolution of the process is described by the sequence image of random variables, and it is with this sequence that we work. Specific properties of the Zn are given in the next section, and we close this section with a list of interesting questions.

(a)  What is the mean and variance of Zn?

(b)  What is the mass function of Zn?

(c)  What is the probability that nomadkind is extinct by time n?

(d)  What is the probability that nomadkind ultimately becomes extinct?

9.3 The generating-function method

The first step in the study of this branching process is to explain how to find the distributions of the Zi in terms of the family-size mass function p. Clearly, image and

image

(9.2)

since Z1 is the number of children of the founding nomad. It is not easy to give the mass function of Z2 directly, since Z2 is the sum of a random number Z1 of random family sizes: writing Ci for the number of children of the ith nomad in the first generation, we have that

image

that is, Z2 is the sum of the family sizes of the Z1 nomads in the first generation. More generally, for image

image

(9.3)

where image are the numbers of children of the nomads in the imageth generation. The sum of a random number of random variables is treated better by using probability generating functions than by using mass functions. We write

image

for the probability generating function of Zn, and

image

for the probability generating function of a typical family size. We wish to express Gn in terms of G, and we do this in the following theorem.

Theorem 9.4

The probability generating functions G, G0, image satisfy

image

(9.5)

and hence Gn is the nth iterate of G,

image

(9.6)

Proof

We have image, and so image. Equation (9.3) expresses Zn as the sum of image independent random variables, each having probability generating function G, and so the random sum formula, Theorem 4.36, may be applied with image and image to deduce that

image

(9.7)

By iteration,

image

where image by (9.2).

Theorem 9.4 contains the information necessary for studying the development of the process. The next result is an immediate corollary.

Theorem 9.8

The mean value of Zn is

image

(9.9)

where image is the mean of the family-size distribution.

Proof

By the theory of probability generating functions,

image

Therefore,

image

The variance of Zn may be derived similarly in terms of the mean μ and the variance image of the family-size distribution. See Exercise 9.12.

It follows by Theorem 9.8 that

image

indicating that the behaviour of the process depends substantially on which of the three cases image, image, image holds. We shall see this in more detail in the next two sections, where it is shown that if image, the nomad population is bound to become extinct, whereas if image, there is a strictly positive probability that the line of descent of nomads will continue forever. This dependence on the mean family-size μ is quite natural since ‘image’ means that each nomad gives birth to (on average) strictly fewer nomads than are necessary to fill the gap caused by its death, whereas ‘image’ means that each death results (on average) in an increase in the population. The case when image is called critical since then the mean population-size equals 1 for all time; in this case, random fluctuations ensure that the population size will take the value 0 sooner or later, and henceforth nomadkind will be extinct.

  

Exercise 9.10

Show that, in the above branching process,

image

for any image. This may be proved either directly from the conclusion of Theorem 9.4 or by adapting the method of proof of (9.7).

Exercise 9.11

Suppose that each family size of a branching process contains either one member only (with probability p) or is empty (with probability image). Find the probability that the process becomes extinct at or before the nth generation.

Exercise 9.12

Let μ and image be the mean and variance of the family-size distribution. Adapt the proof of Theorem 9.8 to show that the variance of Zn, the size of the nth generation of the branching process, is given by

image

  

9.4 An example

The key to the analysis of branching processes is the functional equation

image

(9.13)

relating the probability generating functions of Zn and image and derived in Theorem 9.4. There are a few instances in which this equation may be solved in closed form, and we consider one of these cases here. Specifically, we suppose that the mass function of each family size is given by

image

so that each family size is one member smaller than a geometrically distributed random variable with parameter image (remember (2.16)) and has probability generating function

image

We proceed as follows in order to solve (9.13). First, if image,

image

Secondly, we apply (9.13) with image to find that

image

The next step gives

image

It is natural to guess that

image

(9.14)

for any image, and this is proved easily from (9.13), by the method of induction.

The mass function of Zn follows by expanding the right-hand side of (9.14) as a power series in s, to find that the coefficient of image is

image

(9.15)

In particular,

image

so that this branching process becomes extinct sooner or later, with probability 1.

There is a more general case of greater interest. Suppose that the mass function of each family size is given by

image

where image. The previous example is the case when image, but we suppose here that image so that image. In this case,

image

and the solution to (9.13) is

image

(9.16)

valid for image and image; again, this can be proved from (9.13) by induction on n. The mass function of Zn is rather more complicated than (9.15) but may be expressed in very much the same way. The probability of extinction is found to be

image

where image is the mean family-size. Hence

image

giving that ultimate extinction is certain if image and less than certain if image. Combined with the result when image and image, this shows that ultimate extinction is certain if and only if image. We shall see in the next section that this is a special case of a general result.

  

Exercise 9.17

Find the mean and variance of Zn when the family-size distribution is given by image for image and image. Deduce that image if and only if image.

  

9.5 The probability of extinction

In the previous example, ultimate extinction of the branching process is certain if and only if the mean family-size μ satisfies image. This conclusion is valid for all branching processes (except for the trivial branching process in which every family size equals 1 always), and we shall prove this. First, we define the extinction probability

image

Next, we show how to find e. Let image be the event that the branching process is extinct (in that nomadkind has died out) by the nth generation, and let image. Now,

image

If image, then necessarily image, so that image, and in particular image. Since the sequence image of events is increasing, we may use the continuity of probability measures, Theorem 1.54, to obtain that

image

(9.18)

How do we calculate e in practice? Clearly, if image, then image, since all families are non-empty. The next theorem deals with the general case.

Theorem 9.19 (Extinction probability theorem)

The probability e of ultimate extinction is the smallest non-negative root of the equation

image

(9.20)

Proof

Since image, we have by (4.9) that image. By (9.5) and (9.6),

image

Set image to find that image satisfies

image

(9.21)

with the boundary condition image. Now take the limit as image. By (9.18), image. Furthermore, G is a power series with radius of convergence at least 1, giving that G is continuous on image. It follows that e is a root of the equation image, as required.

In order to show that e is the smallest non-negative root of (9.20), suppose that image is any non-negative root of (9.20); we shall show that image. First, G is non-decreasing on image since it has non-negative coefficients, and hence

image

by (9.21). Similarly,

image

by (9.21), giving by induction that

image

Hence, image.

The last theorem explains how to find the probability of ultimate extinction, the next tells us when extinction is bound to occur.

Theorem 9.22 (Extinction/survival theorem)

Assume that image. The probability e of ultimate extinction satisfies image if and only if the mean family-size μ satisfies image.

We have eliminated the special case with image, since in this trivial case all families have size 1, so that image and image.

Proof

We may suppose that image, since otherwise image and image. We have seen that e is the smallest non-negative root of the equation image. On the interval image, G is

(a)  continuous, since its radius of convergence is at least 1,

(b)  non-decreasing, since image,

(c)  convex, since image,

and so a sketch of G looks something like the curves in Figure 9.2. Generally speaking, there are either one or two intersections between the curve image and the line image in the interval image. If the derivative image satisfies image, the geometry of Figure 9.2(a) indicates that there are two distinct intersections (and, in particular image). On the other hand, if image, Figure 9.2(b) indicates that there is a unique such intersection, and in this case image is the unique root in image of the equation image. However, image, and in summary image if and only if image.

Fig. 9.2 The curve image and the line image, in the two cases (a) image and (b) image.

  

Exercise 9.23

If the family-size distribution of a branching process has mass function image for image and image, use Theorem 9.19 to show that the probability that the process becomes extinct ultimately is image if image.

Exercise 9.24

If each family size of a branching process has the binomial distribution with parameters 2 and p image, show that the probability of ultimate extinction is

image

  

9.6 Problems

1.  Let image be independent random variables, each with mean μ and variance image, and let N be a random variable which takes values in the positive integers image and which is independent of the Xi. Show that the sum

image

has variance given by

image

If image are the generation sizes of a branching process in which each family size has mean μ and variance image, use the above fact to show that

image

Deduce an expression for image in terms of μ, image, and n.

2.  Use the result of Problem 9.6.1 to show that, if image is a branching process whose family sizes have mean μ image and variance image, then image as image.

3.  By using the partition theorem and conditioning on the value of Zm, show that if image is a branching process with mean family-size μ, then

image

4.  If image is a branching process in which image and the size of the rth generation Zr has the generating function image, prove that

image

Suppose that the process is modified so that the initial generation Z0 is Poisson with parameter λ, and the number of offspring of each individual is also Poisson with parameter λ. Find a function f such that if image is the generating function of the total number of individuals image, then

image

(Oxford 1977F)

5.  A branching process image has image. Let the total number of individuals in the first n generations of the process be Zn, with probability generating function Qn. Prove that, for image,

image

where P1 is the probability generating function of the family-size distribution. (Oxford 1974F)

6     (a)  Explain what is meant by the term ‘branching process’.

(b)  Let Xn be the size of the nth generation of a branching process in which each family size has probability generating function G, and assume that image. Show that the probability generating function Gn of Xn satisfies image for image.

(c)  Show that image is the probability generating function of a non-negative integer-valued random variable when image, and find Gn explicitly when G is thus given.

(d)  Find the probability that image, and show that it converges as image to image. Explain why this implies that the probability of ultimate extinction equals image.

(Cambridge 2001)