(sec:GenFxnComp)=
# Composition of PGFs

In this section, we focus only on PGFs.  Given two PGFs $\mu_1(x)$ and $\mu_2(x)$, we would like to know how to interpret the composition $\mu_1(\mu_2(x))$.

We have learned the following facts for PGFs:
- If we take a linear combination of PGFs $\mu_1(x)$, $\mu_2(x)$, $\ldots$ where the coefficients themselves are probabilities summing to $1$:

  $$\psi(x) = \sum \pi_i \mu_i(x)$$
  then $\psi(x)$ is a PGF and the corresponding distribution can be thought of as choosing the distribution for $\mu_i$ with probability $\pi_i$ and then choosing $n$ from that distribution.

- If we multiply PGFs $\mu_1(x)$, $\mu_2(x)$, $\ldots$, $\mu_k(x)$ together:

  $$\psi(x) = \prod_{i=1}^k \mu_i(x)$$
  then $\psi(x)$ is a PGF and the corresponding distribution can be thought of as choosing one number from each of the $k$ distributions for $\mu_1(x), \ldots, \mu_k(x)$ and then adding the $k$ numbers together.  In particular, if all of the PGFs are the same, then we get a power of the PGF.
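
The two facts above can be checked numerically by storing a PGF as its list of coefficients. The following is a minimal sketch, not part of the original development: the names `coin`, `die`, `weights`, and `evaluate` are illustrative assumptions, and the mixing weights are arbitrary.

```python
import numpy as np

# Hypothetical example distributions; the entry at index n is P(n).
coin = np.array([0.0, 0.5, 0.5])         # fair coin with sides labelled 1 and 2
die = np.array([0.0] + [1 / 6] * 6)      # standard six-sided die


def evaluate(coeffs, x):
    """Evaluate the PGF sum_n coeffs[n] * x**n at the point x."""
    return sum(c * x**n for n, c in enumerate(coeffs))


# Linear combination: use the coin with probability 0.3, otherwise the die.
weights = [0.3, 0.7]                     # arbitrary probabilities summing to 1
mixture = np.zeros(len(die))
mixture[: len(coin)] += weights[0] * coin
mixture += weights[1] * die

# Product: one draw from the coin plus one draw from the die.
# Multiplying polynomials is the same as convolving their coefficients.
total = np.convolve(coin, die)

print(mixture.sum(), total.sum())        # both should be 1 up to rounding
```

Here `mixture` holds the coefficients of $0.3\,\mu_{\text{coin}}(x) + 0.7\,\mu_{\text{die}}(x)$ and `total` holds the coefficients of $\mu_{\text{coin}}(x)\,\mu_{\text{die}}(x)$.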



Now we consider two arbitrary PGFs 

\begin{align*}
\mu_1(x) &= \sum_{k=0}^\infty p_k x^k\\
\mu_2(x) &= \sum_{m=0}^\infty q_m x^m
\end{align*}
We want to know how we can interpret 

$$
\psi(x) = \mu_1(\mu_2(x))
$$
Let's look at an example before we do the general case.

```{prf:example} Cash machine PGF
:label: example-CashMachine
Let $\mu_1(x) = \sum_{k=0}^\infty p_k x^k$ be the PGF for the number of customers who come to a cash machine during a day and $\mu_2(x)= \sum_{m=0}^\infty q_m x^m$ be the PGF for the amount of money a random customer withdraws.  What would be the PGF for the total amount of money withdrawn from the cash machine during a day?

Let's first think through separate cases based on $k$, how many customers arrive.  We'll find the PGF assuming that $k$ is known.  Then we'll combine the PGFs for each possible $k$ into a complete PGF.   Given $k$, we know that the PGF of the amount withdrawn is $\xi_k(x)=[\mu_2(x)]^k$ by {prf:ref}`cor-PGFPower`. 

The PGF for the total amount withdrawn is then $\psi(x) = \sum_{k=0}^\infty p_k \xi_k(x)$ by {prf:ref}`thm-PGFLinComb`.  Substituting $[\mu_2(x)]^k$ for  $\xi_k(x)$ we get

$$
\psi(x) = \sum_{k=0}^\infty p_k [\mu_2(x)]^k = \mu_1(\mu_2(x))
$$
as the PGF for the total amount withdrawn.
```
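
The conditioning argument in the cash machine example can be followed step by step in code. This is a sketch under made-up assumptions: the arrival probabilities `p` and withdrawal probabilities `q` below are hypothetical numbers chosen only for illustration, with withdrawals measured in whole units of cash.

```python
import numpy as np

# Hypothetical numbers, chosen only for illustration.
p = [0.2, 0.5, 0.3]   # p[k]: probability that k customers arrive (k = 0, 1, 2)
q = [0.0, 0.6, 0.4]   # q[m]: probability that one customer withdraws m units

max_total = (len(p) - 1) * (len(q) - 1)   # largest possible total withdrawal
psi = np.zeros(max_total + 1)             # coefficients of psi(x)

xi_k = np.array([1.0])                    # coefficients of [mu_2(x)]^0 = 1
for p_k in p:
    psi[: len(xi_k)] += p_k * xi_k        # add the term p_k [mu_2(x)]^k
    xi_k = np.convolve(xi_k, q)           # coefficients of [mu_2(x)]^(k+1)

print(psi)
```

With these numbers the coefficients of $x^0, \ldots, x^4$ come out to approximately $0.2$, $0.3$, $0.308$, $0.144$, and $0.048$, which sum to $1$ as they should for a PGF.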



Returning to the general case of $\psi(x) = \mu_1(\mu_2(x))$, we have

$$
\psi(x) = \sum_{k=0}^\infty p_k [\mu_2(x)]^k
$$

The $k$-th power of $\mu_2(x)$ is the PGF for the sum of $k$ numbers drawn independently from the distribution of $\mu_2(x)$.  The linear combination of these powers, with weights $p_k$, corresponds to choosing the number $k$ from the distribution for $\mu_1(x)$ and then choosing $n$ to be the sum of $k$ numbers chosen from the distribution for $\mu_2(x)$.

This leads to

```{prf:theorem} Composition of PGFs
:label: theorem-PGFComp

Given PGFs $\mu_1(x)$ and $\mu_2(x)$, the composition $\psi(x) = \mu_1(\mu_2(x))$ is the PGF for the following process of choosing a number $n$:
- use the distribution with PGF $\mu_1(x)$ to choose a random integer $k$
- then choose $k$ numbers independently from the distribution with PGF $\mu_2(x)$ and add them together to get $n$.
```
That is, $\mu_1(\mu_2(x))$ is the PGF for a sum of independent random numbers from the $\mu_2(x)$ distribution where the number of terms in the sum is randomly chosen from the $\mu_1(x)$ distribution.
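
When both distributions have finitely many outcomes, the coefficients of the composition can be computed directly. The sketch below is one possible approach, not the only one: the function name `compose_pgf` is an assumption introduced here, and it evaluates the outer polynomial at $\mu_2(x)$ using Horner's rule, with polynomial multiplication done by `numpy.convolve`.

```python
import numpy as np


def compose_pgf(p, q):
    """Return the coefficients of mu_1(mu_2(x)).

    p[k] is the coefficient of x**k in mu_1 and q[m] is the coefficient
    of x**m in mu_2. Both must be finite, non-empty sequences.
    """
    # Horner's rule: (...(p_K * mu_2 + p_{K-1}) * mu_2 + ...) * mu_2 + p_0
    result = np.array([p[-1]], dtype=float)
    for p_k in reversed(p[:-1]):
        result = np.convolve(result, q)   # multiply the running total by mu_2(x)
        result[0] += p_k                  # add the constant p_k
    return result


# Illustrative choice: a fair coin labelled 1 and 2 picks how many dice to roll.
coin = [0.0, 0.5, 0.5]
die = [0.0] + [1 / 6] * 6
psi = compose_pgf(coin, die)

print(len(psi) - 1)   # largest possible sum: 2 dice showing 6 each
print(psi.sum())      # should be 1 up to rounding
```

In this illustration `psi[n]` is the probability that the sum equals `n`: for instance `psi[1]` is $\tfrac12 \cdot \tfrac16$, since a sum of $1$ requires the coin to show $1$ and the single die to show $1$.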

We illustrate this in the following example in which $\mu_1(x)=\mu_2(x)$:

```{prf:example} 
:label: example-InfectiousDice
Consider one person who rolls a single standard six-sided die with PGF $\mu(x) = (x+x^2+ \cdots + x^6)/6$.  Let $k$ be the resulting random number.  

Then $k$ individuals are chosen from the population, each of whom rolls a six-sided die.  Add their results together to get $n$.  The PGF for $n$ is $\psi(x)=\mu(\mu(x))$.

This is illustrated in {numref}`fig-InfectiousDice`.
```


```{figure} InfectiousDice.png
---
height: 450px
width: 600px
name: fig-InfectiousDice
---
If we roll one die to choose the random number $k$ and then sum the rolls of $k$ dice to get $n$, then the PGF of the final sum is $\mu(\mu(x))$, illustrating {prf:ref}`example-InfectiousDice`.
```
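
The dice example can also be checked by comparing the exact coefficients of $\mu(\mu(x))$ with a simulation of the two-stage process. This is a rough sketch: the helper `poly_mul`, the seed, and the number of trials are arbitrary choices made here for illustration.

```python
import random
from fractions import Fraction


def poly_mul(a, b):
    """Multiply two polynomials given as coefficient lists."""
    out = [Fraction(0)] * (len(a) + len(b) - 1)
    for i, a_i in enumerate(a):
        for j, b_j in enumerate(b):
            out[i + j] += a_i * b_j
    return out


# mu[m] = P(a single die shows m); a die never shows 0.
mu = [Fraction(0)] + [Fraction(1, 6)] * 6

# Exact coefficients of mu(mu(x)) = sum_k mu[k] * mu(x)^k.
psi = [Fraction(0)] * 37              # the sum n ranges from 1 to 36
power = [Fraction(1)]                 # coefficients of mu(x)^0
for k in range(1, 7):
    power = poly_mul(power, mu)       # coefficients of mu(x)^k
    for n, c in enumerate(power):
        psi[n] += mu[k] * c

# Simulate the process: roll one die to get k, then sum k further rolls.
rng = random.Random(0)                # fixed seed so runs are repeatable
trials = 100_000
counts = [0] * 37
for _ in range(trials):
    k = rng.randint(1, 6)
    n = sum(rng.randint(1, 6) for _ in range(k))
    counts[n] += 1

# Compare exact probabilities with simulated frequencies for a few sums.
for n in (1, 2, 12, 36):
    print(n, float(psi[n]), counts[n] / trials)
```

The exact values use `Fraction` so that, for example, `psi[1]` is exactly $1/36$; the simulated frequencies should agree with the exact values only up to sampling noise.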


## Self-test

1.  Consider the dice rolling in {prf:ref}`example-InfectiousDice`.  Set $X_0=1$.  For $g=1, 2, 3$, find $X_g$ by having $X_{g-1}$ individuals each roll a die and setting $X_g$ to be the sum.  

    1. Show that the PGF of $X_1$ is $\mu(x)$, where $\mu(x)$ is the PGF of a single die.
    2. Explain why the PGF of $X_2$ is $\mu(\mu(x))$.  [hint: {prf:ref}`example-InfectiousDice` should be helpful for this.]
    3. What is the PGF of $X_3$?

2. Consider a single infected individual introduced to a very large population.  Assume that the number of new infections caused by one infectious individual has PGF $\mu(x)$.  Explain why the number of individuals reached in two transmission steps has PGF $\mu(\mu(x))$.

3. Consider the process where a fair coin with numbers $1$ and $2$ on its sides is flipped.  If the result is $k$, roll $k$ standard dice and sum the results.  What is the PGF of the outcome?  What if the coin is biased so that the probability of flipping a $1$ is $p_1=p$?

4. Flip a fair coin with numbers $1$ and $2$ on its sides.  Let $k$ be the result.  Then roll $k$ standard dice and sum the results to get $m$.  Then flip $m$ coins and sum the results to get $n$.  
   1. What is the PGF $\mu_{\text{coin}}(x)$ of the coin?  
   2. What is the PGF $\mu_{\text{die}}(x)$ of a single die?  
   3. What is the PGF of $n$ in terms of $\mu_{\text{coin}}(x)$ and $\mu_{\text{die}}(x)$?  
   4. Your answer above probably has $\mu_{\text{coin}}$ appear twice.  Which appearance corresponds to the first flip?
