__Saint Petersburg paradox__

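A fair coin is tossed until the first head appears. If $k$ tails come first, the payoff is $X = 2^k$, so for $k = 0, 1, 2, \dotsc$

$P(X=2^k) = \frac{1}{2^{k+1}}$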

$\mathbb{E}X=\sum^{\infty}_{k=0}2^k \cdot \frac{1}{2^{k+1}} = \sum^{\infty}_{k=0}\frac{1}{2} = \infty$
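The divergence can also be seen numerically: every term of the series contributes exactly $\frac{1}{2}$, so the partial sum over the first $n$ outcomes is $n/2$ and grows without bound. A minimal sketch (the function name `partial_expectation` is only illustrative):

```python
from fractions import Fraction

def partial_expectation(n_terms):
    """Sum of 2^k * P(X = 2^k) for k = 0, ..., n_terms - 1, in exact arithmetic."""
    return sum(Fraction(2**k, 2**(k + 1)) for k in range(n_terms))

# Each partial sum equals n_terms / 2, so it never settles to a finite limit.
for n in (1, 10, 100, 1000):
    print(n, partial_expectation(n))
```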

__Geometric distribution__

Experiment: toss a (possibly unfair) coin until the first head appears.

Let $X$ denote the number of tosses up to and including the first head.

If the coin is fair, the distribution of $X$ is as follows:

| x      | 1   | 2   | 3   | $\dotsm$ | k |
|--------|-----|-----|-----|----------|---|
| P(X=x) | 1/2 | 1/4 | 1/8 | $\dotsm$ | $\frac{1}{2^k}$ |

$\frac{1}{2}+\frac{1}{4}+\frac{1}{8}+ \dotsm + \frac{1}{2^k} + \dotsm = \sum^{\infty}_{k=1}\frac{1}{2^k} = 1$

If the coin is not fair, let the probability of a head be

$P(H) = p$

Then, for $k = 1, 2, 3, \dotsc$,

$P(X=k) = (1-p)^{k-1}p$

$\mathbb{E}X= \sum^{\infty}_{k=1} k \cdot (1-p)^{k-1}p$  
$=p[\sum^{\infty}_{k=1} (1-p)^{k-1} + \sum^{\infty}_{k=2} (1-p)^{k-1} + \sum^{\infty}_{k=3} (1-p)^{k-1} + \dotsm]$  
$=p[(1/p) + (1-p)/p + (1-p)^2/p + \dotsm]$  
$=1+(1-p)+(1-p)^2 + \dotsm$  
$=\frac{1}{p}$

$Var X = \frac{1-p}{p^2}$
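The variance is stated here without proof. One possible derivation, writing $q = 1-p$ (a shorthand introduced only for this sketch) and using $\sum^{\infty}_{k=2} k(k-1)q^{k-2} = \frac{2}{(1-q)^3}$, which is the second derivative of the geometric series:

$\mathbb{E}[X(X-1)] = \sum^{\infty}_{k=1} k(k-1) q^{k-1}p = pq \sum^{\infty}_{k=2} k(k-1) q^{k-2} = \frac{2pq}{(1-q)^3} = \frac{2(1-p)}{p^2}$  
$Var X = \mathbb{E}[X(X-1)] + \mathbb{E}X - (\mathbb{E}X)^2$  
$= \frac{2(1-p)}{p^2} + \frac{1}{p} - \frac{1}{p^2}$  
$= \frac{1-p}{p^2}$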

__example__

Find the expectation of a geometric distribution with success probability 0.1.

$\mathbb{E}(X) = \frac{1}{0.1} = 10$

Find the variance of a geometric distribution with success probability 0.1.

$Var X = \frac{1-0.1}{0.1^2} = 90$
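Both values can be cross-checked by summing the series numerically. The sketch below truncates the infinite sums at a cutoff `k_max`; the cutoff and the helper names are assumptions made for illustration, chosen so that the neglected tail is far below floating-point precision for $p = 0.1$.

```python
def geometric_pmf(k, p):
    """P(X = k) = (1 - p)^(k - 1) * p for k = 1, 2, 3, ..."""
    return (1 - p) ** (k - 1) * p

def truncated_moments(p, k_max=2000):
    """Approximate E[X] and Var[X] by summing the series up to k_max."""
    mean = sum(k * geometric_pmf(k, p) for k in range(1, k_max + 1))
    second = sum(k * k * geometric_pmf(k, p) for k in range(1, k_max + 1))
    return mean, second - mean**2

mean, var = truncated_moments(0.1)
print(mean, var)  # should be very close to 1/p = 10 and (1 - p)/p^2 = 90
```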

__example__

There are 4 black balls and 1 white ball in a box. Balls are drawn one at a time until the white ball appears; each black ball drawn is returned to the box. Find the probability that the experiment stops after exactly 5 draws.

$p=0.2$

$P(X=5) = 0.8^4 \times 0.2 = 0.08192$
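The same calculation can be done in exact arithmetic, which gives the answer both as a fraction and as a decimal. A small sketch using the standard-library `fractions` module (the variable names are illustrative):

```python
from fractions import Fraction

p_white = Fraction(1, 5)   # 1 white ball among 5 balls
p_black = 1 - p_white      # black balls are returned, so this stays 4/5

# Four black draws followed by the white ball on the fifth draw.
prob = p_black**4 * p_white
print(prob, float(prob))   # 256/3125 0.08192
```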

__Poisson distribution__

For a rate parameter $\lambda > 0$ and $k = 0, 1, 2, \dotsc$,

$P(X=k)=\frac{\lambda^k}{k!}e^{-\lambda}$
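As a sanity check, the formula can be evaluated directly with the standard library. The sketch below confirms numerically that the probabilities sum to 1 and that the mean equals $\lambda$ (a standard property of the Poisson distribution, not derived in these notes). The rate `lam = 2.0` and the cutoff of 100 terms are arbitrary choices for illustration.

```python
import math

def poisson_pmf(k, lam):
    """P(X = k) = lam^k / k! * exp(-lam) for k = 0, 1, 2, ..."""
    return lam**k / math.factorial(k) * math.exp(-lam)

lam = 2.0  # illustrative rate
total = sum(poisson_pmf(k, lam) for k in range(100))
mean = sum(k * poisson_pmf(k, lam) for k in range(100))
print(total, mean)  # should be very close to 1 and to lam
```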

__Conditions for Use__

- An event can occur any number of times during a time period.
- Events occur independently. In other words, if an event occurs, it does not affect the probability of another event occurring in the same time period.
- The rate of occurrence is constant; that is, the rate does not change based on time.
- The expected number of events is proportional to the length of the time period. For example, twice as many events are expected in a 2-hour period as in a 1-hour period.

For example, the Poisson distribution is appropriate for modeling the number of phone calls an office would receive during the noon hour, if they know that they average 4 calls per hour during that time period.

- Although the average is 4 calls, they could theoretically get any number of calls during that time period.
- The events are effectively independent since there is no reason to expect a caller to affect the chances of another person calling.
- The occurrence rate may be assumed to be constant.
- It is reasonable to assume that (for example) the probability of getting a call in the first half hour is the same as the probability of getting a call in the final half hour.
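To make the office example concrete, the sketch below evaluates the Poisson formula for $\lambda = 4$ calls per hour. It also assumes, following the constant-rate condition, that any half-hour window has rate $4 \times 0.5 = 2$, so the first and the final half hour have the same chance of at least one call. The helper `poisson_pmf` is written here for illustration and is not a library function.

```python
import math

def poisson_pmf(k, lam):
    """P(X = k) for a Poisson random variable with rate lam."""
    return lam**k / math.factorial(k) * math.exp(-lam)

calls_per_hour = 4  # average from the example

# Probability of exactly k calls during the noon hour, for a few k.
for k in (0, 2, 4, 8):
    print(k, round(poisson_pmf(k, calls_per_hour), 4))

# Assumed rate for any half-hour window: 4 * 0.5 = 2 calls.
p_call_half_hour = 1 - poisson_pmf(0, calls_per_hour * 0.5)
print(p_call_half_hour)  # same value for the first and the final half hour
```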

__example__

Assume that the number of calls a call center receives during one minute is a Poisson random variable with parameter $\lambda = 2$. Use Python to find the probability that the number of calls is larger than 5. Give the answer to 5 digits after the decimal point.

Hint: Use the `scipy.stats.poisson` object.

```python
from scipy.stats import poisson

p = []  # P(X = x) for x = 0, ..., 5
s = 0   # running sum, ends as P(X <= 5)

for x in range(0, 6):
    p.append(poisson.pmf(x, 2))
    s += poisson.pmf(x, 2)
print(p)
print(s, 1 - s)  # P(X <= 5) and P(X > 5)

# [0.1353352832366127, 0.2706705664732254, 0.2706705664732254, 0.18044704431548356, 0.09022352215774178, 0.03608940886309672]
# 0.9834363915193856 0.016563608480614445
```
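The tail probability can be cross-checked without SciPy by summing the Poisson formula directly. This is only a sketch of the same computation. In SciPy itself, `poisson.sf(5, 2)` (the survival function, $1 - CDF$) should give the same tail in a single call.

```python
import math

lam = 2

# P(X <= 5), summing lam^k / k! * exp(-lam) for k = 0, ..., 5
cdf_5 = sum(lam**k / math.factorial(k) * math.exp(-lam) for k in range(6))
tail = 1 - cdf_5  # P(X > 5)

print(f"{tail:.5f}")  # 0.01656
```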

This problem continues the previous one. Assume now that one operator can handle one call per minute. If a call is not handled, it is missed. How many operators should I hire to be sure that the probability of missing a call during one minute is not larger than 0.05? Of course, I want to minimize the number of operators hired.

Hint: Denote the number of calls by $X$ (a random variable) and the number of operators hired by $q$ (an integer). If $X>q$, a call is lost. You need to find the minimal value $q$ such that $P(X>q) \le 0.05$. You can restate this question in terms of CDF values and answer it by inspecting the CDF in Python. However, there is a more efficient way: `scipy.stats` random variables have a `.ppf` method that computes the percent point function (also known as the quantile function), which is the inverse of the CDF. For any value $p$ it finds the minimal value $q$ such that $CDF(q) \ge p$.

```python
# Smallest q with CDF(q) >= 0.95, i.e. P(X > q) <= 0.05
print(poisson.ppf(1 - 0.05, 2))
```
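The CDF-based route mentioned in the hint can be sketched as a simple search: increase $q$ until $P(X > q)$ drops to the allowed level. The function name `min_operators` and its parameters are hypothetical. The CDF is built up term by term from the Poisson formula rather than taken from SciPy.

```python
import math

def min_operators(lam, max_miss_prob):
    """Smallest integer q with P(X > q) <= max_miss_prob for X ~ Poisson(lam)."""
    q = 0
    pmf = math.exp(-lam)  # P(X = 0)
    cdf = pmf             # P(X <= 0)
    while 1 - cdf > max_miss_prob:
        q += 1
        pmf *= lam / q    # P(X = q) obtained from P(X = q - 1)
        cdf += pmf
    return q

print(min_operators(2, 0.05))  # 5
```

With 4 operators the miss probability is about 0.053, just above the threshold, so 5 is the smallest admissible number. This agrees with the value `.ppf` returns, which SciPy reports as a float.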