In [3]:
# Slides for Probability and Statistics module, 2016-2017
# Matt Watkins, University of Lincoln

# Interlude
<br>
<div style="background-color:LightBlue; margin-left: 20px; margin-right: 20px; padding-bottom: 8px; padding-left: 
8px; padding-right: 8px; padding-top: 8px; border-radius: 25px;">
A die is rolled, and we imagine a sample space $S = \{1,2,3,4,5,6\}$.
<br><br>
For a fair die, each outcome can be argued to be equally likely. We can deduce the probability $p$ of the outcomes as

$$1=P(S)=P(\{1\})+P(\{2\})+⋯+P(\{6\})=6p$$

which implies that $p = \frac{1}{6}$, as you'd expect.
<br><br>
Define two events,
<li> $A$, "the roll is less than 5", so $A = \{1,2,3,4\} $</li>
<li> $B$, "the roll is more than 2", so $B = \{3,4,5,6\} $</li>
<br>
The number of elements in each event is $n(A)=n(B)=4$.
<br>
So $P(A) = n(A)*p = 4*1/6$ and $P(B) = n(B)*p = 4*1/6$
<br><br>
What is the probability of $A \cup B$? In words, what is the probability that either "the die roll is less than 5" or "the die roll is more than 2"?
</div>

The answer is not $P(A) + P(B)$, which would give $\frac{4}{3} > 1$, because the two sets have an intersection

$A \cap B = \{1,2,3,4\} \cap  \{3,4,5,6\} = \{3,4\}$

so our third axiom _does not_ apply.

One of the basic rules of sets is we don't need to keep duplicate members - so

$A \cup B = \{1,2,3,4\} \cup  \{3,4,5,6\} = \{1,2,3,4,3,4,5,6\} = \{1,2,3,4,5,6\} = S $

We see in this case that

$$
n(A \cup B) = n(A) + n(B) - n(A \cap B)
$$

which for probabilities proportional to number of elements also gives

$$
P(A \cup B) = P(A) + P(B) - P(A \cap B)
$$
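
As a quick check of the rule above, here is a minimal sketch that builds the die events as Python sets and compares both sides exactly using fractions. The helper name `prob` is an assumption made for this illustration and is not part of the original slides.

```python
from fractions import Fraction

S = {1, 2, 3, 4, 5, 6}          # sample space for one roll of a fair die
A = {x for x in S if x < 5}     # "the roll is less than 5"
B = {x for x in S if x > 2}     # "the roll is more than 2"

def prob(event):
    """Probability of an event when every outcome in S is equally likely."""
    return Fraction(len(event), len(S))

lhs = prob(A | B)                        # P(A union B)
rhs = prob(A) + prob(B) - prob(A & B)    # P(A) + P(B) - P(A intersect B)
print(lhs, rhs)
```

Both sides come out as 1, whereas simply adding `prob(A) + prob(B)` counts the outcomes 3 and 4 twice.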


### Extra useful relationships and definitions:
<br>

<div style="background-color:Gold; margin-left: 20px; margin-right: 20px; padding-bottom: 8px; padding-left: 
8px; padding-right: 8px; padding-top: 8px; border-radius: 25px;">
**Definition:** The complement of an event $E$ (with respect to the sample space $S$) is the event $\bar{E}$ containing every outcome of $S$ that is not in $E$. Its probability is $P(\bar{E}) = 1 - P(E)$.
<br><br>

**Definition:** $E_1$ and $E_2$ are mutually exclusive if $E_1 \cap E_2 = \emptyset$, so that $P(E_1 \cap E_2) = P(\emptyset) = 0$.
<br><br>

**Definition** A set of events $E_1, E_2, \ldots, E_n$ of some experiment are said to be *exhaustive* if $E_1 \cup E_2 \cup E_3 \cup \ldots \cup E_n = S$.
<br><br>

If $E_1$ and $E_2$ are events of the same experiment, then $P(E_1 \cup E_2) = P(E_1) + P(E_2) - P(E_1 \cap E_2)$.
</div>
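
These definitions can be tried out on the fair die from the interlude. The sketch below is only illustrative: the events `E`, `evens` and `odds` and the helper `prob` are hypothetical choices, not taken from the slides.

```python
from fractions import Fraction

S = frozenset(range(1, 7))   # fair six-sided die

def prob(event):
    """Equally likely outcomes: count the elements and divide by the size of S."""
    return Fraction(len(event), len(S))

E = {1, 2}                   # hypothetical event chosen for illustration
E_bar = S - E                # complement of E with respect to S
evens, odds = {2, 4, 6}, {1, 3, 5}

complement_ok = prob(E_bar) == 1 - prob(E)       # P(not E) = 1 - P(E)
mutually_exclusive = (evens & odds) == set()     # empty intersection
exhaustive = (evens | odds) == S                 # union covers the sample space
print(complement_ok, mutually_exclusive, exhaustive)
```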



### Principles of counting - combinations and permutations

Suppose two experiments are carried out 

- experiment 1 can result in any of $m$ outcomes 
- for each outcome of experiment 1 experiment 2 can have $n$ outcomes 
- together there are $mn$ outcomes. 

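We can write each possible outcome as an ordered pair $(i,j)$, where $i = 1,\ldots,m$ labels the outcome of experiment 1 and $j = 1,\ldots,n$ labels the outcome of experiment 2.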

The $mn$ possible outcomes can be tabulated

| | | | |
|-|-|-|-|
|(1,1)|(1,2) |$\cdots$|(1,n)|
|(2,1)|(2,2) |$\cdots$|(2,n)|
|$\vdots$|$\vdots$|$\vdots$|$\vdots$|
|(m,1)| (m,2) |$\cdots$|(m,n)|
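
The table can be generated directly. This is a small sketch using `itertools.product` from the standard library, with example sizes for the two experiments that are chosen purely for illustration.

```python
from itertools import product

m, n = 3, 4   # example sizes, chosen only for illustration

# every ordered pair (outcome of experiment 1, outcome of experiment 2)
outcomes = list(product(range(1, m + 1), range(1, n + 1)))

print(len(outcomes))               # equals m * n
print(outcomes[0], outcomes[-1])   # first and last entries of the table
```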

**Example:** 2 balls are randomly drawn from a bowl that contains 6 black and 5 white balls - what is the chance that we draw one black and one white ball?

We want to apply
$$P(E)=\frac{\text{Number of outcomes in } E}{N}$$

First calculate $N$ the total number of possible outcomes:

- experiment 1: the first ball can be picked in 11 different ways, so $m = 11$.
- experiment 2: for each result of experiment 1, the next ball can be picked in one of 10 ways, so $n = 10$.
- so the total number of possible outcomes is $mn = 110$.

In set builder notation

$B = \{(x_1,x_2): x_1 = 1,2,\ldots,11,\ x_2 = 1,2,\ldots,11,\ x_2 \neq x_1\}$

and so if we were to list all the ordered pairs in a table there would be $11 \cdot 10$ of them:

$N = n(B) = |B| = 11 \cdot 10 = 110$

Now we have the denominator we need to calculate the number of ways of getting one black and one white ball. Either

- the first ball is black, which can occur in 6 ways ($m$), and for each of these there are 5 ways the second ball can be white ($n$)
- the first ball is white (5 ways, $m$) and the second ball is black (6 ways, $n$)

Either of the two ways (the union of the two events) gives one black and one white ball. The two events are mutually exclusive, so by axiom 3 their counts add, and the total number of ways to get one black and one white ball is $6 \cdot 5 + 5 \cdot 6 = 60$.



In set builder notation, we number the balls so that 1 to 6 are black and 7 to 11 are white. The first ball is $x_1$ and the second is $x_2$:

$T = \{(x_1,x_2):x_1 = 1,2,3,4,5,6, x_2 = 7,8,9,10,11\} \cup \{(x_1,x_2):x_1 = 7,8,9,10,11, x_2 = 1,2,3,4,5,6\}$

Therefore

$$P(E)=\frac{\text{Number of outcomes in } E}{N} = \frac{6 \cdot 5 + 5 \cdot 6}{110} = \frac{60}{110} = \frac{6}{11}$$
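
The same numbers can be recovered by brute force. The following sketch enumerates every ordered draw of two distinct balls and counts the mixed-colour ones. The labelling of the balls by list position is an assumption made for this illustration.

```python
from fractions import Fraction
from itertools import permutations

# labelling is an assumption for illustration: positions 0-5 black, 6-10 white
balls = ["black"] * 6 + ["white"] * 5

# ordered draws of two different balls (no replacement)
draws = list(permutations(range(len(balls)), 2))

# keep the draws where the two balls have different colours
mixed = [d for d in draws if balls[d[0]] != balls[d[1]]]

p_mixed = Fraction(len(mixed), len(draws))
print(len(draws), len(mixed), p_mixed)
```

Enumeration like this is only practical for small sample spaces, but it is a useful sanity check on a counting argument.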


*If there are more than two experiments to be performed then the above rule generalises to $n_1⋅n_2 \cdot \cdots \cdot  n_r$ possible outcomes of r experiments.*

So continuing our previous example, how many ways are there to select all the balls from the bowl?

In this case there are $11 \cdot 10 \cdot 9 \cdots 3 \cdot 2 \cdot 1 = 11!$ ways.

This is known as the factorial function.

Let's calculate this just to see how big these numbers get!


In [10]:
total = 1
print("{:>10} {:>10}".format("i","total"))

for i in range(1,12):
    total = total * i 
    print("{:10d} {:10d}".format(i,total))

         i      total
         1          1
         2          2
         3          6
         4         24
         5        120
         6        720
         7       5040
         8      40320
         9     362880
        10    3628800
        11   39916800



#### Permutations
<br>
<div style="background-color:Gold; margin-left: 20px; margin-right: 20px; padding-bottom: 8px; padding-left: 
8px; padding-right: 8px; padding-top: 8px; border-radius: 25px;">
**Fancy Definition:** A permutation is a one-to-one mapping of a set onto itself whilst changing the ordering of the elements. For instance if $A=\{a,b,c\}$ a possible permutation would be

$$
σ=\left( 
\begin{array}{ccc}
a & b & c \\ 
c & b & a
\end{array}
\right)
.$$
Here the permutation sends $a$ to $c$, leaves $b$ where it is, and sends $c$ to $a$.


</div>

<br>

<div style="background-color:Gold; margin-left: 20px; margin-right: 20px; padding-bottom: 8px; padding-left: 
8px; padding-right: 8px; padding-top: 8px; border-radius: 25px;">
**Definition:** A full set of permutations is all the ways of arranging some distinguishable objects. Each permutation swaps between two of these ways of arranging them.
</div>

The number of permutations of $n$ objects can be found from the counting rules to be $n!$, in much the same way as for the ball selection example above.


#### Example:

Find all permutations of $A = \{1,2,3\}$

$B = \{(1,2,3),(1,3,2),(2,1,3),(2,3,1),(3,1,2),(3,2,1)\}$

and

$n(B) = 3! = 6$
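
The listing can be reproduced with the standard library. This is a minimal sketch using `itertools.permutations`, and it confirms that the count matches $3! = 6$.

```python
import math
from itertools import permutations

A = [1, 2, 3]

# all orderings of the elements of A, each returned as a tuple
perms = list(permutations(A))

print(perms)
print(len(perms), math.factorial(len(A)))
```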

#### Combinations

<div style="background-color:Gold; margin-left: 20px; margin-right: 20px; padding-bottom: 8px; padding-left: 
8px; padding-right: 8px; padding-top: 8px; border-radius: 25px;">
**Definition:** We define the number of combinations of $r$ objects taken from a set of size $n$, ${n \choose r}$, for $r \leq n$, by
$$
{n \choose r}=\frac{n!}{(n-r)!\,r!}
$$
</div>

${n \choose r}$ is the number of combinations of $n$ objects taken $r$ at a time, also referred to as the binomial coefficient. 

Remember 

<div style="background-color:LightGreen; margin-left: 20px; margin-right: 20px; padding-bottom: 8px; padding-left: 
8px; padding-right: 8px; padding-top: 8px; border-radius: 25px;">
Binomial theorem

$$
(x+y)^n = \sum_{i=0}^{n} {n \choose i} x^i y^{n-i}
$$
</div>
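
A numerical spot check of the binomial theorem is easy to write. The sketch below assumes Python 3.8 or later, where `math.comb` is available, and the values of `x`, `y` and `n` are arbitrary choices for illustration.

```python
import math

x, y, n = 2, 3, 5   # arbitrary integers chosen for illustration

lhs = (x + y) ** n
# sum over i of (n choose i) * x^i * y^(n-i)
rhs = sum(math.comb(n, i) * x**i * y**(n - i) for i in range(n + 1))

print(lhs, rhs)
```

Using integers keeps the comparison exact, so the two sides should agree without any rounding tolerance.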

We'll encounter it again several times in the binomial and Poisson distributions.


#### Example:

We need to select student representatives for the school - a committee of 5 people is selected from 8 physicists and 30 mathematicians.

a) If we decide we need to have 2 physicists and 3 mathematicians on the committee, how many possible sets of representatives are there?

We have two 'experiments' here:

- first we choose 2 physicists out of 8 - i.e. ${8 \choose 2} = 28$
- secondly, for each choice of the physicists we can select 3 mathematicians out of 30 - i.e. ${30 \choose 3} = 4060$
- so in total there are $28 \cdot 4060 = 113680$ possible committees of this composition!



b) What is the probability of getting a committee with 4 mathematicians and 1 physicist?



- The total number of ways of selecting the committee is ${38 \choose 5} = 501942$
- The number of ways of getting 4 mathematicians and 1 physicist is found in the same way as in part a) above: ${30 \choose 4} \cdot {8 \choose 1}$
- So the probability is ${30 \choose 4} \cdot {8 \choose 1} / {38 \choose 5} \approx 0.44$.

In [1]:
# define some useful functions
import numpy as np

def factorial(n):
    '''
    calculate n factorial (0! is conventionally taken to be 1)
    '''
    total = 1
    # multiplying by 1 changes nothing, so start the product at 2
    for i in range(2, n+1):
        total = total * i
    return total

def n_choose_r(n,r):
    '''
    calculate binomial coefficient
    '''
    # the quotient is always a whole number, so integer division is exact
    # and avoids floating point rounding for large n
    return factorial(n) // (factorial(n-r)*factorial(r))

In [2]:
# getting numbers for the example above
print(n_choose_r(8,2))
print(n_choose_r(30,3))
print(28*4060)
print(n_choose_r(38,5))
print("{:<8.3}".format(float(n_choose_r(30,4))*n_choose_r(8,1)/n_choose_r(38,5)))

28
4060
113680
501942
0.437   


## Infinite and continuous probability spaces

Everything up to this point deals with probability spaces with a finite sample space. Here we just mention two other cases: when the sample space consists of discrete points but there are infinitely many of them (all the integers, for instance), and the very common case when the sample space can take on a continuous range of values.

### Continuous probability spaces

Consider a spinner - schematically a circle _of unit circumference_ and a pointer

![](../Images/spinner.jpg)

This could end up being a model for a [Roulette wheel](https://en.wikipedia.org/wiki/Roulette), for instance. If we give the spinner a whirl, the pointer will be pointing somewhere a distance $x$ along the circumference. It seems reasonable that every value $0 \leq x \lt 1$ of the distance between the pointer and the mark on the spinner is equally likely to occur. This means that the sample space is the interval $S = [0,1)$. We want a probability model where every value of the sample space is equally likely (we'll call the result of a spin $X$ for now; later we'll see that this is a _continuous random variable_).

In a similar way to before we must have

$$
P\left( 0\leq X \lt 1 \right) = 1.
$$

We also expect a reading in the top half of the spinner to be just as likely as one in the lower half,

$$
P\left( 0\leq X \lt \frac{1}{2} \right) = P\left( \frac{1}{2} \leq X \lt 1 \right) = \frac{1}{2}.
$$

More generally, if we consider an event, $E = [a,b)$, we'd like

$$
P\left( a\leq X \lt b \right) = b - a
$$


for every $a$ and $b$ with $0 \leq a \leq b \leq 1$.

We can satisfy this requirement for every interval $E = [a,b)$ in the sample space by a formula of the form

$$
P(E) = \int_{E} f(x) \mathrm{d}x,
$$

where $f(x)$ is the constant function with value 1 on $[0,1)$, since then $\int_a^b 1 \, \mathrm{d}x = b - a$.

We call $f(x)$ the _density function_ of $X$. 

This is the generalisation of the discrete case we saw earlier:

$$
P(E) = \sum_{i \in E} P(i).
$$
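
The spinner model can be explored by simulation. The sketch below is a rough Monte Carlo check under the assumption that `random.random()`, which returns uniform values in $[0,1)$, is an adequate stand-in for a spin. The interval end points `a` and `b`, the seed and the number of spins are arbitrary choices for illustration.

```python
import random

random.seed(0)                 # fixed seed so the run is repeatable
a, b = 0.25, 0.60              # example interval, chosen for illustration
n_spins = 100_000

# count the spins that land in the interval [a, b)
hits = sum(a <= random.random() < b for _ in range(n_spins))
estimate = hits / n_spins

print(estimate, b - a)
```

The estimated fraction should sit close to $b - a$, with the gap shrinking roughly like $1/\sqrt{n}$ as the number of spins $n$ grows.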

# Summary

- Sample space, outcomes, events and probability distribution.
- Axiomatic definition of probability
- Set operations for manipulating probabilities
- Counting rules
- Combinations and permutations