# Probability enters

## Contents

* [Probability basics](#probability_basics) 
* [Probability axioms](#probability_axioms)
* [Random variables](#random_variables)
* [Probability mass and density functions](#probability_densities)
* [Ninja training 3](#njt3)

## Probability basics <a name="probability_basics"></a>

A probability space, or probability triple, consists of three elements: ${\displaystyle (\Omega ,{\mathcal {F}},\mathbb{P})}$.


* $\Omega$ is the set of all possible outcomes.

In the dice rolling example $\Omega = \{1, 2, 3, 4, 5, 6\}$.

* $\mathcal{F}$ is the set of events (subsets of the outcomes).

In the dice rolling example $$\mathcal{F} = \{ \emptyset, \{1 \}, ..., \{6\}, \{1,2\}, ...,
\{5,6\}, ..., \{1,2,3,4,5,6\} \}.$$

This is an advanced concept, so don't worry if you don't understand it yet.

* $\mathbb{P}$ is a probability measure ("measure" has a deep meaning; for now just think of it as a function).
$$\mathbb{P}: \mathcal{F} \rightarrow \mathbb{R} $$

In the dice rolling example $$\mathbb{P}(\emptyset)=0,$$ $$ \mathbb{P}(\{1\})=\frac{1}{6},$$
$$\mathbb{P}(\{1,2\})=\frac{1}{3}.$$
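The three elements of the dice example can be written out in a few lines of Python. This is only a minimal sketch, assuming a fair die; the names `omega`, `events` and `prob` are made up for illustration.

```python
from fractions import Fraction
from itertools import combinations

# Sample space: all possible outcomes of one roll of a die.
omega = frozenset(range(1, 7))

# Event space: every subset of omega, from the empty set up to omega itself.
events = [
    frozenset(subset)
    for size in range(len(omega) + 1)
    for subset in combinations(sorted(omega), size)
]

def prob(event):
    """Probability measure, assuming a fair die: outcomes in the event / all outcomes."""
    return Fraction(len(event), len(omega))

print(len(events))              # 64
print(prob(frozenset()))        # 0
print(prob(frozenset({1})))     # 1/6
print(prob(frozenset({1, 2})))  # 1/3
```

Using `Fraction` keeps the probabilities exact, so they match the fractions written above.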

## Probability axioms <a name="probability_axioms"></a>

1. $\mathbb{P}(A) \geq 0$ for every event $A$ in $\mathcal{F}$.
2. $\mathbb{P}(\Omega)=1$.
3. For a collection of pairwise disjoint events $A_{1}, A_{2}, \dots$,
    $$\mathbb{P}\left(\bigcup_{i=1}^{\infty} A_{i}\right)=\sum_{i=1}^{\infty} \mathbb{P}\left(A_{i}\right).$$
 

The third one looks pretty intense. Let's talk about it.



So what does disjoint mean? No two of the events $A_{1}, A_{2}, \dots$ share an outcome.

Visually,

```python
# Don't worry about the code. Just look at the picture.
from IPython.display import Image
Image('../fig/disjoint.jpg')
```

*(Figure: disjoint events, `../fig/disjoint.jpg`)*

Axiom 3 says that the left hand side (LHS), the

***probability of the union of those disjoint events***,

is equal to the right hand side (RHS),

***the sum of the probabilities of each disjoint event***.
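To see axiom 3 in action, here is a small check on the dice example. It is a sketch that assumes a fair die; the events `a1`, `a2`, `a3`, `b1`, `b2` are arbitrary choices made for illustration.

```python
from fractions import Fraction

def prob(event):
    """Fair-die probability: number of outcomes in the event divided by 6."""
    return Fraction(len(event), 6)

# Three pairwise disjoint events: no outcome appears in more than one of them.
a1, a2, a3 = {1}, {2, 3}, {6}
assert not (a1 & a2) and not (a1 & a3) and not (a2 & a3)

lhs = prob(a1 | a2 | a3)              # probability of the union
rhs = prob(a1) + prob(a2) + prob(a3)  # sum of the probabilities
print(lhs, rhs)  # 2/3 2/3

# If the events overlap, the two sides no longer agree.
b1, b2 = {1, 2}, {2, 3}
print(prob(b1 | b2), prob(b1) + prob(b2))  # 1/2 2/3
```

In the second case the outcome 2 is counted twice in the sum, which is why the axiom only applies to disjoint events.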



If it did not come to you, keep trying. Don't give up!!!

## Random variables <a name="random_variables"></a>


A random variable (RV) $X$ is a function that maps an outcome $\omega$ to a numeric value. That is, $X(\omega) \in \mathbb{R}.$

$\mathbb{P} (X(\omega) \in A)$ denotes the probability of the set of outcomes $\omega$ for which $X(\omega)$ falls in the set $A$.

Note that $\mathbb{P} (X(\omega) \in A)$ is often written as $\mathbb{P} (X \in A)$.

That was pretty technical!!! Let's have an example.

Example:
Let $X$ be the winnings on a $\$2$ bet that a die roll is even.
* $X$ maps 1,3,5 to $-2$
* $X$ maps 2,4,6 to $2$
* $\mathbb{P}(X=2) = \mathbb{P}(X=-2) = \frac{1}{2}$
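The betting example can be sketched in Python to make the "RV is a function" idea concrete. The helper `prob_X_in` is a hypothetical name, and the code assumes a fair die so that every outcome is equally likely.

```python
from fractions import Fraction

# Outcomes of one roll of a die, assumed equally likely.
omega = [1, 2, 3, 4, 5, 6]

def X(outcome):
    """Random variable: winnings on a $2 bet that the roll is even."""
    return 2 if outcome % 2 == 0 else -2

def prob_X_in(values):
    """P(X in values): share of outcomes that X maps into the set `values`."""
    favourable = [w for w in omega if X(w) in values]
    return Fraction(len(favourable), len(omega))

print([X(w) for w in omega])  # [-2, 2, -2, 2, -2, 2]
print(prob_X_in({2}))         # 1/2
print(prob_X_in({-2}))        # 1/2
```

Notice that the probability is always computed on the outcomes; $X$ only decides which outcomes count.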

## Probability mass and density functions <a name="probability_densities"></a>

There are two main types of random variables: discrete and continuous.

Some discrete examples:
* The number you roll on a die.
* The number of Tinder matches you get in a week.
* The number of times you are late to work or school this year.

Some continuous examples:
* The amount of time you have to wait in a queue.
* The number of centimeters you grow this year.
* The number of minutes the bus is late.

We describe a discrete RV with a probability mass function (pmf) $p(x)$, which is $\mathbb{P}(X=x)$.

Let's work with the Tinder example.

Assume that the maximum number of matches I get in a week is 3. Then a possible pmf could be

$$ p(x) = \begin{cases}0.2 \quad \text{if} \quad x = 0\\
                    0.5 \quad \text{if} \quad x = 1\\
                    0.2\quad \text{if} \quad x = 2\\
                    0.1\quad \text{if} \quad x = 3\\
    \end{cases}$$



In English this means that the

* probability I get zero Tinder matches is 0.2.
* probability I get one Tinder match is 0.5.
* probability I get two Tinder matches is 0.2.
* probability I get three Tinder matches is 0.1.

Note that all the probabilities add to 1, as axioms 2 and 3 require: the events $\{X=0\}, \dots, \{X=3\}$ are disjoint and their union is $\Omega$.
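A pmf like this one fits naturally into a Python dictionary. The sketch below uses the values from the example; the variable names are made up for illustration.

```python
import math

# pmf from the example: number of matches in a week -> probability.
pmf = {0: 0.2, 1: 0.5, 2: 0.2, 3: 0.1}

# The probabilities must add to 1 (up to floating point rounding).
total = sum(pmf.values())
print(math.isclose(total, 1.0))  # True

# Probabilities of disjoint values add up, for example P(X >= 2) = p(2) + p(3).
p_at_least_two = pmf[2] + pmf[3]
print(round(p_at_least_two, 10))  # 0.3
```

The comparison uses `math.isclose` rather than `==` because decimal values such as 0.1 are not stored exactly as floats.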

We describe a continuous RV with a probability density function (pdf) $f(x)$. Technically $f(x)$ is not $\mathbb{P}(X=x)$, as the probability of a continuous RV taking a single value is zero.

Example:

Let $X$ be the amount of time in minutes you have to wait for coffee. Then a possible pdf for $X$ is

$$f(x) = 2\exp(-2x), \quad \quad\quad x \geq 0 $$
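One way to convince yourself that $f(x)$ is a density and not a probability is to evaluate it. The sketch below uses a simple midpoint rule written for illustration (`integrate` is not a library function), and it cuts the infinite range off at 20, which is an assumption that works here because the tail beyond that point is negligible.

```python
import math

def f(x):
    """pdf from the example: 2 * exp(-2x) for x >= 0, and 0 otherwise."""
    return 2 * math.exp(-2 * x) if x >= 0 else 0.0

def integrate(func, a, b, n=100_000):
    """Approximate the area under func between a and b with the midpoint rule."""
    h = (b - a) / n
    return h * sum(func(a + (i + 0.5) * h) for i in range(n))

# A density can be larger than 1, so f(x) cannot be a probability.
print(f(0))  # 2.0

# The total area under the density is (approximately) 1.
total = integrate(f, 0, 20)
print(round(total, 6))  # 1.0
```

For a continuous RV the area under the pdf plays the role that the sum of the pmf plays for a discrete RV.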


## Ninja training 3 <a name="njt3"></a>

**Task 1:** Starting with the 3 probability axioms, prove the following.

If $A$ is an event then $$\mathbb{P}\left(A^{c}\right)=1-\mathbb{P}(A).$$ Then use this result to show that $\mathbb{P}(\emptyset)=0$.

**Task 2:** Prove the following.
If $A$ and $B$ are events and $A \subseteq B$ ($A$ is a subset of $B$) then $\mathbb{P}(A) \leq \mathbb{P}(B)$.

**Task 3:** Prove that for any two events $A$ and $B$ we have

$$\mathbb{P}(A \cup B)=\mathbb{P}(A)+\mathbb{P}(B)-\mathbb{P}(A \cap B)$$

**Task 4:**  This question only requires deep thought.

Let $X$ be an RV distributed exponentially with intensity $\lambda = 2$. In other words, it has this pdf:
$$f(x) = 2\exp(-2x), \quad \quad\quad x \geq 0 $$

Think about how one would calculate $\mathbb{P}(X\leq 2)$.

You may need to use Google.