### Conditional probability

Definition and worked examples

#### Usage

A quick review before jumping into ML/AI and the use of Bayes' theorem in libraries such as scikit-learn or PyTorch.

#### References

1. Introduction à la théorie des probabilités: Robert Dalang
2. Data Science par la pratique: Joel Grus
3. Scipy stats: https://docs.scipy.org/doc/scipy/reference/stats.html#discrete-distributions
4. Pandas docs: https://pandas.pydata.org/pandas-docs/stable/
5. Numpy docs: https://docs.scipy.org/doc/numpy-1.13.0/reference/
6. Writing equations in the notebook: https://jupyter-notebook.readthedocs.io/en/stable/examples/Notebook/Typesetting%20Equations.html

In [1]:
import numpy as np
import pandas as pd
import matplotlib
import matplotlib.pyplot as plt
import scipy
import seaborn

### Conditional probability: definition

Consider two events $a$ and $b$. The probability of $a$ happening is $P(a)$, but what if we know that $b$ has happened? Does the probability of $a$ change? In other words, we ask "What is the probability of $a$ given $b$?"

Precisely, if $P(b) > 0$ then $P(a|b) = \frac{P(a\cap b)}{P(b)}$
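
As a quick illustration of the definition, here is a minimal sketch that counts outcomes for two fair dice. The events chosen here (sum equals 8, first die is even) and the helper `prob` are assumptions made up for this example and are not part of the urn problem that follows.

```python
from fractions import Fraction
from itertools import product

# Sample space: all 36 equally likely outcomes of rolling two fair dice.
outcomes = list(product(range(1, 7), repeat=2))

def prob(event):
    """Probability of an event (a predicate on outcomes) by counting."""
    favorable = sum(1 for o in outcomes if event(o))
    return Fraction(favorable, len(outcomes))

def a(o):
    # hypothetical event a: the two dice sum to 8
    return o[0] + o[1] == 8

def b(o):
    # hypothetical event b: the first die shows an even number
    return o[0] % 2 == 0

p_a = prob(a)
p_b = prob(b)
p_a_and_b = prob(lambda o: a(o) and b(o))

# Definition: P(a|b) = P(a ∩ b) / P(b), valid because P(b) > 0
p_a_given_b = p_a_and_b / p_b

print(p_a, p_a_given_b)  # 5/36 1/6
```

Knowing that $b$ happened moves the probability of $a$ from 5/36 to 1/6, so in this example the two events are not independent.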

### What is the probability of $a$ given $b$ ?

### Back to the urn problem (with a twist and replacement)

An urn has 6 red balls and 5 black balls. We draw a ball. If the ball we draw is red, we throw
out that ball and place two additional red balls into the urn. The same rule applies to the black
balls. As a result, the number of balls in the urn grows by one on each draw.

In [2]:
# first draw:
# what is the probability that the first ball is red?
numRed = 6
numBlack = 5
Pred = numRed/(numBlack + numRed)
Pred

0.5454545454545454

Take a look at that using probability notation, where the subscript is the draw number, $r$ is the number of red balls and $b$ is the number of black balls:

First draw is red:

$\displaystyle P(R_1) = \frac{r}{r+b}$

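Second draw is red, given that the first draw was red (one red ball was thrown out and two were added):

$\displaystyle P(R_2|R_1) = \frac{r+1}{r + b + 1}$

By definition, this conditional probability relates the joint event $R_2\cap R_1$ to $P(R_1)$:

$\displaystyle P(R_2|R_1) = \frac{P(R_2\cap R_1)}{P(R_1)}$
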
In [3]:
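Rone = Pred  # P(R_1)
Rtwo = (numRed + 1)/(numBlack + numRed + 1)  # P(R_2|R_1): one extra red ball after a red first draw
PredTwo = (Rone * Rtwo)/Rone  # P(R_2 ∩ R_1)/P(R_1), which reduces to P(R_2|R_1)
PredTwo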

0.5833333333333334

#### Take a look at that using notation: 


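$\displaystyle P(R_2|R_1) = \frac{P(R_2\cap R_1)}{P(R_1)} = \frac{\frac{r}{r+b}\cdot\frac{r+1}{r+b+1}}{\frac{r}{r+b}} = \frac{r+1}{r+b+1}$

Note that $P(R_2\cap R_1) \neq P(R_2)P(R_1)$ here, because the two draws are not independent.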
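
One way to sanity-check this conditional probability is a small Monte Carlo simulation of the urn. This is only a sketch: the function names `draw_two` and `estimate_conditional`, the number of trials and the seed are arbitrary choices for this illustration.

```python
import random

def draw_two(rng, num_red=6, num_black=5):
    """Simulate two draws; the drawn ball is thrown out and two of its colour are added."""
    red, black = num_red, num_black
    colours = []
    for _ in range(2):
        if rng.random() < red / (red + black):
            colours.append("red")
            red += 1      # one red thrown out, two red added
        else:
            colours.append("black")
            black += 1    # one black thrown out, two black added
    return colours

def estimate_conditional(trials=200_000, seed=0):
    """Estimate P(R_2|R_1) as (# both draws red) / (# first draw red)."""
    rng = random.Random(seed)
    first_red = 0
    both_red = 0
    for _ in range(trials):
        first, second = draw_two(rng)
        if first == "red":
            first_red += 1
            if second == "red":
                both_red += 1
    return both_red / first_red

estimate = estimate_conditional()
print(estimate)  # should land close to 7/12, i.e. about 0.583
```

The estimate only uses the trials where the first draw was red, which is exactly what conditioning on $R_1$ means.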

### What if you don't know what the first draw is?

Then you need to apply the law of total probability. This allows us to account for the possibility that the first draw was a black ball.

If $B_1$ is the event that the first ball picked was black, then $R_2\cap B_1$ is the event that the first ball was black and the second ball picked was red, and:

$\displaystyle P(R_2) = P(R_2\cap R_1) + P(R_2\cap B_1)$
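
As a worked version of this sum, each intersection can be written as a first-draw probability times a conditional probability. Plugging in the urn's counts (6 red, 5 black, and 12 balls in total at the second draw) gives:

$\displaystyle P(R_2) = P(R_1)P(R_2|R_1) + P(B_1)P(R_2|B_1) = \frac{6}{11}\cdot\frac{7}{12} + \frac{5}{11}\cdot\frac{6}{12} = \frac{42 + 30}{132} = \frac{6}{11}$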



In [4]:
# we already know what the first term is
# just figure out the second term

Pblack = numBlack/(numBlack + numRed)  # P(B_1): probability that the first ball was black
PblackRedTwo = numRed/(numBlack + numRed + 1)  # P(R_2|B_1): a black ball was added, so the total grows by one
blackFirst = Pblack * PblackRedTwo  # P(R_2 ∩ B_1) = P(B_1) * P(R_2|B_1)
redFirst = Pred * PredTwo  # P(R_2 ∩ R_1) = P(R_1) * P(R_2|R_1)

# now add the two possible events:

redFirst + blackFirst


0.5454545454545454
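
The result above equals the probability of red on the first draw. To check whether this still holds on later draws, here is a rough sketch that applies the law of total probability repeatedly with exact fractions. The function name `prob_red_on_draw` and the dictionary-of-states approach are choices made for this illustration.

```python
from fractions import Fraction

def prob_red_on_draw(n, num_red=6, num_black=5):
    """Exact probability that the n-th draw is red under the urn rule above."""
    # Maps (red, black) counts before a draw to the probability of being in that state.
    states = {(num_red, num_black): Fraction(1)}
    for _ in range(n - 1):
        next_states = {}
        for (red, black), p in states.items():
            total = red + black
            # A red ball is drawn: net gain of one red ball.
            key = (red + 1, black)
            next_states[key] = next_states.get(key, 0) + p * Fraction(red, total)
            # A black ball is drawn: net gain of one black ball.
            key = (red, black + 1)
            next_states[key] = next_states.get(key, 0) + p * Fraction(black, total)
        states = next_states
    # Total probability: sum over all possible urn states before the n-th draw.
    return sum(p * Fraction(red, red + black) for (red, black), p in states.items())

for n in range(1, 6):
    print(n, prob_red_on_draw(n))  # prints 6/11 for every n
```

For this urn the unconditional probability of drawing red stays at 6/11 on every draw, even though each individual draw changes the contents of the urn.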

### This is an example of a martingale process. From Wikipedia:

"The concept of martingale in probability theory was introduced by Paul Lévy in 1934, though he did not name it. The term "martingale" was introduced later by Ville (1939), who also extended the definition to continuous martingales. Much of the original development of the theory was done by Joseph Leo Doob among others. Part of the motivation for that work was to show the impossibility of successful betting strategies." 

https://en.wikipedia.org/wiki/Martingale_(probability_theory)