# 1. Conditional Probability

## 1.1 Example: Random Experiments
- A study surveys whether being `Married` is associated with `Happiness`

| ID  | Married | Happy |
|-----|---------|-------|
| 1   | Yes     | No    |
| 2   | Yes     | Yes   |
| 3   | No      | Yes   |
| ... | ...     | ...   |

#### Solve

- We construct a new table


| Married \ Happy | Yes  | No   | **Total** |
|-----------------|------|------|-----------|
| Yes             | 42   | 28   | `70`      |
| No              | 6    | 24   | `30`      |
| **Total**       | `48` | `52` | `100`     |


- Random experiments
    + $\Omega$: the set of all participants (= 100)
    + Define multiple events
        + $H$: The chosen participant is happy
        + $M$: The chosen participant is married
    + Calculate the Probability of each event
        + $P(H) = \frac{48}{100} = 0.48$
        + $P(M) = \frac{70}{100} = 0.7$
    - Conditional Probability
        + Probability `that` the person is happy `provided that` this person is married: $P(H|M) = \frac{42}{70} = 0.6$
        + Probability `that` the person is happy `provided that` this person is not married: $P(H|\bar{M}) = \frac{6}{30} = 0.2$
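
The four probabilities above can be reproduced from the four cell counts of the table. This is a minimal sketch; the names `counts`, `prob` and `cond_prob` are illustrative helpers, not part of the original study.

```python
from fractions import Fraction

# Cell counts from the contingency table, keyed by (married, happy).
counts = {
    (True, True): 42,
    (True, False): 28,
    (False, True): 6,
    (False, False): 24,
}
total = sum(counts.values())

def prob(event):
    """P(event) when one participant is chosen uniformly at random."""
    return Fraction(sum(n for key, n in counts.items() if event(*key)), total)

def cond_prob(event, given):
    """P(event | given) = P(event and given) / P(given)."""
    return prob(lambda m, h: event(m, h) and given(m, h)) / prob(given)

happy = lambda m, h: h
married = lambda m, h: m
not_married = lambda m, h: not m

p_h = prob(happy)
p_m = prob(married)
p_h_given_m = cond_prob(happy, married)
p_h_given_not_m = cond_prob(happy, not_married)
print(p_h, p_m, p_h_given_m, p_h_given_not_m)  # 12/25 7/10 3/5 1/5
```

Exact fractions are used so that the results match the hand calculation without rounding noise.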

## 1.2 Definitions
#### Events
- $\Omega$:  The set of all outcomes, each with **equal** probability
- Events: $A,B \subset \Omega$

#### Conditional Probability
- The conditional probability of A `provided that` B happened

$$\begin{split}
P(A|B)  &= \frac{|A \cap B|}{|B|} \\
        &= \frac{\frac{|A \cap B|}{|\Omega|}}{\frac{|B|}{|\Omega|}} = \frac{P(A\cap B)}{P(B)}
\end{split}$$

- **Notes**
    + $\Omega$:  Must be **finite**, and all outcomes must have **equal** probability (needed for the counting form of the formula)
    + $P(B) \neq 0$

#### Corollary

$$P(A\cap B) = P(A|B)P(B)$$

<img src="assets/1.png" width="450"/>

## 1.3 Example
- A fair coin is tossed 4 times. `Provided that` at least 2 heads occurred within the 4 tosses, what's the probability that at least one tail also occurred?

#### Solve
- Events
    + $A$: at least one tail occurred in the 4 tosses
    + $B$: at least 2 heads occurred in the 4 tosses
    + $A\cap B$: at least one tail `and` at least 2 heads occurred in the 4 tosses
- $P(A|B) = \frac{|A\cap B|}{|B|} = \frac{10}{11}$

```python
from itertools import product

A_cap_B = 0  # outcomes with at least one tail and at least 2 heads
B = 0        # outcomes with at least 2 heads
for tosses in product(['H', 'T'], repeat=4):
    # Experiment: count heads and tails in this outcome
    tails = tosses.count('T')
    heads = tosses.count('H')

    # Count the outcomes that belong to each event
    if tails >= 1 and heads >= 2:
        A_cap_B += 1
    if heads >= 2:
        B += 1

print(A_cap_B, B)
```

Output:

```
10 11
```


# 2. Independent events
- 2 events are independent if:
    + knowing that one event occurred gives us no new information about the probability of the other event
- Examples:
    + A fair coin is tossed twice. 
        + `A`: There's at least one head
        + `B`: There's at least one tail
        + A and B are `not independent`
    + A fair coin is tossed twice
        + `A`: There's exactly one head 
        + `B`: There's exactly one tail
        + A and B are `not independent`
    + A fair coin is tossed twice
        + `A`: The first toss yields a head
        + `B`: The second toss yields a tail
        + A and B are `independent`
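
The three verdicts above can be checked by enumerating the 4 equally likely outcomes of two tosses and applying the product test $P(A\cap B) = P(A)P(B)$ from the definition that follows. A minimal sketch; the helper names `prob` and `independent` are illustrative.

```python
from fractions import Fraction
from itertools import product

# All 4 equally likely outcomes of two tosses, e.g. ('H', 'T').
omega = list(product("HT", repeat=2))

def prob(event):
    """P(event) as (favourable outcomes) / (all outcomes)."""
    return Fraction(sum(1 for w in omega if event(w)), len(omega))

def independent(a, b):
    """Product test: P(A and B) == P(A) * P(B)."""
    return prob(lambda w: a(w) and b(w)) == prob(a) * prob(b)

at_least_one_head = lambda w: "H" in w
at_least_one_tail = lambda w: "T" in w
exactly_one_head = lambda w: w.count("H") == 1
exactly_one_tail = lambda w: w.count("T") == 1
first_is_head = lambda w: w[0] == "H"
second_is_tail = lambda w: w[1] == "T"

print(independent(at_least_one_head, at_least_one_tail))  # False
print(independent(exactly_one_head, exactly_one_tail))    # False
print(independent(first_is_head, second_is_tail))         # True
```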

## 2.1 Definition
- Assume we have 2 events
    + Event $A$ with $P(A) > 0$
    + Event $B$ with $P(B) > 0$

- $A$ and $B$ are `independent` ($A \perp B$) iff ($\iff$) any 1 of the following 3 equivalent conditions is satisfied
    + 1. $P(A|B) = P(A)$
    + 2. $P(B|A) = P(B)$
    + 3. $P(A\cap B) = P(A)P(B)$

#### Corollary: Intersection
+ If $A$ and $B$ are `independent`
    + $P(A) > 0$
    + $P(B) > 0$
+ Then **A and B intersect**. Proof: $P(A\cap B) = P(A)P(B) > 0$, so $A \cap B \neq \emptyset$


## 2.2 Example 1: Testing Independence

| Like Cheese \ Has Dog | Yes  | No   | **Total** |
|-----------------------|------|------|-----------|
| Yes                   | 48   | 32   | `80`      |
| No                    | 12   | 8    | `20`      |
| **Total**             | `60` | `40` | `100`     |

- Events
    + `C`: the participant likes cheese
    + `D`: the participant has a dog

#### Solve
- Probability
    + Probability `that` a person likes cheese `provided that` they have a dog: $P(C|D) = \frac{48}{60} = 0.8$
    + Probability that a person likes cheese: $P(C) = \frac{80}{100} = 0.8$
- As $P(C|D) = P(C) = 0.8$
    + Knowing that a particular person has a dog gives us no new information about whether this person likes cheese.
    + We say that `C and D are independent events`
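
The same check can be done numerically from the table. A small sketch with illustrative variable names; it also confirms the equivalent product form $P(C\cap D) = P(C)P(D)$.

```python
from fractions import Fraction

# Counts keyed by (likes_cheese, has_dog), copied from the table.
counts = {(True, True): 48, (True, False): 32,
          (False, True): 12, (False, False): 8}
total = sum(counts.values())

p_c = Fraction(counts[True, True] + counts[True, False], total)
p_d = Fraction(counts[True, True] + counts[False, True], total)
p_c_and_d = Fraction(counts[True, True], total)
p_c_given_d = p_c_and_d / p_d

print(p_c, p_c_given_d)        # 4/5 4/5
print(p_c_and_d == p_c * p_d)  # True
```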

## 2.3 Example 2: Testing Independence
- Roll a fair die and read the number of points. Define a set of events
    + $A$: Number is even
    + $B$: Number $\geq$ 4
    + $C$: Number $\geq$ 5

#### Solve
+ Calculate all probabilities
    + $P(A) =  \frac{3}{6} = 0.5$ (A = {2,4,6})
    + $P(B) = \frac{3}{6} = 0.5$ (B = {4,5,6})
    + $P(C) = \frac{2}{6} \approx 0.33$ (C = {5,6})
    + $P(A\cap B) = \frac{2}{6} \approx 0.33$ ($A\cap B$ = {4,6})
    + $P(A\cap C) = \frac{1}{6} \approx 0.17$ ($A\cap C$ = {6})
    + $P(B\cap C) = \frac{2}{6} \approx 0.33$ ($B\cap C$ = {5,6})
+ A and B are not independent, since $P(A\cap B) = \frac{1}{3} \neq P(A)P(B) = \frac{1}{4}$
+ A and C are independent, since $P(A\cap C) = \frac{1}{6} = P(A)P(C)$
+ B and C are not independent, since $P(B\cap C) = \frac{1}{3} \neq P(B)P(C) = \frac{1}{6}$
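
These three verdicts can be confirmed by treating each event as a set of die faces and applying the product test to every pair. A minimal sketch; the dictionary `events` and the helper `prob` are illustrative names.

```python
from fractions import Fraction
from itertools import combinations

omega = range(1, 7)  # faces of a fair six-sided die
events = {
    "A": {n for n in omega if n % 2 == 0},  # even
    "B": {n for n in omega if n >= 4},
    "C": {n for n in omega if n >= 5},
}

def prob(s):
    """P(s) for a set of faces s, all faces equally likely."""
    return Fraction(len(s), len(omega))

result = {}
for x, y in combinations(events, 2):
    # Independent iff P(X and Y) == P(X) * P(Y).
    result[x + y] = prob(events[x] & events[y]) == prob(events[x]) * prob(events[y])

print(result)  # {'AB': False, 'AC': True, 'BC': False}
```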

# 3. Mosaic Plot
- Visualization of conditional probabilities and Independence


## 3.1 Example 1: Dependent Events

| Married \ Happy | Yes  | No   | **Total** |
|-----------------|------|------|-----------|
| Yes             | 42   | 28   | `70`      |
| No              | 6    | 24   | `30`      |
| **Total**       | `48` | `52` | `100`     |


- Events
    + $M$: chosen person is married
    + $H$: chosen person is happy
- Probabilities:
    + $P(M) = \frac{70}{100} = 0.7$
    + $P(\bar{M}) = 1 - 0.7 = 0.3$
    + $P(H|M) = \frac{|H \cap M|}{|M|} = \frac{42}{70} = 0.6$
    + $P(H|\bar{M}) = \frac{|H \cap \bar{M}|}{|\bar{M}|} = \frac{6}{30} = 0.2$
- Mosaic Plot
    + The square has side = 1
    + The x-axis shows $P(M)$ and $P(\bar{M})$ as column widths
    + The y-axis shows $P(H|M)$ and $P(H|\bar{M})$ as heights within each column

<img src="assets/2.png" width="450"/>

- We can infer from the Mosaic Plot
    + $P(H\cap M)$ = the area of the rectangle $P(M)P(H|M) = 0.7*0.6 = 0.42$
    + $P(H\cap \bar{M})$ = the area of the rectangle $P(\bar{M})P(H|\bar{M}) = 0.3*0.2 = 0.06$
    + $P(H)$ is the red area: $P(H\cap M) + P(H\cap \bar{M}) = 0.42+0.06 = 0.48$
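
The area argument can be written out as plain arithmetic: each tile of the mosaic is a width (a marginal probability) times a height (a conditional probability). A small sketch with illustrative variable names; it also checks that the four tiles fill the unit square.

```python
# Column widths along x and tile heights along y, from the example.
p_m, p_not_m = 0.7, 0.3
p_h_given_m, p_h_given_not_m = 0.6, 0.2

area_h_and_m = p_m * p_h_given_m              # happy and married
area_h_and_not_m = p_not_m * p_h_given_not_m  # happy and not married
p_h = area_h_and_m + area_h_and_not_m         # total "happy" area

# The remaining two tiles (not happy) complete the unit square.
area_rest = p_m * (1 - p_h_given_m) + p_not_m * (1 - p_h_given_not_m)

print(round(area_h_and_m, 2), round(area_h_and_not_m, 2), round(p_h, 2))  # 0.42 0.06 0.48
print(round(p_h + area_rest, 2))  # 1.0
```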

## 3.2 Example 2: Independent Events

| Like Cheese \ Has Dog | Yes  | No   | **Total** |
|-----------------------|------|------|-----------|
| Yes                   | 48   | 32   | `80`      |
| No                    | 12   | 8    | `20`      |
| **Total**             | `60` | `40` | `100`     |


- Events
    + $C$: chosen person likes cheese
    + $D$: chosen person has a dog

- Probabilities:
    + $P(D) = \frac{60}{100} = 0.6$
    + $P(\bar{D}) = 1 - 0.6 = 0.4$
    + $P(C|D) = \frac{|C \cap D|}{|D|} = \frac{48}{60} = 0.8$
    + $P(C|\bar{D}) = \frac{|C \cap \bar{D}|}{|\bar{D}|} = \frac{32}{40} = 0.8$

- Mosaic Plot

<img src="assets/3.png" width="450"/>


# 4. Using independence to find probabilities
- **Forward approach**: Given the probabilities -> Check independence
- **Reverse approach**: Assume 2 events are independent -> Find probabilities

## 4.1 Example 1
- Peter has to go to school this morning. He defines 3 events
    + `A`: Alarm clock failed
        + P(A) = 0.1
    + `T`: Traffic jam occurred
        + P(T) = 0.3
    + `C`: missed the class
        + P(C) = unknown
- If the alarm clock fails and a traffic jam occurs, Peter misses the class: $C = A \cap T$. Find `P(C)`

#### Solve
- Since we assume that `A and T are independent`:
    + $P(C) = P(A \cap T) = P(A)P(T) = 0.1*0.3 = 0.03$

## 4.2 Example 2: Independent Opinions
- A company has 2 experts to consult on an important project. Define Events
    + `E1`: Expert 1 is wrong
    + `E2`: Expert 2 is wrong
- Evaluate the probability that expert 1 and expert 2 are both wrong $P(E1 \cap E2 )$ in 2 situations
    + 1. The 2 experts give independent opinions
    + 2. Expert 2 just copies the solution of expert 1

#### Solve
- Case 1: E1 and E2 independent
    + $P(E1 \cap E2) = P(E1)P(E2)$
    + Since $P(E1) > 0$ and $P(E2) < 1$ => $P(E1 \cap E2) < P(E1)$

- Case 2: E1 = E2 (completely dependent)
    + $P(E1 \cap E2) = P(E1 \cap E1) = P(E1)$
    + $P(E1 \cap E2)$ in case 2 is larger than in case 1 (whenever $0 < P(E1)$ and $P(E2) < 1$)
    + E2 gives no new information: the second opinion does not reduce the probability that both experts are wrong
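
To make the gap between the two cases concrete, the sketch below plugs in error rates of 0.1 for both experts. These values are hypothetical and chosen only for illustration; the example itself does not specify them.

```python
# Hypothetical error rates (assumption, not given in the example).
p_e1 = 0.1
p_e2 = 0.1

both_wrong_independent = p_e1 * p_e2  # case 1: independent opinions
both_wrong_copy = p_e1                # case 2: E2 = E1

print(round(both_wrong_independent, 4), both_wrong_copy)  # 0.01 0.1
```

With these assumed numbers, a second independent opinion makes a joint error ten times less likely, while a copied opinion changes nothing.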
    

# 5. Pairwise and mutual independence
- Independence involving 3 or more events
- Mutual independence implies pairwise independence, but not the other way around

## 5.1 Example
- Toss a fair coin 3 times
- Total 8 outcomes: $\Omega = \{ HHH,HHT,HTH,HTT,THH,THT,TTH,TTT \}$ 
- Define Events
    + $R_j \in \{H, T \}$: The result of the j-th toss (H or T), $j \in \{1,2,3\}$
    + A: $R_1 = R_2$
    + B: $R_2 = R_3$
    + C: $R_1 = R_3$

#### Analyze
- Probabilities
    + $P(A) = \frac{|\{HHT,HHH,TTT,TTH\}|}{|\Omega|} = \frac{4}{8} = 0.5$
    + $P(B) = 0.5$
    + $P(C) = 0.5$
- Check `pairwise independence`
    + $P(A\cap B) = \frac{|\{HHH, TTT\}|}{|\Omega|} = \frac{2}{8} = 0.25$
        + $P(A\cap B) = P(A)P(B)$ => `A and B are independent`
    + Same for `A and C are independent`
    + Same for `B and C are independent`
- Check `mutual independence`
    + $P(A\cap B \cap C) = \frac{|\{HHH, TTT\}|}{|\Omega|} = \frac{2}{8} = 0.25$
        + We have $P(A)P(B)P(C) = 0.5^3 = 0.125$ => $P(A\cap B \cap C)  \neq P(A)P(B)P(C)$
- Conclusion
    + A,B,C are pairwise independent
    + A,B,C are not mutually independent
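
The conclusion can be verified by enumerating the 8 outcomes and testing the pairwise and triple product conditions separately. A minimal sketch; `events`, `prob`, `pairwise` and `mutual` are illustrative names.

```python
from fractions import Fraction
from itertools import combinations, product

omega = list(product("HT", repeat=3))  # 8 equally likely outcomes

events = {
    "A": {w for w in omega if w[0] == w[1]},  # R1 = R2
    "B": {w for w in omega if w[1] == w[2]},  # R2 = R3
    "C": {w for w in omega if w[0] == w[2]},  # R1 = R3
}

def prob(s):
    return Fraction(len(s), len(omega))

# Every pair must satisfy P(X and Y) == P(X) * P(Y).
pairwise = all(
    prob(events[x] & events[y]) == prob(events[x]) * prob(events[y])
    for x, y in combinations(events, 2)
)

# Mutual independence additionally needs the triple product condition.
triple = prob(events["A"] & events["B"] & events["C"])
mutual = pairwise and triple == (
    prob(events["A"]) * prob(events["B"]) * prob(events["C"])
)

print(pairwise, mutual, triple)  # True False 1/4
```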

## 5.2 Definition
#### Mutual independence
- $A_1, A_2, A_3$ are mutually independent if **both** properties hold
    + 1. **They are pairwise independent**: $P(A_i \cap A_j) = P(A_i) P(A_j)$, $\forall i,j \in [1, 3]$, $i \neq j$
    + 2. $P(A_1 \cap A_2 \cap A_3) = P(A_1)P(A_2)P(A_3)$

# 6. Bernoulli Scheme
- A probabilistic model that represents `a series of n independent trials`
- Each trial must produce 1 of 2 outcomes
    + `Success`: P(Success) = p
    + `Failure`: P(Failure) = 1 - p

#### Properties
- Outcome set: $|\Omega| = 2^n$
- The Success/Failure events of different trials are `mutually independent`

## 6.1 Example
- Toss an unfair coin `n` times
    + $H_i$: Get a head at i-th toss
        + $P(H_i) = p$
    + $T_i$: Get a tail at i-th toss
        + $P(T_i) = 1 - p$
- Calculate the probability of getting HHTHT in the first 5 trials: $P(HHTHT)$

#### Solve
- Since the trials are mutually independent:
    + $P(HHTHT) = P(H_1 \cap H_2 \cap T_3 \cap H_4 \cap T_5) = P(H_1)P(H_2)P(T_3)P(H_4)P(T_5) = p^3(1-p)^2$ 
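
The product rule for one specific sequence is easy to sketch in code. The function name `sequence_probability` and the bias $p = 3/5$ are illustrative assumptions; the example leaves $p$ symbolic.

```python
from fractions import Fraction
from math import prod

def sequence_probability(seq, p):
    """Probability of one specific H/T sequence when P(H) = p on every toss."""
    return prod(p if toss == "H" else 1 - p for toss in seq)

p = Fraction(3, 5)  # hypothetical bias of the coin
print(sequence_probability("HHTHT", p))  # 108/3125
print(p**3 * (1 - p)**2)                 # 108/3125
```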

## 6.2 Coin tossing model
- Toss an unfair coin `n` times
    + $H_i$: Get a head at i-th toss
        + $P(H_i) = p$
    + $T_i$: Get a tail at i-th toss
        + $P(T_i) = 1 - p$
- Calculate the probability of
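    + Event O: exactly `m` heads and `n-m` tails after n trials

#### Solve
- Each specific sequence with `m` heads and `n-m` tails has probability $p^m(1-p)^{n-m}$
- There are $\binom{n}{m}$ such sequences, so $P(O) = \binom{n}{m}p^m(1-p)^{n-m}$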
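
A brute-force enumeration shows how the probability of a single sequence relates to the probability of seeing `m` heads in any order: every qualifying sequence contributes $p^m(1-p)^{n-m}$, and there are $\binom{n}{m}$ of them. The values of `n`, `m` and `p` below are hypothetical, and `prob_m_heads` is an illustrative helper.

```python
from fractions import Fraction
from itertools import product
from math import comb

def prob_m_heads(n, m, p):
    """Sum the probabilities of every length-n sequence with exactly m heads."""
    total = Fraction(0)
    for seq in product("HT", repeat=n):
        if seq.count("H") == m:
            total += p**m * (1 - p)**(n - m)
    return total

n, m, p = 5, 3, Fraction(3, 5)  # hypothetical values
one_sequence = p**m * (1 - p)**(n - m)

print(one_sequence)           # 108/3125
print(comb(n, m))             # 10
print(prob_m_heads(n, m, p))  # 216/625
```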

# 7. Law of total probability
## 7.1 Example
- Experiment: Randomly pick a student
- Define events
    + $C$: The student knows Calculus
    + $H_1$: The student is in the 1st year of university
        + $P(H_1) = 0.5$
        + Probability that the student knows Calculus provided that the student is in the 1st year: $P(C | H_1) = 0.1$
    + $H_2$: The student is in the 2nd year of university
        + $P(H_2) = 0.3$
        + $P(C | H_2) = 0.8$
    + $H_3$: The student is in the 3rd year of university
        + $P(H_3) = 0.2$
        + $P(C | H_3) = 0.7$
- Notes
    + $H_i \cap H_j = \emptyset$, $\forall i \neq j$, $i,j \in [1,3]$
    + $H_1 \cup H_2 \cup H_3 = \Omega$
        + Hence $\sum\limits_{i=1}^3 P(H_i) = 1$

- Calculate $P(C)$

#### Solve
<img src="assets/4.png" width="450"/>


+ $P(C) = P(C \cap H_1) + P(C \cap H_2) + P(C \cap H_3) = P(C|H_1)P(H_1) + P(C|H_2)P(H_2) + P(C|H_3)P(H_3) = 0.1*0.5 + 0.8*0.3 + 0.7*0.2 = 0.43$ 
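
The weighted sum above can be sketched as a short computation over (prior, conditional) pairs. The dictionary `hypotheses` is an illustrative container for the numbers given in the example.

```python
from fractions import Fraction

# (P(H_i), P(C | H_i)) for years 1 to 3, taken from the example.
hypotheses = {
    "H1": (Fraction(5, 10), Fraction(1, 10)),
    "H2": (Fraction(3, 10), Fraction(8, 10)),
    "H3": (Fraction(2, 10), Fraction(7, 10)),
}

# The hypotheses must form a partition, so the priors sum to 1.
priors_sum = sum(prior for prior, _ in hypotheses.values())

# Law of total probability: P(C) = sum of P(H_i) * P(C | H_i).
p_c = sum(prior * cond for prior, cond in hypotheses.values())

print(priors_sum)         # 1
print(p_c, float(p_c))    # 43/100 0.43
```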


## 7.2 Law of total probability
- If a set of hypotheses $\{H_1, H_2, \dots, H_n\}$ satisfies 2 properties
    + 1. **No overlap**: $H_i \cap H_j = \emptyset$, $\forall i \neq j$, $i,j \in [1,n]$
    + 2. $H_1 \cup H_2 \cup \dots \cup H_n = \Omega$
        + Hence $\sum\limits_{i=1}^n P(H_i) = 1$
- Then we can recover $P(C)$ as
    + $P(C) = \sum\limits_{i=1}^n \left[P(H_i)P(C|H_i) \right]$

# 8. Bayes's rule
- Law of total probability: $P(H_i), P(C|H_i)$ -> $P(C)$
- Bayes's rule: $P(H_i), P(C|H_i), P(C)$ -> $P(H_i|C)$
    + It inverts the conditional probabilities: from $P(C|H_i)$ to $P(H_i|C)$

## 8.1 Example
- Experiment: Randomly pick a student
- Define events
    + $C$: The student knows Calculus
        + $P(C) = 0.43$
    + $H_1$: The student is in the 1st year of university
        + $P(H_1) = 0.5$
        + Probability that the student knows Calculus provided that the student is in the 1st year: $P(C | H_1) = 0.1$
    + $H_2$: The student is in the 2nd year of university
        + $P(H_2) = 0.3$
        + $P(C | H_2) = 0.8$
    + $H_3$: The student is in the 3rd year of university
        + $P(H_3) = 0.2$
        + $P(C | H_3) = 0.7$
- Notes
    + $H_i \cap H_j = \emptyset$, $\forall i \neq j$, $i,j \in [1,3]$
    + $H_1 \cup H_2 \cup H_3 = \Omega$
        + Hence $\sum\limits_{i=1}^3 P(H_i) = 1$
- Calculate $P(H_1 | C)$
    + The probability that the student is in the 1st year provided that the student knows Calculus

#### Solve
- $P(H_1 | C) = \frac{P(H_1 \cap C)}{P(C)} = \frac{P(C|H_1)P(H_1)}{P(C)} = \frac{0.1*0.5}{0.43} \approx 0.12$
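
The same calculation with exact fractions shows where the rounded value comes from. A minimal sketch with illustrative variable names.

```python
from fractions import Fraction

p_h1 = Fraction(1, 2)           # P(H_1)
p_c_given_h1 = Fraction(1, 10)  # P(C | H_1)
p_c = Fraction(43, 100)         # P(C), from the law of total probability

# Bayes's rule: P(H_1 | C) = P(C | H_1) * P(H_1) / P(C).
p_h1_given_c = p_c_given_h1 * p_h1 / p_c

print(p_h1_given_c, round(float(p_h1_given_c), 4))  # 5/43 0.1163
```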

## 8.2 Bayes's rule
- $P(H_i)$: Prior probability for hypothesis $H_i$
- $P(H_i | C)$: Posterior probability for hypothesis $H_i$
    + Meaning: The occurrence of C gives new information about the probability of $H_i$

#### Formulas

$$\begin{split}
P(H_i | C)  &= \frac{P(H_i \cap C)}{P(C)} \\
            &= \frac{P(H_i)P(C|H_i)}{P(C)} \\
            &= \frac{P(H_i)P(C|H_i)}{\sum\limits_{j=1}^n \left[ P(H_j)P(C|H_j) \right]}
\end{split}$$

- Note: $\sum\limits_{i=1}^n P(H_i) = 1$
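
The general formula can be sketched as a function that returns the posterior of every hypothesis at once. The name `posteriors` is illustrative; the inputs reuse the student example from 8.1.

```python
from fractions import Fraction

def posteriors(priors, likelihoods):
    """Return P(H_i | C) for every i, given P(H_i) and P(C | H_i)."""
    # Denominator: P(C) by the law of total probability.
    evidence = sum(p * lik for p, lik in zip(priors, likelihoods))
    return [p * lik / evidence for p, lik in zip(priors, likelihoods)]

priors = [Fraction(5, 10), Fraction(3, 10), Fraction(2, 10)]
likelihoods = [Fraction(1, 10), Fraction(8, 10), Fraction(7, 10)]

post = posteriors(priors, likelihoods)
print(post)       # [Fraction(5, 43), Fraction(24, 43), Fraction(14, 43)]
print(sum(post))  # 1
```

The posteriors sum to 1 because the hypotheses still form a partition after conditioning on C.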


## 8.3 Formulas
- Marginal Distribution: $P(A)$, $P(B)$
- Joint Distribution: $P(A \cap B)$
- Conditional distribution: $P(A|B)$, $P(B|A)$
- Let us define
    + $\bigcup\limits_i A_i = \Omega_A$ (disjoint $A_i$), so $\sum\limits_iP(A_i) = 1$
    + $\bigcup\limits_j B_j = \Omega_B$ (disjoint $B_j$), so $\sum\limits_jP(B_j) = 1$

#### Joint distribution
$$P(A_i \cap B_j) = P(A_i)P(B_j|A_i) = P(B_j)P(A_i|B_j)$$ 

#### Marginal distribution
$$P(A_i) = \sum\limits_jP(A_i\cap B_j) = \sum\limits_j \left[P(B_j)P(A_i|B_j) \right],\ \text{with} \sum\limits_jP(B_j) = 1$$
$$P(B_j) = \sum\limits_iP(A_i\cap B_j) = \sum\limits_i \left[P(A_i)P(B_j|A_i) \right],\ \text{with} \sum\limits_iP(A_i) = 1$$

#### Conditional distribution

$$P(A_i|B_j) = \frac{P(A_i \cap B_j)}{P(B_j)} = \frac{P(A_i)P(B_j|A_i)}{ \sum\limits_k \left[P(A_k)P(B_j|A_k) \right]},\ \text{with } \sum\limits_kP(A_k) = 1$$

$$P(B_j|A_i) = \frac{P(A_i \cap B_j)}{P(A_i)} = \frac{P(B_j)P(A_i|B_j)}{\sum\limits_k \left[P(B_k)P(A_i|B_k) \right]},\ \text{with } \sum\limits_kP(B_k) = 1$$


## 8.4 Exercise 1
- There's a rare disease that occurs in 1% of the population. There's a test that yields a positive result on 99% of ill people, and a false positive on 2% of healthy people. The test shows a positive result on a patient. What's the probability that the patient does not have the disease?

#### Solve
- Define Events
    + I: Ill
        + $P(I) = 0.01$
    + H: Healthy
        + $P(I) + P(H) = 1$ -> $P(H) = 0.99$
    + P: Test positive
        + $P(P|I) = 0.99$: Test Positive if person is ill
        + $P(P|H) = 0.02$: Test Positive if person is healthy

- Solve

$$\begin{split}
P(H | P)  &= \frac{P(H)P(P|H)}{P(H)P(P|H) + P(I)P(P|I)} \\
            &= \frac{0.99 \times 0.02}{0.99 \times 0.02 + 0.01 \times 0.99} \\
            &= \frac{2}{3}
\end{split}$$
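
The arithmetic can be double-checked with exact fractions. A minimal sketch; the variable names are illustrative.

```python
from fractions import Fraction

p_ill = Fraction(1, 100)
p_healthy = 1 - p_ill
p_pos_given_ill = Fraction(99, 100)
p_pos_given_healthy = Fraction(2, 100)

# P(P) by the law of total probability, then Bayes's rule for P(H | P).
p_pos = p_healthy * p_pos_given_healthy + p_ill * p_pos_given_ill
p_healthy_given_pos = p_healthy * p_pos_given_healthy / p_pos

print(p_pos, p_healthy_given_pos)  # 297/10000 2/3
```

Even with a 99% detection rate, two out of three positive results come from healthy people, because healthy people vastly outnumber ill ones.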

## 8.5 Exercise 2
- There are 2 taxi companies. 40% of company A's cars are German and 10% of company B's cars are German. All other cars are Korean. A taxi aggregator assigns orders to A or B with probabilities 1/3 and 2/3 respectively. If a German car arrives, what's the probability that it's from company B?

#### Solve
- Define Events
    + A: cars from Company A
        + $P(A) = \frac{1}{3}$
    + B: cars from Company B
        + $P(B) = \frac{2}{3}$
        + Note: $P(A) + P(B) = 1$
    + G: car is German
        + $P(G|A) = 0.4$: car is German if car from A
        + $P(G|B) = 0.1$: car is German if car from B

- Solve

$$\begin{split}
P(B | G)  &= \frac{P(B)P(G|B)}{P(B)P(G|B) + P(A)P(G|A)} \\
            &= \frac{\frac{2}{3} \times 0.1}{\frac{2}{3} \times 0.1 + \frac{1}{3} \times 0.4} \\
            &= \frac{1}{3}
\end{split}$$
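
As a sanity check on the result, the experiment can also be simulated: draw a company, draw a car type, and look only at the orders where a German car arrived. This Monte Carlo sketch is an illustration, not part of the original solution; the seed and the number of trials are arbitrary choices.

```python
import random

rng = random.Random(0)  # fixed seed so the run is repeatable
german = 0
german_from_b = 0

for _ in range(200_000):
    # The aggregator picks A with probability 1/3, otherwise B.
    company = "A" if rng.random() < 1 / 3 else "B"
    p_german = 0.4 if company == "A" else 0.1
    if rng.random() < p_german:
        german += 1
        german_from_b += company == "B"

# Fraction of German-car orders that came from company B.
estimate = german_from_b / german
print(estimate)
```

The estimate should land close to the exact answer $\frac{1}{3}$, with a small deviation due to sampling noise.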