# 1. Linear transformations of random variables
- We study a system of two random variables X and Y that are defined on the same experiment and can therefore depend on each other

## 1.1 System of random variables
- X and Y must be defined on the same set of outcomes
    - $\Omega$: set of outcomes
    - $P(\omega)$: probability of outcome $\omega \in \Omega$
    - Random variable $X: \Omega \to R$
    - Random variable $Y: \Omega \to R$
- We define $(X, Y)$: the system of random variables

## 1.2 Transformations of random variables
- Let X be defined as

| **P** | 0.2 | 0.5 | 0.3 |
|------------|-----|-----|-----|
| **X**      | -1  | 3   | 4   |


- X can be transformed into Y by

#### Y = X + c, where `c = constant`
- Example `c = 2`

| **P**    | 0.2 | 0.5 | 0.3 |
|---------------|-----|-----|-----|
| **X**         | -1  | 3   | 4   |
| **Y = X + 2** | 1   | 5   | 6   |

#### Y = cX, where `c = constant`
- Example `c = 2`

| **P**    | 0.2 | 0.5 | 0.3 |
|---------------|-----|-----|-----|
| **X**         | -1  | 3   | 4   |
| **Y = 2X** | -2   | 6  | 8   |

#### Combine $Y = c_1X + c_2$, where `c1,c2 = constant`
- Example `c1 = 2`, `c2 = 1`

| **P**    | 0.2 | 0.5 | 0.3 |
|---------------|-----|-----|-----|
| **X**         | -1  | 3   | 4   |
| **Y = 2X+1** | -1   | 7  | 9   |

- **Notes**: Only the values are transformed; the probabilities p do not change
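
The three tables above can be reproduced with a short sketch. The names `pmf_x` and `transform` are illustrative and not part of the notes; the PMF is stored as a dictionary mapping each value to its probability.

```python
from fractions import Fraction

# PMF of X from the table above: value -> probability
pmf_x = {-1: Fraction(2, 10), 3: Fraction(5, 10), 4: Fraction(3, 10)}

def transform(pmf, c1=1, c2=0):
    """Return the PMF of Y = c1*X + c2 (hypothetical helper)."""
    out = {}
    for x, p in pmf.items():
        y = c1 * x + c2
        # The probability travels with the value; it adds up if values coincide.
        out[y] = out.get(y, 0) + p
    return out

pmf_shift = transform(pmf_x, c2=2)        # Y = X + 2
pmf_scale = transform(pmf_x, c1=2)        # Y = 2X
pmf_both = transform(pmf_x, c1=2, c2=1)   # Y = 2X + 1

print(sorted(pmf_both.items()))
```

Each transformed PMF keeps the probabilities 0.2, 0.5, 0.3 and only relabels the values.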

# 2. Symmetric distributions

- A symmetric distribution has the following PMF

<img src="assets/8.png" width="450"/>

#### Properties
- $E(X) = x_0$
- $E(X- x_0) = 0$


# 3. Functions of random variables
- We define a function `f` that is applied to random variable `X` to produce `Z`
    + $Z = f(X)$
- X and Z must be defined on the same set of outcomes
    - $\Omega$: set of outcomes
    - $P(\omega)$: probability of outcome $\omega \in \Omega$
    - Random variable $X: \Omega \to R$
    - Random variable $Z: \Omega \to R$
- Properties 
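    + Only the values change from $x_i$ to $f(x_i)$; the probabilities do not change (if several $x_i$ map to the same value, their probabilities add up)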

## 3.1 PMF

| **P(X=x)** | $p_1$    | $p_2$    | ... | $p_n$    |
|------------|----------|----------|-----|----------|
| **X**      | $x_1$    | $x_2$    | ... | $x_n$    |
| $Z = f(X)$ | $f(x_1)$ | $f(x_2)$ | ... | $f(x_n)$ |

## 3.2 Expected value
- $E(Z) = E[f(X)]$

- Let $c, c_1, c_2$ = constant; X,Y,Z = random variables
    - $E(Z = c) = c$
    - $E(Z = X + c) = E(X) + c$
    - $E(Z = cX) = cE(X)$
    - $E(Z = c_1X + c_2) = c_1E(X) + c_2$
    - $E(Z = X + Y) = E(X) + E(Y)$
    - $E(Z = XY) = \sum\limits_{i=1}^m \sum\limits_{j=1}^n x_i y_j P(X=x_i \cap Y=y_j)$
    - $E(Z = XY) = E(X)E(Y)$ if `X and Y are independent`, see `6. Independent random variables`
    - $E[Z = E(X)] = E(X)$
    - $E[Z = XE(Y)] = E(X)E(Y)$
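
As a quick numerical check of the linearity rules, the sketch below computes $E[f(X)]$ directly from the PMF of X used in section 1.2. The helper name `expect` is an assumption made for this example.

```python
from fractions import Fraction

# PMF of X from section 1.2: value -> probability
pmf_x = {-1: Fraction(1, 5), 3: Fraction(1, 2), 4: Fraction(3, 10)}

def expect(pmf, f=lambda x: x):
    """E[f(X)] = sum over x of f(x) * P(X = x) (hypothetical helper)."""
    return sum(f(x) * p for x, p in pmf.items())

e_x = expect(pmf_x)                          # E(X)
e_lin = expect(pmf_x, lambda x: 2 * x + 1)   # E(2X + 1)
e_sq = expect(pmf_x, lambda x: x * x)        # E(X^2)

print(e_x, e_lin, e_sq)
```

Here `e_lin` equals `2 * e_x + 1`, while `e_sq` differs from `e_x ** 2`: the rules above hold for linear functions, not for arbitrary `f`.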
    
## 3.3 Variance
- Let $c, c_1, c_2$ = constant; X,Y,Z = random variables
    - $Var(Z = c) = 0$
    - $Var(Z = X + c) = Var(X)$
    - $Var(Z=cX) = c^2 Var(X)$
    - $Var(Z=c_1X + c_2) = c_1^2 Var(X)$
    - $Var(Z = X + Y) = Var(X) + Var(Y) + 2 cov(X,Y)$, See `7. Covariance`
    - $Var(Z = X + Y) = Var(X) + Var(Y)$ if `X and Y are independent` (more generally, whenever $cov(X,Y) = 0$)
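
The scaling and shifting rules for the variance can be checked the same way. This is a minimal sketch on the PMF of X from section 1.2; `expect` and `variance` are hypothetical helper names.

```python
from fractions import Fraction

# PMF of X from section 1.2: value -> probability
pmf_x = {-1: Fraction(1, 5), 3: Fraction(1, 2), 4: Fraction(3, 10)}

def expect(pmf, f=lambda x: x):
    """E[f(X)] computed from the PMF."""
    return sum(f(x) * p for x, p in pmf.items())

def variance(pmf, f=lambda x: x):
    """Var(f(X)) = E[(f(X) - E[f(X)])^2]."""
    mean = expect(pmf, f)
    return expect(pmf, lambda x: (f(x) - mean) ** 2)

var_x = variance(pmf_x)                          # Var(X)
var_shift = variance(pmf_x, lambda x: x + 2)     # Var(X + 2)
var_lin = variance(pmf_x, lambda x: 2 * x + 1)   # Var(2X + 1)

print(var_x, var_shift, var_lin)
```

Adding a constant leaves the variance unchanged, and multiplying by 2 multiplies it by 4.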




# 4. Joint probability distribution
## 4.1 Example
- Toss a fair coin 2 times
- Define a random variable X

$$X = \begin{cases}
1, \text{ if the 1st toss is a head} \\
0, \text{ otherwise}
\end{cases}$$

- Define random variable Y: $Y = 1 - X$
- Define random variable Z:

$$Z = \begin{cases}
1, \text{ if the 2nd toss is a head} \\
0, \text{ otherwise}
\end{cases}$$

#### PMFs
- X

| $\omega$   | {HH,HT} | {TH,TT} |
|------------|---------|---------|
| **P(X=x)** | 0.5     | 0.5     |
| **X**      | 1       | 0       |

- Y

| $\omega$   | {TH,TT} | {HH,HT} |
|------------|---------|---------|
| **P(Y=y)** | 0.5     | 0.5     |
| **Y**      | 1       | 0       |

- Z

| $\omega$   | {HH,TH} | {TT,HT} |
|------------|---------|---------|
| **P(Z=z)** | 0.5     | 0.5     |
| **Z**      | 1       | 0       |

#### Analyze
- X, Y, Z have the same PMF, but their pairwise behavior differs
- If we check
    + $P(X=0 \cap Y=0) = 0$
    + $P(X=0 \cap Z=0) = P(\{TT\}) = 0.25$
- The individual PMFs cannot capture this difference, so we need a new **definition** besides the PMF $\to$ `Joint probability distribution`

## 4.2 Definition
- Let X,Y: random variables
    + suppose X takes the values $x_1, x_2, \dots, x_m$
    + suppose Y takes the values $y_1, y_2, \dots, y_n$

- The joint distribution of X and Y is **a matrix** of $p_{ij}$, where
    + $p_{ij} = P(X = x_i \cap Y = y_j)$
    
| Y \ X     | $X = x_1$             | $X = x_2$             | $\dots$ | $X = x_m$             |
|-----------|-----------------------|-----------------------|---------|-----------------------|
| $Y = y_1$ | $P(X=x_1 \cap Y=y_1)$ | $P(X=x_2 \cap Y=y_1)$ | $\dots$ | $P(X=x_m \cap Y=y_1)$ |
| $Y = y_2$ | $P(X=x_1 \cap Y=y_2)$ | $P(X=x_2 \cap Y=y_2)$ | $\dots$ | $P(X=x_m \cap Y=y_2)$ |
| $\dots$   | $\dots$               | $\dots$               | $\dots$ | $\dots$               |
| $Y = y_n$ | $P(X=x_1 \cap Y=y_n)$ | $P(X=x_2 \cap Y=y_n)$ | $\dots$ | $P(X=x_m \cap Y=y_n)$ |


- Properties
    + $p_{ij} \geq 0$
    + $\sum\limits_{i=1}^m\sum\limits_{j=1}^np_{ij} = 1$
    

## 4.3 Example: Build `joint probability distribution` (Joint PMF) table for Example 4.1
- X and Y
    + $P(X=0 \cap Y=0) = 0$
    + $P(X=1 \cap Y=0) = P(\{HH,HT\}) = \frac{1}{2}$
    + $P(X=0 \cap Y=1) = P(\{TH,TT\}) = \frac{1}{2}$
    + $P(X=1 \cap Y=1) = 0$

| Y \ X | **0** | **1** |
|-------|-------|-------|
| **0** | 0     | 0.5   |
| **1** | 0.5   | 0     |


- X and Z
    + $P(X=0 \cap Z=0) = P(\{TT\}) = \frac{1}{4}$
    + $P(X=1 \cap Z=0) = P(\{HT\}) = \frac{1}{4}$
    + $P(X=0 \cap Z=1) = P(\{TH\}) = \frac{1}{4}$
    + $P(X=1 \cap Z=1) = P(\{HH\}) = \frac{1}{4}$

| Z \ X | **0** | **1** |
|-------|-------|-------|
| **0** | 0.25  | 0.25  |
| **1** | 0.25  | 0.25  |
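
Both joint tables can be built by enumerating the four equally likely outcomes. The sketch below is one way to do it; the function names `X`, `Y`, `Z` mirror the definitions in 4.1, while `joint_pmf` is a hypothetical helper.

```python
from collections import defaultdict
from fractions import Fraction
from itertools import product

# Sample space of two fair coin tosses: HH, HT, TH, TT
outcomes = ["".join(t) for t in product("HT", repeat=2)]
prob = Fraction(1, len(outcomes))  # each outcome has probability 1/4

def X(w):
    return 1 if w[0] == "H" else 0   # 1st toss is a head

def Y(w):
    return 1 - X(w)

def Z(w):
    return 1 if w[1] == "H" else 0   # 2nd toss is a head

def joint_pmf(A, B):
    """Map (a, b) -> P(A = a and B = b) by summing over outcomes."""
    table = defaultdict(Fraction)
    for w in outcomes:
        table[(A(w), B(w))] += prob
    return dict(table)

joint_xy = joint_pmf(X, Y)
joint_xz = joint_pmf(X, Z)
print(joint_xy)
print(joint_xz)
```

Pairs with probability 0, such as `(0, 0)` for X and Y, simply do not appear as keys in `joint_xy`.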

# 5. Marginal distribution
- Suppose we are given a `joint probability distribution` table of X and Y
    - but we are only interested in one of the random variables
- Given the joint probability distribution table $\to$ find the PMF of X and the PMF of Y

## 5.1 Marginal distribution
- If 2 random variables X, Y have the joint distribution
    + $p_{ij} = P(X = x_i \cap Y = y_j)$

- We can calculate marginal distribution of X (**a vector**) as
    + $P(X=x_i) = p_{i1} + p_{i2} + \dots + p_{in} = \sum\limits_{j=1}^n p_{ij}$

- We can calculate marginal distribution of Y (**a vector**) as
    + $P(Y=y_j) = p_{1j} + p_{2j} + \dots + p_{mj} = \sum\limits_{i=1}^m p_{ij}$

## 5.2 Example 
- Given the joint PMF of 2 random variables X,Y as

| X \ Y     | **Y = 0** | **Y = 1** | **Y = 2** |
|-----------|-----------|-----------|-----------|
| **X = 0** | 1/4       | 1/6       | 1/12      |
| **X = 1** | 1/16      | 1/8       | 5/16      |

- Find PMF of X and Y

#### Solve

| X \ Y        | **Y = 0** | **Y = 1** | **Y = 2** | $P(X = x_i)$ |
|--------------|-----------|-----------|-----------|--------------|
| **X = 0**    | 1/4       | 1/6       | 1/12      | 1/2          |
| **X = 1**    | 1/16      | 1/8       | 5/16      | 1/2          |
| $P(Y = y_j)$ | 5/16      | 7/24      | 19/48     |              |
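
The marginals are just row sums and column sums of the joint table. A minimal sketch with exact fractions (the variable names are illustrative):

```python
from fractions import Fraction

# Joint PMF from the example: rows are X = 0, 1; columns are Y = 0, 1, 2
joint = [
    [Fraction(1, 4), Fraction(1, 6), Fraction(1, 12)],   # X = 0
    [Fraction(1, 16), Fraction(1, 8), Fraction(5, 16)],  # X = 1
]

marginal_x = [sum(row) for row in joint]        # sum over Y for each X
marginal_y = [sum(col) for col in zip(*joint)]  # sum over X for each Y

print(marginal_x)
print(marginal_y)
```

Both marginals sum to 1, which is a useful sanity check on any joint table.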

## 5.3 Exercises

- Given the joint distribution table of 2 random variables X,Y as

| X \ Y     | **Y = 0** | **Y = 1** | **Y = 2** |
|-----------|-----------|-----------|-----------|
| **X = 0** | 1/4       | 1/6       | 1/12      |
| **X = 1** | 1/16      | 1/8       | 5/16      |


#### Find $P( X=1 \cap Y>0)$
- $P( X=1 \cap Y>0) = P( X=1 \cap Y=1) + P( X=1 \cap Y=2) = \frac{1}{8} + \frac{5}{16} = \frac{7}{16}$

#### Find $P( Y=2\ |\ X = 1)$

- $P( Y=2\ |\ X = 1) = \frac{P(Y=2 \cap X=1)}{P(X=1)} = \frac{5/16}{1/16 + 1/8 + 5/16} = \frac{5}{8}$
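
Both exercises reduce to summing selected cells of the joint table. The sketch below stores the table as a dictionary keyed by `(x, y)`; the variable names are assumptions for illustration.

```python
from fractions import Fraction

# Joint PMF keyed by (x, y)
joint = {
    (0, 0): Fraction(1, 4), (0, 1): Fraction(1, 6), (0, 2): Fraction(1, 12),
    (1, 0): Fraction(1, 16), (1, 1): Fraction(1, 8), (1, 2): Fraction(5, 16),
}

# P(X = 1 and Y > 0): add the cells that satisfy the event
p_event = sum(p for (x, y), p in joint.items() if x == 1 and y > 0)

# P(Y = 2 | X = 1) = P(Y = 2 and X = 1) / P(X = 1)
p_x1 = sum(p for (x, _), p in joint.items() if x == 1)
p_cond = joint[(1, 2)] / p_x1

print(p_event, p_cond)
```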

# 6. Independent random variables
- Consider a system of random variables X and Y, defined on the same probability space
- If X and Y are not independent, knowing something about X gives us new information about Y

## 6.1 Independence and Information inference
- Let X,Y: random variables
    + suppose X takes the values $x_1, x_2, \dots, x_m$
    + suppose Y takes the values $y_1, y_2, \dots, y_n$

- $P(Y = y_j\ |\ X = x_i)$ is the probability of $Y = y_j$ once we know that $X = x_i$ occurred, i.e. how observing X updates our information about Y
- X and Y are Independent if
    + $P(Y = y_j\ |\ X = x_i ) = P(Y = y_j)$, $\forall i \in [1,m]$, $\forall j \in [1,n]$ 
    + Or $P(X=x_i \cap Y = y_j ) = P(X = x_i)P(Y = y_j)$, $\forall i \in [1,m]$, $\forall j \in [1,n]$ 

## 6.2 Example: Independence
- Given the joint PMF of 2 random variables X,Z as

| **Z\X** | **X=0** | **X=1** |
|---------|---------|---------|
| **Z=0** | 0.25    | 0.25    |
| **Z=1** | 0.25    | 0.25    |

- Check independence

#### Solve

- Calculate PMF of X and Z

| **Z\X**    | **X=0** | **X=1** | $P(Z=z_j)$ |
|------------|---------|---------|------------|
| **Z=0**    | 0.25    | 0.25    | 0.5        |
| **Z=1**    | 0.25    | 0.25    | 0.5        |
| $P(X=x_i)$ | 0.5     | 0.5     |            |

- Check Independence
    + $P(X=0 \cap Z = 0 ) = P(X=0)P(Z=0) = 0.5*0.5 = 0.25$
    + $P(X=0 \cap Z = 1 ) = P(X=0)P(Z=1) = 0.5*0.5 = 0.25$
    + $P(X=1 \cap Z = 0 ) = P(X=1)P(Z=0) = 0.5*0.5 = 0.25$
    + $P(X=1 \cap Z = 1 ) = P(X=1)P(Z=1) = 0.5*0.5 = 0.25$
- Conclusion: `X and Z are Independent`
    + No information about Z can be inferred from knowing X

## 6.3 Example: Non-Independence
- Toss a fair coin 3 times. Let random variables
    + X: number of heads in the 1st and 2nd tosses
    + Y: number of heads in the 2nd and 3rd tosses

- Check independence

#### Solve
- Construct Joint PMF Table

| Y\X          | **X=0** | **X=1** | **X=2** | $P(Y = y_j)$ |
|--------------|---------|---------|---------|--------------|
| **Y=0**      | 1/8     | 1/8     | 0       | 1/4          |
| **Y=1**      | 1/8     | 1/4     | 1/8     | 1/2          |
| **Y=2**      | 0       | 1/8     | 1/8     | 1/4          |
| $P(X = x_i)$ | 1/4     | 1/2     | 1/4     |              |

- Counterexample: $P(X=0 \cap Y=0 ) = 1/8 \neq P(X=0)P(Y=0) = 1/4 \cdot 1/4 = 1/16$
- Conclusion: `X and Y are not Independent`
    + Information can be inferred. E.g.:
        + if X = 0 then Y must $\neq$ 2
        + if X = 0 then Y can be `0` or `1`
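
The joint table and the independence check for this example can be reproduced by enumerating the 8 equally likely toss sequences. This is a sketch; `is_independent` is a hypothetical helper that tests the product rule on every cell.

```python
from collections import defaultdict
from fractions import Fraction
from itertools import product

# Enumerate three fair tosses; 1 = head, 0 = tail
joint = defaultdict(Fraction)
for a, b, c in product((0, 1), repeat=3):
    x = a + b   # heads in tosses 1 and 2
    y = b + c   # heads in tosses 2 and 3
    joint[(x, y)] += Fraction(1, 8)

# Marginals from the joint PMF
px, py = defaultdict(Fraction), defaultdict(Fraction)
for (x, y), p in joint.items():
    px[x] += p
    py[y] += p

def is_independent(joint, px, py):
    """True if P(X=x, Y=y) == P(X=x) * P(Y=y) for every pair (x, y)."""
    return all(joint.get((x, y), 0) == px[x] * py[y] for x in px for y in py)

independent = is_independent(joint, px, py)
print(dict(joint))
print(independent)
```

A single cell that violates the product rule is enough to make `independent` false, which is the counterexample argument used above.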

## 6.4 Example: Completely Dependent
- Roll a fair die. Let random variables
    + X: the number shown on the die
    + Y: = 0 if X is even, = 1 if X is odd
- Check independence

#### Solve
- Construct Joint PMF Table

| Y\X          | **X=1** | **X=2** | **X=3** | **X=4** | **X=5** | **X=6** | $P(Y = y_j)$ |
|--------------|---------|---------|---------|---------|---------|---------|--------------|
| **Y=0**      | 0       | 1/6     | 0       | 1/6     | 0       | 1/6     | 1/2          |
| **Y=1**      | 1/6     | 0       | 1/6     | 0       | 1/6     | 0       | 1/2          |
| $P(X = x_i)$ | 1/6     | 1/6     | 1/6     | 1/6     | 1/6     | 1/6     |              |

- Counterexample: $P(X=1 \cap Y=0 ) = 0 \neq P(X=1)P(Y=0) = 1/6 \cdot 1/2 = 1/12$
- Conclusion: `X and Y are not Independent`
    + Y can be inferred completely if we know X. E.g.
        + If X = 1, Y = 1
        + If X = 2, Y = 0
        + ...
         

# 7. Covariance

- Covariance measures the relation between 2 random variables X and Y
- It is strongly related to dependence/Independence between X and Y

## 7.1 Definition

- Define Covariance of X and Y as $cov(X, Y) = E [ (X- E(X)) (Y - E(Y))    ]$

$$\begin{split}
cov(X,Y) &= E [ (X- E(X)) (Y - E(Y))    ]  \\
    &= E [XY -XE(Y) -YE(X) + E(X)E(Y)] \\
    &= E(XY) - E(X)E(Y) -E(X)E(Y) + E(X)E(Y) \\
    &= E(XY) - E(X)E(Y)
\end{split}$$



## 7.2 Var(X + Y)

$$\begin{split}
Var(X + Y) &= E [ (X + Y - E(X+Y))^2 ] \\
        &= E [ (X-E(X) + Y-E(Y))^2 ] \\
        &= E[(X-E(X))^2] + E[(Y-E(Y))^2] + 2 E[(X-E(X))(Y - E(Y))] \\
        &= Var(X) + Var(Y) + 2 cov(X,Y)
\end{split}$$

- Conversely, cov(X,Y) can be expressed through Var(X + Y) (rarely used)

$$cov(X,Y) = \frac{1}{2} \left[ Var(X + Y) - Var(X) - Var(Y) \right]$$

## 7.3 Properties
- If `X and Y are independent` then `cov(X,Y) = 0`. **Note**: the converse is not true

- Let $c, c_1, c_2$ = constant; X,Y = random variables
    - $cov(X,Y) = cov(Y,X)$
    - $cov(X,X) = Var(X)$
    - $cov(X+c, Y) = cov(X,Y)$
    - $cov(cX, Y) = c*cov(X,Y)$
    - $cov(X,\ c_1X + c_2) = c_1*cov(X,X) = c_1Var(X)$


## 7.4 Exercise 1
- Calculate the variance of a `Binomial random variable`
- **Note**: a Binomial random variable is the sum of n independent Bernoulli random variables with the same p. The Bernoulli PMF is

| **X**      | **X=1** | **X=0** |
|------------|---------|---------|
| **P(X=x)** | p       | 1-p     |

#### Solve
- Define
    + $X$: Binomial random variable
    + $X_i$, $i \in [1,n]$:  Bernoulli random variables


- Expected value of Bernoulli random variables
    + $E(X_i) = p$


- Calculate $(X_i - E(X_i))^2$

| $X_i$         | $X_i = 1$    | $X_i = 0$              |
|----------------|------------|----------------------|
| $P(X_i=x)$     | p          | 1-p                  |
| $(X_i - E(X_i))^2$ | $(1-p)^2$ | $(0-p)^2  = p^2$ |


- Variance of Binomial experiments

$$\begin{split}
Var(X) &= Var(X_1 + X_2 + \dots + X_n) \\
    &= Var(X_1) + Var(X_2) + \dots + Var(X_n) \text{, Since } X_i \text{ are independent} \\
    &= nVar(X_i) = nE[ (X_i-E(X_i))^2 ] \text{, since the } X_i \text{ are identically distributed} \\
    &= n \left[  P(X_i=1)(1-E(X_i))^2 + P(X_i=0)(0-E(X_i))^2 \right] \\
    &= n \left[   p(1-p)^2 + (1-p)p^2\right] \\
    &= np(1-p)
\end{split}$$
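
The result $np(1-p)$ can be cross-checked by computing the variance straight from the Binomial PMF $P(X=k) = \binom{n}{k} p^k (1-p)^{n-k}$. The values `n = 5` and `p = 1/3` below are arbitrary choices for illustration, and `binomial_variance` is a hypothetical helper.

```python
from fractions import Fraction
from math import comb

def binomial_variance(n, p):
    """Variance computed directly from the Binomial(n, p) PMF."""
    pmf = {k: comb(n, k) * p**k * (1 - p) ** (n - k) for k in range(n + 1)}
    mean = sum(k * q for k, q in pmf.items())
    return sum((k - mean) ** 2 * q for k, q in pmf.items())

n, p = 5, Fraction(1, 3)   # arbitrary example parameters
var_direct = binomial_variance(n, p)
var_formula = n * p * (1 - p)

print(var_direct, var_formula)
```

Using `Fraction` keeps the arithmetic exact, so the two values can be compared with `==` rather than a tolerance.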


## 7.5 Exercise 2

- Roll a fair die. Let random variables
    + X: the number shown on the die
    + Y: = 0 if X is even, = 1 if X is odd

- Given Joint PMF Table, Find `cov(X,Y)`

| Y\X          | **X=1** | **X=2** | **X=3** | **X=4** | **X=5** | **X=6** |
|--------------|---------|---------|---------|---------|---------|---------|
| **Y=0**      | 0       | 1/6     | 0       | 1/6     | 0       | 1/6     |
| **Y=1**      | 1/6     | 0       | 1/6     | 0       | 1/6     | 0       |

#### Solve

- Construct Joint PMF Table

| Y\X          | **X=1** | **X=2** | **X=3** | **X=4** | **X=5** | **X=6** | $P(Y = y_j)$ |
|--------------|---------|---------|---------|---------|---------|---------|--------------|
| **Y=0**      | 0       | 1/6     | 0       | 1/6     | 0       | 1/6     | 1/2          |
| **Y=1**      | 1/6     | 0       | 1/6     | 0       | 1/6     | 0       | 1/2          |
| $P(X = x_i)$ | 1/6     | 1/6     | 1/6     | 1/6     | 1/6     | 1/6     |              |


- Expected values
    + $E(X) = 1*\frac{1}{6} + 2*\frac{1}{6} + 3*\frac{1}{6} + 4*\frac{1}{6} + 5*\frac{1}{6} + 6*\frac{1}{6} = \frac{7}{2}$
    + $E(Y) = 0*\frac{1}{2} + 1*\frac{1}{2} = \frac{1}{2}$ 

- Calculate $E(XY)$

$$\begin{split}
E(XY) &= \sum\limits_{i=1}^m \sum\limits_{j=1}^n x_i y_j P(X=x_i \cap Y=y_j) \\
    &= (0)(1)(0) + (0)(2)(\frac{1}{6}) + (0)(3)(0) + (0)(4)(\frac{1}{6}) + (0)(5)(0) + (0)(6)(\frac{1}{6}) + (1)(1)(\frac{1}{6}) + (1)(2)(0) + (1)(3)(\frac{1}{6}) + (1)(4)(0) + (1)(5)(\frac{1}{6}) + (1)(6)(0) \\
    &= 1.5
\end{split}$$

- Calculate cov(X,Y)
    + $cov(X,Y) = E(XY) - E(X)E(Y) = 1.5 - 3.5*0.5 = -0.25$ 
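
The same computation as a short sketch. Since Y is a function of X, each die face determines one `(x, y)` pair with probability 1/6; the variable names are illustrative.

```python
from fractions import Fraction

p = Fraction(1, 6)                          # fair die
pairs = [(x, x % 2) for x in range(1, 7)]   # Y = 1 if X is odd, else 0

e_x = sum(x * p for x, _ in pairs)
e_y = sum(y * p for _, y in pairs)
e_xy = sum(x * y * p for x, y in pairs)
cov_xy = e_xy - e_x * e_y                   # cov(X,Y) = E(XY) - E(X)E(Y)

print(e_x, e_y, e_xy, cov_xy)
```

The negative sign reflects that the odd faces (Y = 1) are on average smaller than the even faces (Y = 0).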


# 8. Correlation
- Correlation is another coefficient that describes the relation between two random variables
- Correlation is closely related to covariance: it is the covariance normalized to be scale-free

## 8.1 Definition

$$corr(X,Y) = \frac{cov(X,Y)}{\sqrt{Var(X)Var(Y)}}$$

- $corr(X,Y) \in [-1, 1]$
- $corr(X,Y) = 0$ <=> `X,Y are uncorrelated`
- `Positive correlation`: corr(X,Y) > 0, closer to 1. If one variable takes a large value, the other tends to be large as well
- `Negative correlation`: corr(X,Y) < 0, closer to -1. If one variable takes a large value, the other tends to be small

## 8.2 Properties
- Let $c, c_1, c_2$ = constant; X,Y = random variables
    - $corr(cX,Y) = corr(X,Y)$, if `c > 0`
    - $corr(X, c_1X + c_2) = \frac{c_1}{|c_1|} = \begin{cases} 1 \text{, if } c_1>0 \\ -1 \text{, if } c_1<0 \end{cases}$

    - If $corr(X,Y) = \pm 1$ then $Y = c_1 X + c_2$ for some constants $c_1 \neq 0$, $c_2$ (with probability 1)
        + The closer corr(X,Y) is to $\pm 1$, the stronger the linear dependence between them

- $corr(X,Y) = 0$ <=> $cov(X,Y) = 0$
- If `X and Y are independent` then $corr(X,Y) = 0$. **Note**: the converse is not true
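
The sketch below illustrates these properties on three small cases: the die example from 7.5, an exact linear relation, and a dependent pair with zero correlation. The helper `corr` and the three example lists are assumptions made for this illustration; the third case ($X$ uniform on $\{-1, 0, 1\}$, $Y = X^2$) is a standard counterexample, not taken from the notes.

```python
from fractions import Fraction
from math import sqrt

def corr(triples):
    """Correlation of (X, Y) from a list of (x, y, probability) triples."""
    e_x = sum(x * p for x, y, p in triples)
    e_y = sum(y * p for x, y, p in triples)
    cov = sum((x - e_x) * (y - e_y) * p for x, y, p in triples)
    var_x = sum((x - e_x) ** 2 * p for x, y, p in triples)
    var_y = sum((y - e_y) ** 2 * p for x, y, p in triples)
    return float(cov) / sqrt(var_x * var_y)

sixth = Fraction(1, 6)
die = [(x, x % 2, sixth) for x in range(1, 7)]            # Y = parity of X
linear = [(x, -2 * x + 3, sixth) for x in range(1, 7)]    # Y = -2X + 3
parabola = [(x, x * x, Fraction(1, 3)) for x in (-1, 0, 1)]  # Y = X^2

corr_die = corr(die)
corr_linear = corr(linear)
corr_parabola = corr(parabola)
print(corr_die, corr_linear, corr_parabola)
```

In the third case Y is completely determined by X, yet the correlation is 0, which shows why zero correlation does not imply independence.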

