In [1]:
#renders matplotlib plots in the notebook
%matplotlib notebook 

#enables tab autocomplete feature
%config IPCompleter.greedy=True 

import numpy as np
import matplotlib.pyplot as plt
from scipy.stats import norm

## Jointly Distributed Random Variables

Random variables that map the same outcome in the sample space to different numbers are said to be jointly distributed.

Example: consider an experiment of randomly picking a student. The random variables that map the chosen student to a height and to a weight are jointly distributed random variables.

$[X(s_i) \;\; Y(s_i)]^T = [x_i \;\; y_i]^T$

- The terms Multiple random variables and random vector are used interchangeably

### Joint PMF or bivariate PMF

$P_{X,Y}[x_i, y_j] = P[X(s) = x_i, Y(s) = y_j]$

The joint PMF maps each value of the random vector (a point in the plane when there are two random variables) to a probability in [0, 1].

The joint PMF follows all the properties of a single-variable PMF
- $0 \le P_{X,Y}[x_i, y_j] \le 1$: every probability lies between 0 and 1
- $\sum_{i=1}^{N_x} \sum_{j=1}^{N_y} P_{X,Y}[x_i, y_j] = 1$: the probabilities of all possible pairs sum to 1
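
As a minimal sketch of these two properties, a joint PMF over finitely many values can be stored as a 2-D array with one row per $x_i$ and one column per $y_j$. The table below is made up for illustration, and the names `joint`, `all_in_unit_interval` and `sums_to_one` are hypothetical.

```python
import numpy as np

# Hypothetical joint PMF: X takes values {0, 1} (rows), Y takes values {0, 1, 2} (columns).
joint = np.array([[0.10, 0.20, 0.10],
                  [0.25, 0.05, 0.30]])

# Property 1: every entry lies in [0, 1].
all_in_unit_interval = bool(np.all((joint >= 0) & (joint <= 1)))

# Property 2: the entries over all (x_i, y_j) pairs sum to 1.
sums_to_one = bool(np.isclose(joint.sum(), 1.0))

print(all_in_unit_interval, sums_to_one)
```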

### Marginal PMF

If the joint PMF is known, then the PMF of X and the PMF of Y can be derived from it. These derived PMFs are called marginal PMFs.

To find the PMF of X, fix a value $x_i$, collect all pairs $(x_i, y_j)$ that contain it, and sum their probabilities. Repeating this for every $x_i$ gives the marginal PMF of X.

$P_X(x_i) = \sum_{j} P_{X,Y}(x_i, y_j)$

The term marginal originates from the practice of summing the probabilities along each row or column of the joint table and writing the sums in the gutter / margin.
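
The summation can be sketched with NumPy by summing a joint table along one axis. The table is a made-up example (rows index $x_i$, columns index $y_j$), and `p_x` and `p_y` are hypothetical names.

```python
import numpy as np

# Hypothetical joint PMF: rows are x_i in {0, 1}, columns are y_j in {0, 1, 2}.
joint = np.array([[0.10, 0.20, 0.10],
                  [0.25, 0.05, 0.30]])

# Marginal PMF of X: for each x_i, sum over all y_j (across the columns).
p_x = joint.sum(axis=1)

# Marginal PMF of Y: for each y_j, sum over all x_i (down the rows).
p_y = joint.sum(axis=0)

print(p_x, p_y)
```

Each marginal is itself a valid PMF, so `p_x` and `p_y` each sum to 1.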

#### Properties
- Given only the marginal PMFs of X and Y, the joint PMF cannot be reconstructed in general, because many different joint PMFs (typically infinitely many) yield the same marginal PMFs.
- The marginal PMF of Y gives the probability of $y_k$ occurring regardless of the outcome of X, and vice versa. The dependency information is lost when the marginal PMFs are derived, so it is impossible to rebuild the joint PMF from the marginal PMFs unless it is given that X and Y are independent.
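
To illustrate why the marginals do not determine the joint PMF, the sketch below builds two made-up joint tables for binary X and Y that differ but share the same marginal PMFs. The names `joint_a` and `joint_b` are hypothetical.

```python
import numpy as np

# Two hypothetical joint PMFs for X, Y in {0, 1}.
joint_a = np.array([[0.25, 0.25],
                    [0.25, 0.25]])   # X and Y independent
joint_b = np.array([[0.50, 0.00],
                    [0.00, 0.50]])   # X always equals Y

# Both tables produce the same marginal PMFs ...
same_p_x = np.allclose(joint_a.sum(axis=1), joint_b.sum(axis=1))
same_p_y = np.allclose(joint_a.sum(axis=0), joint_b.sum(axis=0))

# ... even though the joint PMFs themselves are different.
same_joint = np.allclose(joint_a, joint_b)

print(same_p_x, same_p_y, same_joint)
```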

### Joint Cumulative Distribution Function (CDF)

$F_{XY}(x,y) = P[X \le x, Y \le y]$

$F_{XY}(x,y) = \sum_{(i,j): x_i \le x, y_j \le y} P_{X,Y}[x_i, y_j]$

For all points / pairs $(x_i, y_j)$ where $x_i \le x, y_j \le y$ sum up their probabilities

#### Properties
- $0 \le F_{XY}(x,y) \le 1$
- Values at the endpoints
    - $F_{XY}(-\infty,-\infty) = 0$
    - $F_{XY}(\infty,\infty) = 1$
- Monotonically non-decreasing as x or y increases
- The CDF is right continuous: $F_{XY}(x^{-}, y) \le F_{XY}(x,y) = F_{XY}(x^{+},y)$, with strict inequality at the jump points of discrete random variables
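
A minimal sketch of the joint CDF for a discrete pair, assuming a made-up joint table: `joint_cdf` (a hypothetical helper) applies the summation formula directly, and `cdf_grid` evaluates the CDF at every support point using cumulative sums along both axes.

```python
import numpy as np

# Hypothetical support and joint PMF (rows: x_i, columns: y_j).
x_vals = np.array([0, 1])
y_vals = np.array([0, 1, 2])
joint = np.array([[0.10, 0.20, 0.10],
                  [0.25, 0.05, 0.30]])

def joint_cdf(x, y):
    """Sum the probabilities of all pairs (x_i, y_j) with x_i <= x and y_j <= y."""
    mask = (x_vals[:, None] <= x) & (y_vals[None, :] <= y)
    return float(joint[mask].sum())

# CDF evaluated at every support point: cumulative sums along both axes.
cdf_grid = joint.cumsum(axis=0).cumsum(axis=1)

print(joint_cdf(0, 1), joint_cdf(-1, -1), joint_cdf(1, 2))
print(cdf_grid)
```

Because the CDF of a discrete pair is a step function, `joint_cdf(0.5, 1.5)` returns the same value as `joint_cdf(0, 1)`.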

### Independence of Multiple Random Variables

$P_{X,Y}[x_i,y_j]=P_X[x_i]P_Y[y_j]$ for all $i$ and $j$

Hence, if X and Y are independent random variables, the joint PMF factors into the product of the marginal PMFs. Furthermore, the converse is true: if the joint PMF factors, then X and Y are independent.
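
The factorization gives a direct numerical test: compute both marginals and compare the joint table with their outer product. This is a sketch under the assumption that the joint PMF is stored as a 2-D array; `is_independent` is a hypothetical helper name.

```python
import numpy as np

def is_independent(joint):
    """Return True if the joint PMF equals the outer product of its marginals."""
    p_x = joint.sum(axis=1)   # marginal PMF of X
    p_y = joint.sum(axis=0)   # marginal PMF of Y
    return bool(np.allclose(joint, np.outer(p_x, p_y)))

# Hypothetical examples: one table built as a product, one with X always equal to Y.
independent = np.outer([0.4, 0.6], [0.5, 0.5])
dependent = np.array([[0.5, 0.0],
                      [0.0, 0.5]])

print(is_independent(independent), is_independent(dependent))
```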

### Transformation of Multiple Random Variables

X and Y are transformed into a new pair of random variables:

- W = g(X,Y)
- Z = h(X,Y)

As with the transformation of a single discrete random variable Y = f(X), where the PMF of Y at $y_i$ is the sum of the probabilities of all $x_k$ that map to $y_i$, the PMF of (W, Z) at $(w_i, z_j)$ is the sum of the probabilities of all pairs $(x_k, y_l)$ that map to $(w_i, z_j)$.

$p_{W,Z}[w_i, z_j] = \sum_{(k,l): g(x_k,y_l) = w_i, h(x_k, y_l) = z_j} p_{X,Y}[x_k,y_l]$
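
The sum over preimages can be sketched by looping over every pair in the support and accumulating its probability under the transformed pair. The choices W = X + Y and Z = |X - Y|, the uniform joint table, and the name `pmf_wz` are all assumptions made for illustration.

```python
from collections import defaultdict

import numpy as np

# Hypothetical support and joint PMF for X, Y in {0, 1}.
x_vals = [0, 1]
y_vals = [0, 1]
joint = np.array([[0.25, 0.25],
                  [0.25, 0.25]])

def g(x, y):
    return x + y          # W = g(X, Y), chosen for illustration

def h(x, y):
    return abs(x - y)     # Z = h(X, Y), chosen for illustration

# Accumulate the probability of every (x_k, y_l) under the pair (w, z) it maps to.
pmf_wz = defaultdict(float)
for k, x in enumerate(x_vals):
    for ell, y in enumerate(y_vals):
        pmf_wz[(g(x, y), h(x, y))] += float(joint[k, ell])

print(dict(pmf_wz))
```

Here the pairs (0, 1) and (1, 0) both map to (w, z) = (1, 1), so their probabilities are added together.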

### Expected Values

$E_{X,Y}[g(X,Y)] = \sum_{i}\sum_{j}g(x_i,y_j)p_{X,Y}[x_i, y_j]$

#### Expected value of sum of random variables
$g(X,Y) = X+Y$

$E_{X,Y}[X+Y] = \sum_{i}\sum_{j}(x_i+y_j)p_{X,Y}[x_i, y_j]$

$E_{X,Y}[X+Y] = \sum_{i}\sum_{j}x_ip_{X,Y}[x_i, y_j] + \sum_{i}\sum_{j}y_jp_{X,Y}[x_i, y_j]$

$E_{X,Y}[X+Y] = \sum_{i}x_i(\sum_{j}p_{X,Y}[x_i, y_j]) + \sum_{j}y_j(\sum_{i}p_{X,Y}[x_i, y_j])$

applying the marginal PMF equation

$E_{X,Y}[X+Y] = \sum_{i}x_iP_X(x_i) + \sum_{j}y_jP_Y(y_j)$

applying single variable expected value equation

$E_{X,Y}[X+Y] = E_X[X] + E_Y[Y]$

in general, for constants $a$ and $b$

$E_{X,Y}[aX+bY] = aE_X[X] + bE_Y[Y]$
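
Linearity can be checked numerically on a made-up joint table. Note that the table below is not a product of its marginals, so the check does not rely on independence. The constants `a` and `b` and the helper `expect` are hypothetical.

```python
import numpy as np

# Hypothetical support and joint PMF (rows: x_i, columns: y_j); X and Y are dependent here.
x_vals = np.array([0, 1])
y_vals = np.array([0, 1, 2])
joint = np.array([[0.10, 0.20, 0.10],
                  [0.25, 0.05, 0.30]])

# Grids holding x_i and y_j at every (i, j) position of the joint table.
X, Y = np.meshgrid(x_vals, y_vals, indexing="ij")

def expect(values):
    """E[g(X, Y)] as the double sum of g(x_i, y_j) * p[x_i, y_j]."""
    return float(np.sum(values * joint))

a, b = 2.0, 3.0
lhs = expect(a * X + b * Y)            # E[aX + bY] from the joint PMF
rhs = a * expect(X) + b * expect(Y)    # aE[X] + bE[Y]

print(lhs, rhs)
```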

#### Expected value of product of random variables

$g(X,Y) = XY$

If X and Y are independent, the expected value of the product factors:

$E_{X,Y}[XY] = E_X[X]E_Y[Y]$

in general, for independent X and Y

$E_{X,Y}[g(X)h(Y)] = E_X[g(X)]E_Y[h(Y)]$
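
The product rule relies on X and Y being independent. The sketch below compares $E[XY]$ with $E[X]E[Y]$ for two made-up joint tables over X, Y in {0, 1}: one built as a product of marginals and one in which X always equals Y. The helper `product_gap` is hypothetical.

```python
import numpy as np

x_vals = np.array([0, 1])
y_vals = np.array([0, 1])
X, Y = np.meshgrid(x_vals, y_vals, indexing="ij")

def product_gap(joint):
    """Return E[XY] - E[X]E[Y] for a joint PMF over x_vals and y_vals."""
    e_xy = np.sum(X * Y * joint)
    e_x = np.sum(X * joint)
    e_y = np.sum(Y * joint)
    return float(e_xy - e_x * e_y)

independent = np.outer([0.4, 0.6], [0.5, 0.5])   # joint PMF factors into marginals
dependent = np.array([[0.5, 0.0],
                      [0.0, 0.5]])               # X always equals Y

print(product_gap(independent), product_gap(dependent))
```

The gap is zero for the independent table and 0.25 for the dependent one, so the factorization fails when X and Y are dependent in this example.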

#### Variance of sum of random variables

$Var(X+Y) = Var(X) + Var(Y) + 2Cov(X,Y)$

$Cov(X,Y) = E_{XY}[(X - E_X[X])(Y - E_Y[Y])]$

$Cov(X,Y) = E_{XY}[XY] - E_X[X]E_Y[Y]$
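
As a numerical check of the variance and covariance identities, the sketch below uses a made-up joint table in which X always equals Y, so the covariance is non-zero. The helper `expect` and the variable names are hypothetical.

```python
import numpy as np

# Hypothetical joint PMF for X, Y in {0, 1} with X always equal to Y.
x_vals = np.array([0, 1])
y_vals = np.array([0, 1])
joint = np.array([[0.5, 0.0],
                  [0.0, 0.5]])
X, Y = np.meshgrid(x_vals, y_vals, indexing="ij")

def expect(values):
    """E[g(X, Y)] as the double sum of g(x_i, y_j) * p[x_i, y_j]."""
    return float(np.sum(values * joint))

mean_x, mean_y = expect(X), expect(Y)
var_x = expect((X - mean_x) ** 2)
var_y = expect((Y - mean_y) ** 2)

# Covariance computed with both forms given above.
cov_definition = expect((X - mean_x) * (Y - mean_y))
cov_shortcut = expect(X * Y) - mean_x * mean_y

# Variance of the sum computed directly from the joint PMF.
var_sum = expect((X + Y - mean_x - mean_y) ** 2)

print(var_sum, var_x + var_y + 2 * cov_shortcut)
```

When X and Y are independent, $E_{XY}[XY] = E_X[X]E_Y[Y]$, so the covariance is zero and the variance of the sum reduces to $Var(X) + Var(Y)$.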



In [2]:
import numpy as np

In [5]:
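a = np.random.randn(3, 3)  # 3x3 matrix of standard normal samples
b = np.random.randn(3, 1)  # 3x1 column vector of standard normal samples
c = a*b  # element-wise product; b is broadcast across the columns, so row i of a is scaled by b[i, 0]
c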

array([[-0.0472956 , -0.99253432,  0.14311947],
       [ 0.77520517,  0.06156918, -1.13162869],
       [ 0.10575575, -0.33657037,  0.88463182]])

In [7]:
v = np.random.random(3)
v.shape

(3,)