In [2]:
from datascience import *
import numpy as np
from math import *

## Transformations

In some cases, we may be interested in the distribution of a transformation of a random variable. For example, if we know the distribution of $X$, we may wish to know the distribution of $X^2$ or $2X$. 

It helps to work from the pmf or cdf of the original random variable. Let $Y=t(X)$ where $X$ is discrete and $t$ is one-to-one:

$$
f_Y(y)=P(Y=y) = P(t(X)=y) = P( X = t^{-1}(y))
$$
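When $t$ is not one-to-one (for example $t(x)=x^2$), $t^{-1}(y)$ can be read as the set of all $x$ with $t(x)=y$, and $P(Y=y)$ is the sum of $P(X=x)$ over that set. The sketch below illustrates this; the helper name `transform_pmf` and the pmf used are illustrative assumptions, not part of the original notebook.

```python
from collections import defaultdict
from fractions import Fraction

def transform_pmf(pmf, t):
    """Return the pmf of Y = t(X), given the pmf of a discrete X as {value: probability}."""
    out = defaultdict(Fraction)
    for x, p in pmf.items():
        out[t(x)] += p  # every x with t(x) == y contributes to P(Y = y)
    return dict(out)

# Illustrative pmf (an assumption): X uniform on {-1, 0, 1}
pmf_x = {-1: Fraction(1, 3), 0: Fraction(1, 3), 1: Fraction(1, 3)}
pmf_y = transform_pmf(pmf_x, lambda x: x ** 2)
print(pmf_y)
```

Exact fractions are used here only so the probabilities can be compared without floating-point noise.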

In the continuous case, let's consider the cdf. When $t$ is strictly increasing:

$$
F_Y(y)=P(Y\leq y) = P(t(X)\leq y) = P(X \leq t^{-1}(y)) = F_X(t^{-1}(y))
$$

### Discrete

#### Example 1

Suppose the pmf for $X$ is given by the following table: 

 | value of $X$  | -2 | -1 | 0 | 1 | 2 | 
 | ------ | ------ | ----- | ----- | ----- | ----- |
 | probability | 0.05 | 0.10 | 0.35 | 0.30 | 0.20 |

Find the distribution of $X^2$ and calculate $E(X^2)$. Does $E(X^2) = [E(X)]^2$? 

In [5]:
# Distribution of X^2 based upon mapping
P0 = 0.35
P1 = 0.30 + 0.10
P4 = 0.05 + 0.20
EX_squared = 0*P0 + 1*P1 + 4*P4 # using E(x) = sum(x*f(x))
EX = -2*0.05 + -1*0.10 + 0*0.35 + 1*0.30 + 2*0.20 # manually entering values instead of collecting row values, as above
print('The E(X^2) is',EX_squared,'and E(X)^2 is',EX ** 2,'. These results are not equal.')

The E(X^2) is 1.4 and E(X)^2 is 0.25 . These results are not equal.


#### Example 2
Let $X \sim \textsf{Binom}(n,p)$. What is the pmf for $X+3$? Make sure you specify the domain of $X+3$. 

If $X \sim \textsf{Binom}(n,p)$, then the pmf of $X$ is $$f_X(x) = {n \choose x} p^x (1-p)^{n-x}, \quad x = 0, 1, \ldots, n$$

Let $Y = X+3$, so $t(x) = x+3$ and $t^{-1}(y) = y-3$.

Because $X$ is discrete, we work with the pmf rather than the cdf: $$f_Y(y) = P(Y = y) = P(X+3 = y) = P(X = y-3) = f_X(y-3)$$

Therefore, the pmf of $X+3$ is $$f_Y(y) = {n \choose y-3} p^{y-3} (1-p)^{n-y+3}$$

Since the support of $X$, $\{0, 1, \ldots, n\}$, is shifted by 3, the domain of $X+3$ is $\{3, 4, \ldots, n+3\}$.
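As a quick numerical check of the shifted pmf, the sketch below evaluates it on $\{3, \ldots, n+3\}$ and confirms that it sums to 1 and has mean $np+3$. The function names and the values of `n` and `p` are illustrative assumptions.

```python
from math import comb

def binom_pmf(x, n, p):
    """pmf of Binom(n, p) at x; zero outside {0, ..., n}."""
    if x < 0 or x > n:
        return 0.0
    return comb(n, x) * p ** x * (1 - p) ** (n - x)

def shifted_pmf(y, n, p):
    """pmf of Y = X + 3, obtained by evaluating the pmf of X at y - 3."""
    return binom_pmf(y - 3, n, p)

n, p = 5, 0.3  # illustrative values, not from the text
support = range(3, n + 4)  # the integers 3, ..., n + 3
total = sum(shifted_pmf(y, n, p) for y in support)
mean_y = sum(y * shifted_pmf(y, n, p) for y in support)
print(total, mean_y)  # total should be close to 1, mean close to n*p + 3
```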

#### Example 3

Let $X \sim \textsf{Unif}(0,1)$. Let $Y=X^2$. Find the **pdf** of $Y$. Again, specify the domain of $Y$. 

If $X \sim \textsf{Unif}(0,1)$, then $f_X(x) = 1$ for $0 < x < 1$ and $$F_X(x) = \int\limits_0^x f_X(u)\, du = x$$ Letting $Y = X^2$, we know that $$f_Y(y) = \frac{d}{dy}F_Y(y)$$

To find $F_Y(y)$, we use the cdf method above. Since $t(x) = x^2$ is increasing on $(0,1)$, we get $t^{-1}(y) = \sqrt{y}$, and since $F_X(x) = x$ for the uniform distribution, $F_Y(y) = F_X(\sqrt{y}) = \sqrt{y}$. Taking the derivative, the pdf of $Y$ is $$f_Y(y) = \frac{0.5}{\sqrt{y}}$$ where $y$ is an element of the domain $(0,1]$.
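A simulation can serve as a sanity check on this result: if $Y = X^2$ with $X$ uniform on $(0,1)$, the empirical cdf of $Y$ should be close to $\sqrt{y}$. This is a minimal sketch; the seed, sample size, and helper name `empirical_cdf` are arbitrary choices.

```python
import random

random.seed(0)  # fixed seed so the run is reproducible
n_draws = 200_000
ys = [random.random() ** 2 for _ in range(n_draws)]  # draws of Y = X^2

def empirical_cdf(samples, y):
    """Fraction of samples less than or equal to y."""
    return sum(s <= y for s in samples) / len(samples)

for y in (0.04, 0.25, 0.64):
    # empirical P(Y <= y) next to the derived cdf sqrt(y)
    print(y, empirical_cdf(ys, y), y ** 0.5)
```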

## Moment Generating Functions (MGF)

One powerful concept in probability is the moment generating function (mgf). Let $X$ be a random variable. The mgf of $X$ is denoted by $M_X(t)$. This function is powerful because it can be used as a shortcut to find the $k$th moment of $X$ about the origin, $E(X^k)$ (not the central moment). Specifically,

$$
E(X^k) = \frac{d^k}{dt^k} M_X(t) \bigg |_{t=0}
$$

For example, if you know the moment generating function of $X$, you can take its first derivative with respect to $t$ and evaluate at $t=0$; the result is the expected value of $X$, $E(X)$. The second derivative evaluated at $t=0$ gives $E(X^2)$, and so on.

The mgf of $X$ is found by

$$
M_X(t) = E(e^{tX})
$$

#### Example 4: 

Let $X$ be a random variable with the exponential distribution with parameter $\lambda >0$. Recall that $f_X(x) = \lambda e^{-\lambda x}$, for $x>0$. Find the mgf of $X$. Use it to verify that $E(X) = \frac{1}{\lambda}$. 



We have that $$E(X^k) = \frac{d^k}{dt^k}M_X(t)\bigg|_{t=0}$$ By definition, the mgf of $X$ is $$M_X(t) = E(e^{tX}) = \int\limits_0^\infty e^{tx}\lambda e^{-\lambda x}\, dx = \lambda\int\limits_0^\infty e^{-(\lambda - t)x}\, dx$$

In order for this integral to converge, $t$ must be less than $\lambda$. Solving, we get $$M_X(t) = -\frac{\lambda}{t-\lambda} = \frac{\lambda}{\lambda - t}, \quad t < \lambda$$

For the expected value, we take the first derivative of $M_X(t)$ with respect to $t$, $$\frac{d}{dt}M_X(t) = \frac{\lambda}{(\lambda - t)^2}$$ and evaluate it at $t = 0$ to get $E(X) = \frac{1}{\lambda}$. Because $\lambda > 0$, the point $t = 0$ lies inside the region $t < \lambda$ where the mgf is defined.
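The closed form can be checked numerically. The sketch below compares $\lambda/(\lambda - t)$ with a midpoint-rule approximation of the defining integral and estimates $M_X'(0)$ with a central difference. The rate `lam`, the truncation point `upper`, and the step sizes are illustrative assumptions.

```python
from math import exp

lam = 2.0  # illustrative rate, an assumption

def mgf_closed(t):
    """Closed-form mgf of the exponential distribution, valid for t < lam."""
    return lam / (lam - t)

def mgf_numeric(t, upper=60.0, steps=200_000):
    """Midpoint-rule approximation of the integral of e^{tx} * lam * e^{-lam*x} over (0, upper)."""
    h = upper / steps
    return sum(exp((t - lam) * (i + 0.5) * h) * lam * h for i in range(steps))

h = 1e-5
mean_fd = (mgf_closed(h) - mgf_closed(-h)) / (2 * h)  # central difference for M'(0)
print(mgf_numeric(0.5), mgf_closed(0.5))
print(mean_fd, 1 / lam)
```

Truncating the integral at `upper` is harmless here because the integrand decays exponentially when $t < \lambda$.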

#### Example 5:

The moment generating function of a random variable with the binomial distribution (with parameters $n$ and $p$) is given by $M_X(t) = (pe^t + 1 - p)^n$. Use the mgf to verify that $E(X)=np$ and $V(X)=np(1-p)$. Note that $V(X)=E(X^2)-[E(X)]^2$. 

Differentiating $M_X(t) = (pe^t + 1 - p)^n$ once with respect to $t$ gives $$\frac{d}{dt}M_X(t) = npe^t(pe^t+1-p)^{n-1}$$ which equals $np$ at $t = 0$, so $E(X) = np$.
To find $V(X)$, we use $$\frac{d^2}{dt^2}M_X(t) = (n^2-n)(pe^t)^2(pe^t+1-p)^{n-2}+npe^t(pe^t+1-p)^{n-1}$$

Evaluated at $t = 0$, this gives $E(X^2) = (n^2-n)p^2 + np$, so $V(X) = E(X^2) - [E(X)]^2 = (n^2-n)p^2 + np - n^2p^2 = np(1-p)$.
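The same conclusion can be checked without symbolic differentiation by approximating the first two derivatives of the mgf at $t=0$ with finite differences. This is a rough numerical sketch; the values of `n`, `p`, and the step `h` are illustrative assumptions.

```python
from math import exp

n, p = 10, 0.3  # illustrative parameters, an assumption

def mgf(t):
    """mgf of Binom(n, p) as given in the example."""
    return (p * exp(t) + 1 - p) ** n

h = 1e-4
m1 = (mgf(h) - mgf(-h)) / (2 * h)               # approximates M'(0) = E(X)
m2 = (mgf(h) - 2 * mgf(0) + mgf(-h)) / h ** 2   # approximates M''(0) = E(X^2)
variance = m2 - m1 ** 2                          # V(X) = E(X^2) - [E(X)]^2
print(m1, n * p)
print(variance, n * p * (1 - p))
```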

### Important Results

1) Let $X$ and $Y$ be random variables with mgfs $M_X$ and $M_Y$. $X$ and $Y$ are identically distributed if and only if $M_X(t) = M_Y(t)$ for all $t$ in some interval containing 0. 

2) MGF of linear transformation of random variable: If $a$ and $b$ are constants, then 

$$
M_{aX+b}(t) = e^{bt}M_X(at)
$$

3) MGF of sum of independent random variables: If $X$ and $Y$ are independent random variables with mgfs $M_X$ and $M_Y$, then

$$
M_{X+Y}(t)=M_X(t) \cdot M_Y(t)
$$
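Results 2 and 3 can be checked exactly for discrete random variables, since the mgf is then a finite sum. The sketch below uses the pmf from Example 1 for $X$; the second pmf `pmf_w`, the constants `a`, `b`, `t`, and the helper name `mgf_from_pmf` are illustrative assumptions.

```python
from math import exp, isclose

def mgf_from_pmf(pmf, t):
    """E(e^{tX}) for a discrete X given as {value: probability}."""
    return sum(p * exp(t * x) for x, p in pmf.items())

# pmf of X taken from Example 1; pmf of W is an illustrative assumption
pmf_x = {-2: 0.05, -1: 0.10, 0: 0.35, 1: 0.30, 2: 0.20}
pmf_w = {0: 0.5, 1: 0.5}
a, b, t = 3.0, -1.0, 0.4

# Result 2: mgf of aX + b computed directly vs. e^{bt} * M_X(at)
pmf_lin = {a * x + b: p for x, p in pmf_x.items()}
lhs_lin = mgf_from_pmf(pmf_lin, t)
rhs_lin = exp(b * t) * mgf_from_pmf(pmf_x, a * t)

# Result 3: pmf of X + W for independent X and W (a convolution), then its mgf
pmf_sum = {}
for x, px in pmf_x.items():
    for w, pw in pmf_w.items():
        pmf_sum[x + w] = pmf_sum.get(x + w, 0.0) + px * pw
lhs_sum = mgf_from_pmf(pmf_sum, t)
rhs_sum = mgf_from_pmf(pmf_x, t) * mgf_from_pmf(pmf_w, t)

print(isclose(lhs_lin, rhs_lin), isclose(lhs_sum, rhs_sum))
```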

 

#### Example 6 

Let $X \sim \textsf{Exp}(\lambda)$. Find the distribution of $Y=3X$.

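Reusing the result from Example 4 together with the linear transformation result ($a=3$, $b=0$), we have $$M_Y(t) = M_X(3t) = -\frac{\lambda}{3t-\lambda} = \frac{\lambda/3}{\lambda/3 - t}, \quad t < \lambda/3$$ This is the mgf of an exponential distribution with parameter $\lambda/3$, so $Y \sim \textsf{Exp}(\lambda/3)$.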
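Since $\lambda/(\lambda - 3t)$ has the form of an exponential mgf with rate $\lambda/3$, a simulation of $Y = 3X$ should have mean close to $3/\lambda$ and cdf close to $1 - e^{-(\lambda/3)y}$. The sketch below checks both; the rate `lam`, the seed, and the evaluation point `y0` are illustrative assumptions.

```python
import random
from math import exp

random.seed(1)  # fixed seed so the run is reproducible
lam = 1.5       # illustrative rate, an assumption
n_draws = 200_000
ys = [3 * random.expovariate(lam) for _ in range(n_draws)]  # draws of Y = 3X

sample_mean = sum(ys) / n_draws
y0 = 2.0
empirical = sum(y <= y0 for y in ys) / n_draws
theoretical = 1 - exp(-(lam / 3) * y0)  # cdf of an exponential with rate lam / 3 at y0
print(sample_mean, 3 / lam)
print(empirical, theoretical)
```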

#### Example 7 

Suppose $X_1, X_2, ..., X_n$ are independent identically distributed $\textsf{Norm}(\mu,\sigma)$. Find the distribution of $S=X_1+X_2+...+X_n$ and $\bar{X} = \frac{X_1+X_2+...+X_n}{n}$. Note that the mgf of a normally distributed random variable is $M_X(t)=e^{\mu t+\sigma^2 t^2/2}$.

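Distribution of $S$: by independence, $M_S(t) = [M_X(t)]^n = e^{n\mu t + n\sigma^2t^2/2}$, which is the mgf of $\textsf{Norm}(n\mu, \sqrt{n}\,\sigma)$.

Distribution of $\bar{X}$: since $\bar{X} = S/n$, $M_{\bar{X}}(t) = M_S(t/n) = e^{\mu t + \sigma^2t^2/(2n)}$, which is the mgf of $\textsf{Norm}(\mu, \sigma/\sqrt{n})$.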
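A short simulation gives a rough check: the sum of $n$ independent normal draws should have mean $n\mu$ and standard deviation $\sqrt{n}\,\sigma$, and the sample mean should have mean $\mu$ and standard deviation $\sigma/\sqrt{n}$. The values of `mu`, `sigma`, `n`, the seed, and the number of repetitions are illustrative assumptions.

```python
import random
from statistics import fmean, pstdev

random.seed(2)  # fixed seed so the run is reproducible
mu, sigma, n = 1.0, 2.0, 9  # illustrative values, an assumption
reps = 50_000

# Each repetition draws n independent Norm(mu, sigma) values and records their sum
sums = [sum(random.gauss(mu, sigma) for _ in range(n)) for _ in range(reps)]
means = [s / n for s in sums]

print(fmean(sums), n * mu, pstdev(sums), sigma * n ** 0.5)
print(fmean(means), mu, pstdev(means), sigma / n ** 0.5)
```

Matching the first two moments does not by itself prove normality; that part of the conclusion comes from the form of the mgf.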