# Chapter 2 Random Variables Part VI

#### *Zhuo Jianchao* 

Feb 16, 2020 *Rev 1*

## Moment Generating Function

We have already defined moments. As practice, let us compute one of them, the variance $\sigma_{X}^2$.

$$ \begin{aligned}
\sigma_{X}^2 &= E[(X-\mu_{X})^2]\\
&= E[X^2-2X\mu_{X}+\mu_{X}^2]\\
&= E(X^2)-2\mu_{X}^2+\mu_{X}^2\\
&= E(X^2)-\mu_{X}^2
\end{aligned}
$$

where $E(X^2)=\int_{-\infty}^{+\infty}x^2f_{X}(x)\:dx$

This derivation gives another expression for the variance:
$$\sigma_{X}^2=E(X^2)-\mu_{X}^2$$
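
The identity can be checked exactly on a small discrete example. The sketch below assumes a fair six-sided die as the distribution (an illustrative choice, not taken from the text) and uses exact fractions so that both sides can be compared without rounding; the helper name `expect` is hypothetical.

```python
from fractions import Fraction

# Assumed example distribution: a fair six-sided die, P(X = x) = 1/6.
pmf = {x: Fraction(1, 6) for x in range(1, 7)}

def expect(g, pmf):
    """E[g(X)] for a discrete random variable given as {value: probability}."""
    return sum(g(x) * p for x, p in pmf.items())

mu = expect(lambda x: x, pmf)  # E(X)

# Variance from the definition E[(X - mu)^2].
var_by_definition = expect(lambda x: (x - mu) ** 2, pmf)

# Variance from the shortcut E(X^2) - mu^2.
var_by_shortcut = expect(lambda x: x ** 2, pmf) - mu ** 2

print(mu, var_by_definition, var_by_shortcut)
```

Both routes give the same fraction, as the derivation predicts.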

The *k*-th central moment can thus be derived from the moments of order up to *k*. But as $f_{X}(x)$ becomes more complex, these integrals become increasingly hard to evaluate, and higher-order moments, whose integrands contain higher powers of $x$, are harder still.

To obtain higher-order moments, we can turn to a tool called the **moment generating function** (MGF), which makes the computation more convenient.

### Definition 13 Moment Generating Function

The MGF of a random variable $X$ with PMF or PDF $f_{X}(x)$ is defined as
$$\begin{aligned}
M_{X}(t) = E(e^{tX})
  = \begin{cases}
  \sum_{x \in \Omega_{X}}e^{tx}f_{X}(x) & \text{if } X \text{ is a DRV}, \\
  \int_{-\infty}^{+\infty}e^{tx}f_{X}(x)\: dx & \text{if } X \text{ is a CRV}.
    \end{cases}
\end{aligned}
$$

The MGF $M_X(t)$ is a function of $t$, not of $X$: the expectation sums or integrates over the values of $X$ and leaves a real number for each fixed $t$.
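
As a worked illustration of the definition, suppose $X$ is a CRV with the exponential PDF $f_{X}(x)=\lambda e^{-\lambda x}$ for $x\ge 0$ and $f_{X}(x)=0$ otherwise, where $\lambda>0$. This distribution is chosen here only as an example. Then

$$\begin{aligned}
M_{X}(t) &= \int_{0}^{+\infty}e^{tx}\lambda e^{-\lambda x}\:dx\\
&= \lambda\int_{0}^{+\infty}e^{-(\lambda-t)x}\:dx\\
&= \frac{\lambda}{\lambda-t}, \quad t<\lambda.
\end{aligned}
$$

For $t\ge\lambda$ the integral diverges, so in this example $M_{X}(t)$ is finite only for $t<\lambda$, an interval that still contains a neighborhood of $t=0$.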

***Theorem:*** The *k*-th derivative of $M_{X}(t)$ evaluated at $t=0$ is the *k*-th moment of $X$, that is,
$$M_{X}^{(k)}(0)=E(X^k)$$

This makes the *k*-th moment convenient to compute, but why does it hold?

***Proof:*** We can expand $e^{tX}$ as a power series in $t$:

$$e^{tX} = 1+tX+\frac{(tX)^2}{2!}+\frac{(tX)^3}{3!}+\cdots+\frac{(tX)^k}{k!}+\cdots = \sum_{n=0}^{\infty}\frac{(tX)^n}{n!}.$$

By the definition of MGF of $X$,

$$\begin{aligned}
M_X(t) &= E(e^{tX})=E(1+tX+\frac{(tX)^2}{2!}+\cdots+\frac{(tX)^k}{k!}+\cdots)\\
&= 1+tE(X)+\frac{t^2E[X^2]}{2!}+\cdots+\frac{t^kE[X^k]}{k!}+\cdots\\
&= 1+\mu_{X}t+\frac{\mu_2}{2!}t^2+\cdots+\frac{\mu_k}{k!}t^k+\cdots\\
&= \sum_{n=0}^{\infty}\frac{\mu_n}{n!}t^n
\end{aligned}
$$

where $\mu_{k}=E(X^k)$ stands for the *k*-th moment of $X$, so that $\mu_{1}=\mu_{X}$.

Differentiating *k* times with respect to $t$ eliminates every term whose power of $t$ is less than *k*, which yields
$$M_{X}^{(k)}(t)=\sum_{n=k}^{\infty}\frac{\mu_n}{(n-k)!}t^{n-k}=\mu_k+\mu_{k+1}t+\frac{\mu_{k+2}}{2!}t^2+\cdots$$

Setting $t=0$, every term that still contains a factor of $t$ vanishes, which yields
$$M_{X}^{(k)}(0)=\mu_k=E(X^k)$$
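
The theorem can be checked numerically. The sketch below assumes a fair six-sided die as the distribution and approximates $M_{X}'(0)$ and $M_{X}''(0)$ with central finite differences; the step size `h` and the helper names `mgf` and `moment` are illustrative choices, and the comparison is approximate rather than exact.

```python
import math

# Assumed example distribution: a fair six-sided die.
pmf = {x: 1 / 6 for x in range(1, 7)}

def mgf(t, pmf):
    """M_X(t) = sum over x of e^{t x} f_X(x) for a discrete random variable."""
    return sum(math.exp(t * x) * p for x, p in pmf.items())

def moment(k, pmf):
    """E(X^k) computed directly from the PMF."""
    return sum(x ** k * p for x, p in pmf.items())

h = 1e-4  # finite-difference step (an arbitrary small value)

# Central-difference approximations of the first and second derivative at t = 0.
d1 = (mgf(h, pmf) - mgf(-h, pmf)) / (2 * h)
d2 = (mgf(h, pmf) - 2 * mgf(0.0, pmf) + mgf(-h, pmf)) / h ** 2

print(d1, moment(1, pmf))  # both close to E(X) = 3.5
print(d2, moment(2, pmf))  # both close to E(X^2) = 91/6
```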

**Notice** that
1. The MGF of $X$ may not exist: $E(e^{tX})$ may fail to be finite in any neighborhood of $t=0$. Since $M_{X}(0)=E(e^{0})=1$ always, the failure occurs at values $t\neq 0$ arbitrarily close to $0$, where the defining sum or integral diverges.
2. If $M_{X}(t)$ exists in a neighborhood of $t=0$, then the moments of $X$ of every order exist.
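
As an illustration of the first point, consider a CRV with PDF $f_{X}(x)=x^{-2}$ for $x\ge 1$ and $f_{X}(x)=0$ otherwise (an example chosen here, not taken from the text above). For any $t>0$,

$$E(e^{tX})=\int_{1}^{+\infty}\frac{e^{tx}}{x^2}\:dx=+\infty,$$

because the integrand grows without bound as $x\to+\infty$. The expectation is finite for $t\le 0$ but for no $t>0$, so it is not finite on any neighborhood of $t=0$ and the MGF does not exist. This agrees with the second point: here $E(X)=\int_{1}^{+\infty}x^{-1}\:dx$ already diverges, so not all moments exist.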

### Properties of MGF

The MGF has some properties that help us construct the MGF of more complex random variables.

#### Property 1 Linear Transformation

Suppose $Y=a+bX$, where $a$ and $b$ are constants, and the MGF of $X$, $M_{X}(t)$, exists in a neighborhood of $t=0$. Then the MGF of $Y$ is
$$M_{Y}(t)=e^{at}M_{X}(bt)$$

***Proof:*** By the definition of $M_{Y}(t)$ and substituting $Y=a+bX$,
$$\begin{aligned}
M_{Y}(t)&=E(e^{tY})=E[e^{t(a+bX)}]\\
&=E[e^{at+btX}]\\
&=E[e^{at}e^{btX}]\\
&=e^{at}E[e^{btX}]\\
&=e^{at}M_{X}(bt)
\end{aligned}$$
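
A quick numerical check of Property 1 is sketched below. The PMF of $X$ and the constants `a` and `b` are arbitrary illustrative values, and `mgf` is a hypothetical helper that evaluates the discrete case of Definition 13.

```python
import math

# Assumed example: X takes the values 0, 1, 2 with these probabilities.
pmf_x = {0: 0.2, 1: 0.5, 2: 0.3}
a, b = 2.0, 3.0  # arbitrary constants for Y = a + bX

# PMF of Y: each value x of X maps to a + b*x with the same probability.
pmf_y = {a + b * x: p for x, p in pmf_x.items()}

def mgf(t, pmf):
    """M(t) = sum over values v of e^{t v} * P(v)."""
    return sum(math.exp(t * v) * p for v, p in pmf.items())

for t in (-0.5, 0.0, 0.3, 1.0):
    lhs = mgf(t, pmf_y)                        # M_Y(t) from the definition
    rhs = math.exp(a * t) * mgf(b * t, pmf_x)  # e^{at} M_X(bt)
    print(t, lhs, rhs)
```

For each `t` the two printed values should agree up to floating-point rounding.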

#### Property 2 Convolution

Suppose $X$ and $Y$ are two independent random variables with MGFs $M_{X}(t)$ and $M_{Y}(t)$ respectively. Then the MGF of $Z=X+Y$ is
$$M_Z(t)=M_X(t)M_Y(t)$$

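***Proof:*** By the definition of $M_{Z}(t)$,
$$\begin{aligned}
M_Z(t)&=E(e^{tZ})\\
&=E[e^{t(X+Y)}]\\
&=E[e^{tX+tY}]\\
&=E[e^{tX}e^{tY}]
\end{aligned}$$
Since $X$ and $Y$ are independent, so are $e^{tX}$ and $e^{tY}$, and the expectation of a product of independent random variables is the product of their expectations. Hence
$$\begin{aligned}
M_Z(t)&=E[e^{tX}e^{tY}]=E[e^{tX}]E[e^{tY}]\\
&=M_{X}(t)M_{Y}(t)
\end{aligned}$$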
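
Property 2 can also be checked on a small discrete example. The sketch below assumes $X$ is a fair die and $Y$ is a fair coin taking values 0 and 1 (both illustrative choices); it builds the PMF of $Z=X+Y$ by enumerating all pairs under independence and then compares the MGFs. The helper name `mgf` is hypothetical.

```python
import math
from itertools import product

# Assumed example distributions.
pmf_x = {x: 1 / 6 for x in range(1, 7)}  # fair six-sided die
pmf_y = {0: 0.5, 1: 0.5}                 # fair coin coded as 0/1

# PMF of Z = X + Y: under independence P(X=x, Y=y) = P(X=x) * P(Y=y),
# and the probabilities of all pairs with the same sum are added up.
pmf_z = {}
for (x, px), (y, py) in product(pmf_x.items(), pmf_y.items()):
    pmf_z[x + y] = pmf_z.get(x + y, 0.0) + px * py

def mgf(t, pmf):
    """M(t) = sum over values v of e^{t v} * P(v)."""
    return sum(math.exp(t * v) * p for v, p in pmf.items())

for t in (-1.0, 0.0, 0.25, 0.8):
    print(t, mgf(t, pmf_z), mgf(t, pmf_x) * mgf(t, pmf_y))
```

The enumeration step is exactly the discrete convolution of the two PMFs, which is why the property carries this name.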

## Characteristic Function

We say the moment generating function of $X$, $M_X(t)$, exists if $M_{X}(t)$ is well defined in a neighborhood of $t=0$; for some random variables it does not exist. A more general function, well defined at every real $t$ for every random variable, is the **characteristic function**.

### Definition 14 Characteristic Function

The characteristic function of a random variable $X$ with cumulative distribution function $F_{X}(x)$ is defined as
$$\begin{aligned}
\phi_X(t)&=E(e^{\mathrm{i}tX})\\
&=\int_{-\infty}^{+\infty}e^{\mathrm{i}tx}dF_{X}(x)\\
&=\int_{-\infty}^{+\infty}e^{\mathrm{i}tx}f_{X}(x) \: dx,
\end{aligned}$$
where $\mathrm{i}=\sqrt{-1}$, the last line applies when $X$ is continuous with PDF $f_{X}(x)$, and
$$e^{\mathrm{i}tx}=\cos(tx)+\mathrm{i}\sin(tx)$$
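
The reason the characteristic function is defined at every real $t$ can be seen from a short bound. Since $|e^{\mathrm{i}tx}|=\sqrt{\cos^2(tx)+\sin^2(tx)}=1$ for all real $t$ and $x$,

$$|\phi_{X}(t)|=\left|E(e^{\mathrm{i}tX})\right|\le E\left(|e^{\mathrm{i}tX}|\right)=E(1)=1.$$

The integrand is bounded, so the expectation is always finite, whereas $e^{tx}$ in the MGF can grow without bound.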

The characteristic function $\phi_X(t)$ closely resembles the MGF $M_{X}(t)$ in the following ways:

***Theorem:*** Suppose the *k*-th moment of $X$ exists. Then the *k*-th derivative of $\phi_{X}(t)$ evaluated at $t=0$ is the *k*-th moment of $X$ multiplied by $\mathrm{i}$ to the power of *k*, that is,
$$\phi_{X}^{(k)}(0)=\mathrm{i}^k E(X^k)$$
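
A numerical sketch of this theorem is given below, again assuming a fair six-sided die as the distribution. It evaluates $\phi_{X}(t)$ with complex arithmetic and approximates the first two derivatives at $t=0$ by central finite differences; the step size `h` and the helper names `cf` and `moment` are illustrative assumptions.

```python
import cmath

# Assumed example distribution: a fair six-sided die.
pmf = {x: 1 / 6 for x in range(1, 7)}

def cf(t, pmf):
    """phi_X(t) = sum over x of e^{i t x} f_X(x) for a discrete random variable."""
    return sum(cmath.exp(1j * t * x) * p for x, p in pmf.items())

def moment(k, pmf):
    """E(X^k) computed directly from the PMF."""
    return sum(x ** k * p for x, p in pmf.items())

h = 1e-4  # finite-difference step (an arbitrary small value)

# Central-difference approximations of phi'(0) and phi''(0).
d1 = (cf(h, pmf) - cf(-h, pmf)) / (2 * h)
d2 = (cf(h, pmf) - 2 * cf(0.0, pmf) + cf(-h, pmf)) / h ** 2

print(d1, 1j * moment(1, pmf))         # both close to i * E(X)
print(d2, (1j ** 2) * moment(2, pmf))  # both close to -E(X^2)
print(abs(cf(2.7, pmf)))               # modulus never exceeds 1
```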

#### Property 1 Linear Transformation

Suppose $Y=a+bX$, where $a$ and $b$ are constants, and $X$ has the characteristic function $\phi_X(t)$. Then the characteristic function of $Y$ is
$$\phi_{Y}(t)=e^{\mathrm{i}at}\phi_{X}(bt)$$
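
The argument mirrors the proof of the corresponding MGF property, with $t$ replaced by $\mathrm{i}t$. A sketch of the steps:

$$\begin{aligned}
E(e^{\mathrm{i}tY})&=E[e^{\mathrm{i}t(a+bX)}]\\
&=E[e^{\mathrm{i}at}e^{\mathrm{i}(bt)X}]\\
&=e^{\mathrm{i}at}E[e^{\mathrm{i}(bt)X}]\\
&=e^{\mathrm{i}at}\phi_{X}(bt)
\end{aligned}$$

The factor $e^{\mathrm{i}at}$ does not depend on $X$, so it can be taken outside the expectation.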

#### Property 2 Convolution

Suppose $X$ and $Y$ are two independent random variables with characteristic functions $\phi_{X}(t)$ and $\phi_{Y}(t)$ respectively. Then the characteristic function of $Z=X+Y$ is
$$\phi_Z(t)=\phi_X(t)\phi_Y(t)$$
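
As with the MGF, the result follows from independence. A sketch of the steps:

$$\begin{aligned}
\phi_Z(t)&=E[e^{\mathrm{i}t(X+Y)}]\\
&=E[e^{\mathrm{i}tX}e^{\mathrm{i}tY}]\\
&=E[e^{\mathrm{i}tX}]E[e^{\mathrm{i}tY}]\\
&=\phi_X(t)\phi_Y(t)
\end{aligned}$$

The third line uses the fact that $e^{\mathrm{i}tX}$ and $e^{\mathrm{i}tY}$ are independent whenever $X$ and $Y$ are.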