# Moment, characteristic function and cumulant


---

## Table of contents
1. **Generating function**
2. **Moment**  
2.1. Definition of moment  
2.2. Connection with named properties of distributions  
2.3. Moment-generating function  
2.4. Getting n-th moment from the moment-generating function  
3. **Characteristic function**  
3.1. Definition of characteristic function  
3.2. Connection with moment-generating function  
3.3. Getting n-th moment from the characteristic function  
4. **Cumulant**  
4.1. Cumulant-generating function  
4.2. 0th, 1st and 2nd cumulant  

---

## 1. Generating function

When a sequence $a_0, a_1, ..., a_n, ...$ gives the coefficients of the power series expansion of a function $G$, as below,

$G(a_n;x) = \sum^\infty_{n=0}a_nx^n$

$G$ is called the ordinary generating function of the sequence $\{a_n\}$.

When the series instead has the following form, it is called the exponential generating function.

$EG(a_n;x) = \sum^\infty_{n=0}a_n\frac{x^n}{n!}$
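
As a small numerical illustration of the two definitions, take the constant sequence $a_n = 1$. Its ordinary generating function is $\frac{1}{1-x}$ (for $|x| < 1$) and its exponential generating function is $e^x$. The sketch below evaluates truncated versions of both sums. The helper names `ogf` and `egf`, the truncation at 60 terms and the evaluation point $x = 0.5$ are assumptions made only for this example.

```python
import math

def ogf(coeffs, x):
    # Truncated ordinary generating function: sum of a_n * x^n.
    return sum(a * x**n for n, a in enumerate(coeffs))

def egf(coeffs, x):
    # Truncated exponential generating function: sum of a_n * x^n / n!.
    return sum(a * x**n / math.factorial(n) for n, a in enumerate(coeffs))

ones = [1] * 60          # the sequence a_n = 1, truncated (illustrative choice)
x = 0.5                  # any |x| < 1 works for the ordinary series

ogf_value = ogf(ones, x)  # should be close to 1 / (1 - x)
egf_value = egf(ones, x)  # should be close to exp(x)

print(ogf_value, 1 / (1 - x))
print(egf_value, math.exp(x))
```

The same sequence gives two different functions, which is why the kind of generating function always has to be stated.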

---

## 2. Moment
### 2.1. Definition of moment
The $n$-th moment of a real-valued continuous function $f(x)$ of a real variable about a value $c$ is defined as below. 

$\mu^{(c)}_n = \int^\infty_{-\infty} (x-c)^n f(x) dx$

Usually, "the $n$-th moment" refers to the moment about $c=0$, also called the raw moment.

$\mu^{(0)}_n = \int^\infty_{-\infty} x^n f(x) dx$

If $f$ is the probability density function of a random variable $X$, then

$\mu^{(0)}_n = \int^\infty_{-\infty} x^n f(x) dx = E[X^n]$

**Central Moment**

In particular, the $n$-th moment about the mean $E[X]$ is called the $n$-th central moment.

Let: $\mu = \mu^{(0)}_1 (= E[X])$

$\mu_n = \int^\infty_{-\infty} (x-\mu)^n f(x) dx = E[(X-E[X])^n]$

**Standardised Moment**

The standardised moment of degree $n$ is the ratio of the $n$-th central moment to the $n$-th power of the standard deviation.

$\tilde{\mu}_n = \frac{\mu_n}{\sigma^n}$

where $\sigma^n = (E[(X-\mu)^2])^{\frac{n}{2}}$

**The 0-th moment of probability density function $p(x)$**

$\mu^{(c)}_0 = \int^\infty_{-\infty} (x-c)^0 p(x) dx = \int^\infty_{-\infty} p(x) dx = 1$
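
The definitions above can be checked numerically. The following is a minimal sketch, not a general-purpose integrator: it assumes the density $f(x) = e^{-x}$ on $x \ge 0$ (an exponential distribution with rate 1), uses a plain midpoint rule, and truncates the integral at an arbitrary upper limit of 50. The function names `integrate`, `pdf` and `moment` are hypothetical.

```python
import math

def integrate(g, a, b, steps=100_000):
    # Midpoint rule on [a, b]; accurate enough for this smooth integrand.
    h = (b - a) / steps
    return h * sum(g(a + (k + 0.5) * h) for k in range(steps))

def pdf(x):
    # Assumed example density: Exponential(rate=1), supported on x >= 0.
    return math.exp(-x)

def moment(n, c=0.0, upper=50.0):
    # n-th moment about c; the tail beyond `upper` is negligible here.
    return integrate(lambda x: (x - c) ** n * pdf(x), 0.0, upper)

total = moment(0)                 # 0-th moment, approximately 1
mean = moment(1)                  # first raw moment, approximately 1
variance = moment(2, c=mean)      # second central moment, approximately 1
sigma = math.sqrt(variance)
third_std = moment(3, c=mean) / sigma**3   # approximately 2
fourth_std = moment(4, c=mean) / sigma**4  # approximately 9

print(total, mean, variance, third_std, fourth_std)
```

For this density the exact values are known in closed form ($1, 1, 1, 2, 9$), so the output gives a direct check of the raw, central and standardised definitions.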

---

### 2.2. Connection with named properties of distributions

|Moment ordinal|Moment (Raw)|Central Moment|Standardised Moment|
|---|---|---|---|
|1|Mean|0|0|
|2|-|Variance|1|
|3|-|-|Skewness|
|4|-|-|(Non-excess) Kurtosis|

--- 

### 2.3. Moment-generating function
The moment-generating function of a random variable $X$ is defined as below.

$M_X(t) = E[e^{tX}]$

where $t \in \mathbb{R}$ ranges over the values for which the expectation is finite.

When ${\bf X}$ is a $d$-dimensional random vector ${\bf X} = (X_1,X_2,...,X_d)^T$ and ${\bf t}$ is a $d$-dimensional fixed vector, the formula uses ${\bf t}\cdot{\bf X} = {\bf t}^T{\bf X}$ instead of $tX$, as below.

$M_{\bf X}({\bf t}) = E[e^{{\bf t}^T{\bf X}}]$

**Proof that $M_X(t)$ is a generating function**

The series expansion of $e^{tX}$ is as follows.

$e^{tX} = 1 + tX + \frac{t^2}{2!}X^2 + \frac{t^3}{3!}X^3 + ... + \frac{t^n}{n!}X^n + ... = \sum_{n=0}^\infty \frac{t^n}{n!}X^n$

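Hence, provided the expectation and the infinite sum can be interchanged (for example, when $M_X(t)$ is finite in a neighbourhood of $t=0$),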

$E[e^{tX}] = E[\sum_{n=0}^\infty \frac{t^n}{n!}X^n] = \sum_{n=0}^\infty \frac{t^n}{n!}E[X^n]$

This has the form of the exponential generating function $EG(a_n;t) = \sum^\infty_{n=0}a_n\frac{t^n}{n!}$, where the sequence $\{a_n\}$ is the sequence of raw moments $E[X^0],E[X^1],E[X^2],...,E[X^n],...$
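
The expansion can be checked on a simple case. The sketch below assumes a fair six-sided die as the random variable (so every moment is a finite sum) and compares $E[e^{tX}]$ computed directly with the truncated series $\sum_n \frac{t^n}{n!}E[X^n]$. The names `raw_moment`, `mgf` and `mgf_series`, and the choice of 40 terms, are illustrative assumptions.

```python
import math

faces = [1, 2, 3, 4, 5, 6]   # assumed example: a fair six-sided die

def raw_moment(n):
    # E[X^n] for the die.
    return sum(k**n for k in faces) / len(faces)

def mgf(t):
    # E[e^{tX}] computed directly from the definition.
    return sum(math.exp(t * k) for k in faces) / len(faces)

def mgf_series(t, terms=40):
    # Truncated exponential generating function of the raw moments.
    return sum(t**n / math.factorial(n) * raw_moment(n) for n in range(terms))

t = 0.3
direct = mgf(t)
series = mgf_series(t)
print(direct, series)
```

The two numbers agree to floating-point precision, which is what the identity predicts when the series converges.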

---

### 2.4. Getting n-th moment from the moment-generating function

$\mu_n^{(0)} = E[X^n] = M_X^{(n)}(0) = \frac{d^nM_X}{dt^n}|_{t=0}$

This follows by differentiating the series term by term $n$ times and then setting $t=0$:

$M_X(t) = E[e^{tX}] = 1 + tE[X] + \frac{t^2}{2!}E[X^2] + \frac{t^3}{3!}E[X^3] + ... + \frac{t^n}{n!}E[X^n] + ...$

$\Longrightarrow \frac{d^nM_X}{dt^n} = E[X^n] + tE[X^{n+1}] + \frac{t^2}{2!}E[X^{n+2}] + ...$

$\Longrightarrow \frac{d^nM_X}{dt^n}|_{t=0} = E[X^n]$
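
As a rough numerical check of this rule, the sketch below assumes an exponential distribution with rate $\lambda = 2$, whose moment-generating function is $M_X(t) = \frac{\lambda}{\lambda - t}$ for $t < \lambda$, and approximates the first two derivatives at $t=0$ with central finite differences. The step size `h` is an arbitrary choice, and finite differences are used here only for illustration, not as a recommended way to compute moments.

```python
def mgf(t, lam=2.0):
    # Closed-form MGF of an Exponential(rate=lam) variable, valid for t < lam.
    return lam / (lam - t)

h = 1e-4  # finite-difference step (illustrative choice)

# Central differences for M'(0) and M''(0).
first_moment = (mgf(h) - mgf(-h)) / (2 * h)
second_moment = (mgf(h) - 2 * mgf(0.0) + mgf(-h)) / h**2

print(first_moment)   # close to E[X]   = 1 / lam   = 0.5
print(second_moment)  # close to E[X^2] = 2 / lam^2 = 0.5
```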

---

## 3. Characteristic function
### 3.1. Definition of characteristic function
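A key limitation of the moment-generating function is that it may not exist: for some random variables $X$ (for example, a Cauchy-distributed one) the expectation $E[e^{tX}]$ is infinite for every $t \neq 0$, and the moments themselves may be undefined. The characteristic function, in contrast, always exists because $|e^{itX}| = 1$.  
Given a random variable $X$, the characteristic function is defined as below.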

$\varphi_X(t) = E[e^{itX}]$

where $i = \sqrt{-1}$ and $t \in \mathbb{R}$

When ${\bf X}$ is a $d$-dimensional random vector ${\bf X} = (X_1,X_2,...,X_d)^T$ and ${\bf t}$ is a $d$-dimensional fixed vector, the formula uses ${\bf t}\cdot{\bf X} = {\bf t}^T{\bf X}$ instead of $tX$, as below.

$\varphi_{\bf X}({\bf t}) = E[e^{i{\bf t}^T{\bf X}}]$
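
To illustrate that the characteristic function exists even when the moment-generating function does not, the sketch below estimates $\varphi_X(t)$ from samples of a standard Cauchy variable, which has no moment-generating function but whose characteristic function is known to be $e^{-|t|}$. The sampling method (inverse transform via `tan`), the seed, the sample size and the name `empirical_cf` are assumptions for this example only.

```python
import cmath
import math
import random

rng = random.Random(0)  # fixed seed so the run is reproducible

# Standard Cauchy samples via the inverse CDF: tan(pi * (U - 1/2)).
samples = [math.tan(math.pi * (rng.random() - 0.5)) for _ in range(50_000)]

def empirical_cf(t):
    # Sample average of e^{itX}; each term has modulus 1, so the average
    # is always well defined and bounded by 1 in modulus.
    return sum(cmath.exp(1j * t * x) for x in samples) / len(samples)

t = 1.0
estimate = empirical_cf(t)
exact = math.exp(-abs(t))   # known characteristic function of the Cauchy
print(estimate, exact)
```

The estimate is a Monte Carlo average, so it only matches $e^{-|t|}$ up to sampling noise.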

---

### 3.2. Connection with moment-generating function
If a random variable $X$ has a moment-generating function $M_X(t)$, then

$\varphi_X(-it) = E[e^{i(-it)X}] = E[e^{tX}] = M_X(t)$
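
This identity can be checked numerically when $E[e^{zX}]$ is defined for complex $z$. The sketch below assumes a fair six-sided die, for which both functions are finite sums, and evaluates the characteristic function at the imaginary argument $-it$. The function names `mgf` and `cf` are hypothetical.

```python
import cmath
import math

faces = [1, 2, 3, 4, 5, 6]   # assumed example: a fair six-sided die

def mgf(t):
    # E[e^{tX}] for real t.
    return sum(math.exp(t * k) for k in faces) / len(faces)

def cf(z):
    # E[e^{izX}]; z is allowed to be complex in this finite-support example.
    return sum(cmath.exp(1j * z * k) for k in faces) / len(faces)

t = 0.3
via_cf = cf(-1j * t)   # characteristic function at -it
direct = mgf(t)        # moment-generating function at t
print(via_cf, direct)
```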

---

### 3.3. Getting n-th moment from the characteristic function  

If a random variable $X$ has moments up to $n$-th order, then

$\mu_n^{(0)} = E[X^n] = i^{-n} \varphi_X^{(n)}(0) = i^{-n} \frac{d^n\varphi_X}{dt^n}|_{t=0}$

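This can be seen formally by expanding $e^{itX}$ as a power series (assuming all moments exist and the series converges) and differentiating term by term: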

$\varphi_X(t) = E[e^{itX}] = 1 + itE[X] + \frac{(it)^2}{2!}E[X^2] + \frac{(it)^3}{3!}E[X^3] + ... + \frac{(it)^n}{n!}E[X^n] + ...$

$\Longrightarrow \frac{d^n\varphi_X}{dt^n} = i^n E[X^n] + i^{n+1} tE[X^{n+1}] + i^{n+2} \frac{t^2}{2!}E[X^{n+2}] + ...$

$\Longrightarrow \frac{d^n\varphi_X}{dt^n}|_{t=0} = i^n E[X^n]$

$\Longrightarrow i^{-n} \frac{d^n\varphi_X}{dt^n}|_{t=0}= i^{-n} i^n E[X^n] = E[X^n]$
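
The factor $i^{-n}$ is easy to get wrong, so a small check may help. The sketch below assumes an exponential distribution with rate $\lambda = 2$, whose characteristic function is $\varphi_X(t) = \frac{\lambda}{\lambda - it}$, approximates $\varphi_X'(0)$ and $\varphi_X''(0)$ with central finite differences, and then divides by $i$ and $i^2$. The step size `h` is an arbitrary illustrative choice.

```python
def cf(t, lam=2.0):
    # Characteristic function of an Exponential(rate=lam) variable.
    return lam / (lam - 1j * t)

h = 1e-4  # finite-difference step (illustrative choice)

# Central differences for the first and second derivatives at t = 0.
d1 = (cf(h) - cf(-h)) / (2 * h)
d2 = (cf(h) - 2 * cf(0.0) + cf(-h)) / h**2

# Multiply by i^{-n}, i.e. divide by i^n, and keep the real part.
first_moment = (d1 / 1j).real
second_moment = (d2 / 1j**2).real

print(first_moment)   # close to E[X]   = 1 / lam   = 0.5
print(second_moment)  # close to E[X^2] = 2 / lam^2 = 0.5
```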

---

## 4. Cumulant
### 4.1. Cumulant-generating function
The cumulants of a random variable $X$ are defined using the cumulant-generating function $K(t)$ defined as below.

$K(t) = \log M_X(t) = \log E[e^{tX}]$

Then the $n$-th cumulant $k_n$ is the coefficient of $\frac{t^n}{n!}$ in the power series expansion of the cumulant-generating function, as below.

$K(t) = \sum^\infty_{n=1}k_n\frac{t^n}{n!}$

$k_n = K^{(n)}(0) = \frac{d^nK}{dt^n}|_{t=0}$
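
As a worked example, take $X$ to be Poisson-distributed with mean $\lambda$ (a distribution chosen here only for illustration), using the standard result that its moment-generating function is $M_X(t) = e^{\lambda(e^t - 1)}$.

$K(t) = \log M_X(t) = \lambda(e^t - 1) = \lambda\sum^\infty_{n=1}\frac{t^n}{n!} = \sum^\infty_{n=1}\lambda\frac{t^n}{n!}$

Comparing coefficients with $K(t) = \sum^\infty_{n=1}k_n\frac{t^n}{n!}$ gives

$k_n = \lambda \,\,\, (n \geq 1)$

so every cumulant of the Poisson distribution equals $\lambda$.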

**Alternative**

Some authors instead define the cumulant-generating function as the natural logarithm of the characteristic function, which is sometimes called the second characteristic function:

$H(t) = \log\varphi_X(t) = \log E[e^{itX}] = \sum^\infty_{n=1}k_n\frac{(it)^n}{n!}$

---

### 4.2. 0th, 1st and 2nd cumulant
**0th cumulant**

$k_0 = K^{(0)}(0) = K(0)$

$= \log M_X(0) = \log E[e^{0\cdot X}] = \log E[1] = \log 1 = 0$

**1st cumulant**

$k_1 = K^{(1)}(0) = \frac{dK}{dt}|_{t=0} = \frac{d(\log M_X(t))}{dt}|_{t=0}$

Note: By the chain rule, for $f(x(t))$, $\frac{df}{dt} = \frac{df}{dx}\frac{dx}{dt}$. Here $f = \log$ and $x = M_X$.

$= (\frac{1}{M_X(t)}\frac{dM_X(t)}{dt})|_{t=0}$

Note: $M_X(0) = 1$ and $M_X'(0) = E[X]$

$= \frac{1}{1}E[X] = E[X]$

**2nd cumulant**

$k_2 = K^{(2)}(0) = \frac{d^2K}{dt^2}|_{t=0} = \frac{d^2(\log M_X(t))}{dt^2}|_{t=0}$

Note: By the chain and product rules, for $f(x(t))$, $\frac{d^2f}{dt^2} = \frac{d^2f}{dx^2}\cdot(\frac{dx}{dt})^2 + \frac{df}{dx}\cdot\frac{d^2x}{dt^2}$. Here $f = \log$ and $x = M_X$.

$=\{-\frac{1}{(M_X(t))^2}(\frac{dM_X(t)}{dt})^2 + \frac{1}{M_X(t)}\frac{d^2M_X(t)}{dt^2}\}|_{t=0}$ 

Note: $M_X''(0) = E[X^2]$

$= -\frac{1}{1}(E[X])^2 + \frac{1}{1}E[X^2]$

$= E[X^2] - (E[X])^2 = V[X]$
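
The two results $k_1 = E[X]$ and $k_2 = V[X]$ can be checked numerically. The sketch below assumes a fair six-sided die, builds $K(t) = \log M_X(t)$ from the definition, and approximates $K'(0)$ and $K''(0)$ with central finite differences. The names `mgf` and `cgf` and the step size `h` are illustrative assumptions.

```python
import math

faces = [1, 2, 3, 4, 5, 6]   # assumed example: a fair six-sided die

def mgf(t):
    # E[e^{tX}] for the die.
    return sum(math.exp(t * k) for k in faces) / len(faces)

def cgf(t):
    # Cumulant-generating function K(t) = log M_X(t).
    return math.log(mgf(t))

h = 1e-4  # finite-difference step (illustrative choice)

k1 = (cgf(h) - cgf(-h)) / (2 * h)                # approximates K'(0)
k2 = (cgf(h) - 2 * cgf(0.0) + cgf(-h)) / h**2    # approximates K''(0)

# Mean and variance computed directly, for comparison.
mean = sum(faces) / len(faces)
variance = sum((k - mean) ** 2 for k in faces) / len(faces)

print(k1, mean)       # both close to 3.5
print(k2, variance)   # both close to 35/12
```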