## Stochastic Process

A **stochastic process** is a family of random variables $\{X_t:t\in\mathcal{T}\}$, where each $X_t$ takes values in a state space, denoted $\mathbb{S}$. We call $\mathcal{T}$ the index set; it is usually interpreted as time.

### Various stochastic processes

1. Discrete time, discrete state: Markov chains

2. Discrete time, continuous state: auto-regressive model, Kalman filter, $X_{t+1}=AX_t+\epsilon_t$

3. Continuous time, discrete state: birth-death process

4. Continuous time, continuous state: Brownian motion, with increments $X_{t+\delta}-X_t\sim\mathcal{N}(0,\sigma_\delta^2)$, where $\sigma_\delta^2\rightarrow 0$ as $\delta\rightarrow 0$

## Markov chain

Returning to discrete-time, discrete-state stochastic processes: if we assume that $X_0,X_1,\cdots$ are independent, then

$$\mathbb{P}(X_{t+1}=j\mid X_0=i_0,\cdots,X_t=i_t)=\mathbb{P}(X_{t+1}=j)$$

In words, the past gives no information about the future, which is rarely realistic. At the other extreme, $X_{t+1}$ could depend on the entire past, but such a model quickly becomes cumbersome.

As a middle ground, we define a $k$-th order Markov chain by

$$\mathbb{P}(X_{t+1}=j\mid X_0=i_0,\cdots,X_t=i_t)=\mathbb{P}(X_{t+1}=j\mid X_{t-k+1}=i_{t-k+1},\cdots,X_t=i_t)$$

That is, conditional on $X_{t-k+1},\cdots,X_t$, the state $X_{t+1}$ is independent of $X_0,\cdots,X_{t-k}$.

The first-order Markov chain is the most widely used, because any $k$-th order Markov chain can be converted to a first-order one by taking the last $k$ states as the new state.

$$\mathbb{P}(X_{t+1}=j\mid X_0=i_0,\cdots,X_{t-1}=i_{t-1},X_t=i)=\mathbb{P}(X_{t+1}=j\mid X_t=i)\equiv P_{ij}(t)$$

$P_{ij}(t)$ is the one-step transition probability from state $i$ at time $t$ to state $j$ at time $t+1$.
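The conversion of a higher-order chain to a first-order one can be sketched for $k=2$ by taking pairs of consecutive states as the new state. The table `p2` and the helper `lift_to_first_order` below are hypothetical names with made-up probabilities, used only for illustration.

```python
from itertools import product

# Hypothetical second-order chain on states {0, 1}:
# p2[(a, b)][c] = P(X_{t+1} = c | X_{t-1} = a, X_t = b)
p2 = {
    (0, 0): [0.9, 0.1],
    (0, 1): [0.4, 0.6],
    (1, 0): [0.7, 0.3],
    (1, 1): [0.2, 0.8],
}


def lift_to_first_order(p2, n_states):
    """Build a first-order transition matrix on the pair states (a, b)."""
    pairs = list(product(range(n_states), repeat=2))
    index = {pair: row for row, pair in enumerate(pairs)}
    P = [[0.0] * len(pairs) for _ in pairs]
    for (a, b), probs in p2.items():
        for c, prob in enumerate(probs):
            # The pair (a, b) can only move to a pair of the form (b, c).
            P[index[(a, b)]][index[(b, c)]] = prob
    return pairs, P


pairs, P = lift_to_first_order(p2, 2)
for pair, row in zip(pairs, P):
    print(pair, row)
```

The lifted chain has $|\mathbb{S}|^k$ states, so the price of the conversion is a larger state space.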

\begin{definition}[Homogeneous Markov chain]

If, for all $i,j \in \mathbb{S}$, the transition probability $P_{ij}(t)$ of a Markov chain $\{X_t:t\in\mathcal{T}\}$ does not depend on the time $t$, then the Markov chain is called homogeneous, and we write simply $P_{ij}$.

\end{definition}

All Markov chains discussed below are homogeneous.

It is natural to place the transition probabilities of a Markov chain in a matrix. This is the **transition matrix**, denoted $\mathbf{P}$.

$$\mathbf{P}=\begin{bmatrix}
P_{00}&P_{01}&P_{02}&\cdots\\
P_{10}&P_{11}&P_{12}&\cdots\\
P_{20}&P_{21}&P_{22}&\cdots\\
\vdots&\vdots&\vdots&\ddots\\
\end{bmatrix}$$

We call a Markov chain **finite** if $\mathbb{S}$ consists of a finite number of states.

\begin{property}

Every row of the transition matrix sums to one:

$$\sum_jP_{ij}=\sum_j\mathbb{P}(X_{t+1}=j\mid X_t=i)=1$$

\end{property}
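As a minimal sketch, the row-sum property can be checked numerically and a homogeneous chain can be simulated directly from its transition matrix. The matrix `P` and the helper names `is_stochastic` and `sample_path` are illustrative assumptions, not part of the notes.

```python
import random


def is_stochastic(P, tol=1e-12):
    """Check that every entry is nonnegative and every row sums to one."""
    return all(
        all(p >= 0 for p in row) and abs(sum(row) - 1.0) <= tol
        for row in P
    )


def sample_path(P, start, steps, rng):
    """Simulate X_0, ..., X_steps of a homogeneous chain with matrix P."""
    path = [start]
    for _ in range(steps):
        current = path[-1]
        # Row `current` of P is the distribution of the next state.
        nxt = rng.choices(range(len(P)), weights=P[current])[0]
        path.append(nxt)
    return path


# Hypothetical three-state chain used only for illustration.
P = [
    [0.5, 0.5, 0.0],
    [0.25, 0.5, 0.25],
    [0.0, 0.0, 1.0],
]

print(is_stochastic(P))
print(sample_path(P, start=0, steps=10, rng=random.Random(0)))
```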

Let 

$$P_{ij}(n)=\mathbb{P}({X_{m+n}=j\mid X_m=i})$$

be the probability of going from state $i$ to state $j$ in $n$ steps, and let $\mathbf{P}_n$ be the $n$-step transition matrix, whose $(i,j)$ entry is $P_{ij}(n)$.

\begin{theorem}[The Chapman-Kolmogorov equations]

The $n$-step transition probabilities satisfy

$$P_{ij}(m+n)=\sum_kP_{ik}(m)P_{kj}(n)$$

\end{theorem}

\begin{proof}

Recall the chain rule of probability:

$$\mathbb{P}(X=x,Y=y\mid Z=z)=\mathbb{P}(X=x\mid Z=z)\mathbb{P}(Y=y\mid X=x,Z=z)$$

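and the law of total probability:

$$\mathbb{P}(X=x)=\sum_y\mathbb{P}(X=x,Y=y)$$

Applying the law of total probability, then the chain rule, and finally the Markov property together with homogeneity, we obtain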
\begin{align*}
P_{ij}(m+n) &= \mathbb{P}(X_{m+n}=j\mid X_0=i)\\
&=\sum_k \mathbb{P}(X_{m+n}=j,X_m=k\mid X_0=i)\\
&=\sum_k \mathbb{P}(X_{m+n}=j\mid X_m=k,X_0=i)\mathbb{P}(X_{m}=k\mid X_0=i)\\
&=\sum_k \mathbb{P}(X_{m+n}=j\mid X_m=k)\mathbb{P}(X_{m}=k\mid X_0=i)\\
&=\sum_kP_{ik}(m)P_{kj}(n)
\end{align*}

In matrix form, $\mathbf{P}_{m+n}=\mathbf{P}_{m}\mathbf{P}_{n}$, and hence $\mathbf{P}_{n}=\mathbf{P}^{n}$.
\end{proof}
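The matrix form of the Chapman-Kolmogorov equations can be checked numerically. The sketch below uses a hypothetical two-state matrix and plain-Python helpers `mat_mul` and `mat_pow`, which are illustrative names only.

```python
def mat_mul(A, B):
    """Multiply two matrices given as lists of rows."""
    return [
        [sum(A[i][k] * B[k][j] for k in range(len(B))) for j in range(len(B[0]))]
        for i in range(len(A))
    ]


def mat_pow(P, n):
    """Return the n-step transition matrix P^n (P^0 is the identity)."""
    size = len(P)
    result = [[float(i == j) for j in range(size)] for i in range(size)]
    for _ in range(n):
        result = mat_mul(result, P)
    return result


# Hypothetical two-state chain used only for illustration.
P = [
    [0.5, 0.5],
    [0.25, 0.75],
]

m, n = 2, 3
lhs = mat_pow(P, m + n)                     # P_{m+n}
rhs = mat_mul(mat_pow(P, m), mat_pow(P, n))  # P_m P_n
max_gap = max(abs(lhs[i][j] - rhs[i][j]) for i in range(2) for j in range(2))
print(max_gap)
```

Repeated multiplication is enough for small $n$; for large $n$, exponentiation by squaring would need fewer products.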

\begin{definition}[Absorbing states]

A state $j$ of a Markov chain is called **absorbing** if $P_{jj}=1$, i.e. if the chain reaches state $j$, it is stuck there forever.
\end{definition}
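Given a transition matrix, absorbing states can be read off the diagonal. A minimal sketch, assuming a hypothetical gambler's-ruin style matrix and an illustrative helper name `absorbing_states`:

```python
def absorbing_states(P, tol=1e-12):
    """Return all states j with P_jj = 1 (up to a small tolerance)."""
    return [j for j in range(len(P)) if abs(P[j][j] - 1.0) <= tol]


# Hypothetical chain on {0, 1, 2, 3}: states 0 and 3 are absorbing,
# the interior states move one step left or right with probability 0.5.
P = [
    [1.0, 0.0, 0.0, 0.0],
    [0.5, 0.0, 0.5, 0.0],
    [0.0, 0.5, 0.0, 0.5],
    [0.0, 0.0, 0.0, 1.0],
]

print(absorbing_states(P))
```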

\begin{definition}[Reaching probability]

The reaching probability $f_{ij}$ is the probability that the Markov chain, started in state $i$, reaches state $j$ at some time in the future.

$$f_{ij}=\mathbb{P}(X_n=j\text{ for some } n > 0 \mid X_0=i )$$

\end{definition}
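For a finite chain, the reaching probabilities can be approximated by conditioning on the first step, which gives $f_{ij}=P_{ij}+\sum_{k\neq j}P_{ik}f_{kj}$. The sketch below iterates this relation starting from zero, so that after $n$ iterations the value is the probability of reaching $j$ within $n$ steps. The example matrix, the iteration count, and the name `reaching_probabilities` are assumptions made for illustration.

```python
def reaching_probabilities(P, j, iterations=200):
    """Approximate f_ij for every start state i by first-step iteration."""
    n = len(P)
    f = [0.0] * n  # probability of reaching j within 0 steps (n > 0 required)
    for _ in range(iterations):
        # Either jump to j directly, or move to some k != j and reach j later.
        f = [
            P[i][j] + sum(P[i][k] * f[k] for k in range(n) if k != j)
            for i in range(n)
        ]
    return f


# Hypothetical gambler's-ruin chain on {0, 1, 2, 3} with absorbing ends.
P = [
    [1.0, 0.0, 0.0, 0.0],
    [0.5, 0.0, 0.5, 0.0],
    [0.0, 0.5, 0.0, 0.5],
    [0.0, 0.0, 0.0, 1.0],
]

f_to_3 = reaching_probabilities(P, j=3)
print(f_to_3)
```

For this symmetric example the exact values are $f_{13}=1/3$ and $f_{23}=2/3$, which the iteration approaches from below.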

\begin{definition}[Recurrent and transient]

If $f_{ii}=1$, then state $i$ is recurrent.

If $f_{ii}<1$, then state $i$ is transient.
\end{definition}

\begin{theorem}

Starting from state $i$, the expected total number of visits to state $j$ is

$$\mathbb{E}\big[\sum_{k=0}^\infty\mathbb{I}\{X_k=j\}\mid X_0=i\big] = \sum_{k=0}^\infty P_{ij}(k)$$

\end{theorem}

\begin{proof}

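Define the indicator

$$
y_k=\mathbb{I}\{X_k=j\}=
\begin{cases}
1, &\text{if the chain is in state } j \text{ at time } k \\
0, &\text{otherwise}
\end{cases}
$$

Since all terms are nonnegative, expectation and summation can be interchanged: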


\begin{align*}
\mathbb{E}\big[\sum_{k=0}^\infty\mathbb{I}\{X_k=j\}\mid X_0=i\big]&=\sum_{k=0}^\infty\mathbb{E}\big[y_k\mid X_0=i\big]\\
&=\sum_{k=0}^\infty\mathbb{P}\big(y_k=1\mid X_0=i\big)\\
&=\sum_{k=0}^\infty\mathbb{P}\big(X_k=j\mid X_0=i\big)\\
&=\sum_{k=0}^\infty P_{ij}(k)
\end{align*}

\end{proof}
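The sum $\sum_k P_{ij}(k)$ can be approximated by truncating it at a finite horizon. The following sketch propagates the distribution of $X_k$ step by step and accumulates the mass on state $j$. The matrix, the horizon, and the name `expected_visits` are illustrative assumptions.

```python
def expected_visits(P, i, j, horizon):
    """Partial sum of P_ij(k) for k = 0, ..., horizon."""
    n = len(P)
    dist = [1.0 if s == i else 0.0 for s in range(n)]  # distribution of X_0
    total = 0.0
    for _ in range(horizon + 1):
        total += dist[j]  # dist[j] equals P_ij(k) at step k
        dist = [sum(dist[s] * P[s][t] for s in range(n)) for t in range(n)]
    return total


# Hypothetical chain: states 0 and 1 are left eventually, state 2 is absorbing.
P = [
    [0.5, 0.5, 0.0],
    [0.0, 0.5, 0.5],
    [0.0, 0.0, 1.0],
]

print(expected_visits(P, i=0, j=0, horizon=200))
print(expected_visits(P, i=0, j=1, horizon=200))
```

In this example each of the states 0 and 1 is held for a geometric number of steps with mean 2, so both truncated sums approach 2.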



\begin{theorem}

A state $i$ is recurrent if and only if $\sum_{k=0}^\infty P_{ii}(k) = \infty$.

\end{theorem}

\begin{proof}

**Part I**

If state $i$ is recurrent, then a chain started in $i$ returns to $i$ with probability $1$. By the Markov property, once the chain is back in $i$ it forgets the past and restarts from state $i$. Repeating this argument, the chain visits $i$ infinitely often, so the expected number of visits to $i$ starting from $i$ is infinite. Hence,

$$\sum_{k=0}^\infty P_{ii}(k) =\infty$$

**Part II**

Start in state $i$. Every time the chain returns to state $i$, the process effectively restarts. Consider each visit to state $i$ as a new trial, as for a geometric random variable.

Call a trial a "failure" if the chain returns to $i$ afterwards and a "success" if it never returns, so that $\mathbb{P}(\text{success})=1-f_{ii}$. The number of trials needed to see the first "success" is the total number of visits to state $i$.

Therefore, 

$$\mathbb{E}\big[\text{number of visits to state } i\mid X_0=i\big]=\frac{1}{1-f_{ii}}$$

By the previous theorem, this expectation equals $\sum_{k=0}^\infty P_{ii}(k)$. Hence, if $i$ is transient, i.e. $f_{ii}<1$, then $\sum_{k=0}^\infty P_{ii}(k)=\frac{1}{1-f_{ii}}<\infty$. By contraposition, $\sum_{k=0}^\infty P_{ii}(k)=\infty$ implies that $i$ is recurrent.

\end{proof}
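The criterion can be illustrated numerically by comparing truncated sums of $P_{ii}(k)$ for a transient and a recurrent state. Both matrices and the helper name `return_probability_sum` below are hypothetical, and a truncated sum can only suggest, not prove, divergence.

```python
def return_probability_sum(P, i, horizon):
    """Partial sum of P_ii(k) for k = 0, ..., horizon."""
    n = len(P)
    dist = [1.0 if s == i else 0.0 for s in range(n)]
    total = 0.0
    for _ in range(horizon + 1):
        total += dist[i]
        dist = [sum(dist[s] * P[s][t] for s in range(n)) for t in range(n)]
    return total


# Transient example: from state 0 the chain returns directly with probability
# 0.25, or via state 1 with probability 0.5 * 0.5, so f_00 = 0.5.
transient = [
    [0.25, 0.5, 0.25],
    [0.5, 0.0, 0.5],
    [0.0, 0.0, 1.0],
]
f_00 = 0.25 + 0.5 * 0.5

# Recurrent example: two states that keep exchanging probability mass.
recurrent = [
    [0.5, 0.5],
    [0.5, 0.5],
]

transient_sum = return_probability_sum(transient, 0, 200)
recurrent_sums = [return_probability_sum(recurrent, 0, h) for h in (10, 100, 1000)]

print(transient_sum, 1.0 / (1.0 - f_00))
print(recurrent_sums)
```

The transient sum settles at $\frac{1}{1-f_{00}}$, while the recurrent sums keep growing with the horizon.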

For two different states $i$ and $j$, we say that state $j$ is accessible from $i$ if $f_{ij}>0$, denoted $i\to j$. If $i\to j$ and $j\to i$, then we say that $i$ and $j$ communicate, denoted $i\leftrightarrow j$. All states that communicate with each other form a class. An important property of Markov chains is that **every state belongs to exactly one class**.

\begin{definition}[Irreducible]
A Markov chain with only one class is called irreducible.
\end{definition}
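For a finite chain, $i\to j$ holds exactly when $j$ can be reached from $i$ along transitions of positive probability, so classes can be found by graph search. The sketch below assumes the usual convention that every state communicates with itself. The helper names and the example matrix are illustrative.

```python
def reachable_from(P, i):
    """States reachable from i along positive-probability transitions (plus i)."""
    seen = {i}
    stack = [i]
    while stack:
        s = stack.pop()
        for t, p in enumerate(P[s]):
            if p > 0 and t not in seen:
                seen.add(t)
                stack.append(t)
    return seen


def communicating_classes(P):
    """Partition the states into classes of mutually accessible states."""
    n = len(P)
    reach = [reachable_from(P, i) for i in range(n)]
    classes, assigned = [], set()
    for i in range(n):
        if i in assigned:
            continue
        cls = sorted(j for j in reach[i] if i in reach[j])
        classes.append(cls)
        assigned.update(cls)
    return classes


def is_irreducible(P):
    return len(communicating_classes(P)) == 1


# Hypothetical chain: states 0 and 1 communicate, state 2 is absorbing.
P = [
    [0.5, 0.5, 0.0],
    [0.5, 0.25, 0.25],
    [0.0, 0.0, 1.0],
]

print(communicating_classes(P))
print(is_irreducible(P))
```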

\begin{theorem}

If $i\leftrightarrow j$ and state $i$ is recurrent, then state $j$ is also recurrent.

\end{theorem}

\begin{proof}

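Since $i$ is recurrent, $f_{ii}=1$, and $i\leftrightarrow j$ implies $f_{ij}>0$ and $f_{ji}>0$.

Consider a chain that starts at $i$ and reaches $j$, which happens with positive probability. Because $i$ is recurrent, the chain must return to $i$ with probability $1$, even after visiting $j$. Hence $f_{ji}=1$.

Now consider a chain that starts at $i$. It returns to $i$ infinitely often, and on each excursion from $i$ it hits $j$ with the same positive probability, independently of the previous excursions. Hence the chain hits $j$ with probability $1$, i.e. $f_{ij}=1$.

Starting from $j$, the chain therefore reaches $i$ with probability $1$ and then returns to $j$ with probability $1$. Hence $f_{jj}=1$.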

\end{proof}

From this discussion, recurrence and transience are class properties: the states of a class are either all recurrent or all transient.