**Terminology**:
- *Randomized algorithms*: Algorithms that make use of random number generators. 

- *Aperiodic*: 
  - A state $s_i$ is aperiodic if its period is 1. The period is $d(s_i) := \gcd\{n\ge1: (P^n)_{i,i}>0\}$, the greatest common divisor of all step counts $n$ at which a return to $s_i$ has positive probability. Consequently, a return to $s_i$ is possible only after a multiple of $d(s_i)$ steps.
  - A Markov chain is aperiodic if all the states are aperiodic.

- *Stationary distribution $\pi$*: 
  - Def1. $\pi P = \pi$, where $P$ is the transition matrix of the Markov chain.
  - Def2. The distribution $\mu_n$ (at step $n$ of the Markov chain) converges in total variation (TV) to $\pi$, i.e., $\lim_{n\to\infty}d_{TV}(\mu_n, \pi) = 0$. For an irreducible, aperiodic chain this holds for any initial distribution.
- *Reversible distribution*: 
A probability distribution $\pi$ is said to be reversible for the Markov chain (or for its transition matrix $P$) if $\pi_i P_{i,j} = \pi_j P_{j,i}$ for all $i,j\in \{1,2,\ldots,k\}$. A Markov chain is reversible if it has at least one reversible distribution.
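
The two characterizations of $\pi$ can be checked numerically. The following is a minimal sketch, assuming NumPy is available; the 3-state matrix `P` and the helper names `stationary_distribution` and `total_variation` are made up for illustration and do not come from the book.

```python
import numpy as np

# Hypothetical 3-state transition matrix (rows sum to 1); irreducible and aperiodic.
P = np.array([
    [0.5, 0.5, 0.0],
    [0.25, 0.5, 0.25],
    [0.0, 0.5, 0.5],
])

def stationary_distribution(P):
    """Solve pi P = pi together with sum(pi) = 1 as a least-squares system."""
    k = P.shape[0]
    # Stack (P^T - I) pi = 0 with the normalization row sum(pi) = 1.
    A = np.vstack([P.T - np.eye(k), np.ones((1, k))])
    b = np.concatenate([np.zeros(k), [1.0]])
    pi, *_ = np.linalg.lstsq(A, b, rcond=None)
    return pi

def total_variation(mu, nu):
    """Total variation distance between two distributions on a finite set."""
    return 0.5 * np.abs(mu - nu).sum()

pi = stationary_distribution(P)

# Def1: pi is unchanged by one transition.
def1_holds = np.allclose(pi @ P, pi)

# Def2: start from a point mass on state 0 and record d_TV(mu_n, pi) for n = 0..49.
mu = np.array([1.0, 0.0, 0.0])
tv_distances = []
for _ in range(50):
    tv_distances.append(total_variation(mu, pi))
    mu = mu @ P

print(pi, def1_holds, tv_distances[0], tv_distances[-1])
```

For this particular matrix the recorded distances shrink toward 0, which is the behavior Def2 describes.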

# Stationary Distribution of a Markov Chain

## Why do we need "aperiodic"?
The stationary distribution is a central concept in Markov chain theory. For a discrete-time Markov chain on a finite state space, being "irreducible" and "aperiodic" implies that it has one and only one stationary distribution. 

Compared with irreducibility, aperiodicity is a less intuitive concept. 

For a state $s_i$, aperiodicity means $\gcd(N_i)=1$, where $N_i := \{n\ge1: (P^n)_{i,i}>0\}$. This does not necessarily mean that 1 is an element of $N_i$. In other words, aperiodicity alone does not give us $P_{i,i}>0$.

However, aperiodicity leads to Thm 4.1, which says "there exists an $N<\infty$ such that $(P^n)_{i,i}>0$ for all $n>N$". In other words, for every $n>N$, a chain started at state $i$ is back at state $i$ after exactly $n$ steps with positive probability. This is very nice, as it **gives hope for convergence as $n$ goes to infinity**. The proof of Thm 4.1 uses a lemma from number theory which guarantees that every $n$ beyond some $N$ belongs to the set $N_i$. 
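
To see both points at once (1 need not be in $N_i$, yet all large $n$ are), here is a small sketch. The 3-state chain is a hypothetical example chosen for illustration, and `return_times` is a helper name introduced here; it only inspects $n$ up to a cutoff `n_max`, so it illustrates the theorem rather than proving it.

```python
import numpy as np
from functools import reduce
from math import gcd

# Hypothetical chain: 0 -> 1, 1 -> 0 or 2 (prob. 1/2 each), 2 -> 0.
# No state has a self-loop, so P[i, i] = 0 for every i.
P = np.array([
    [0.0, 1.0, 0.0],
    [0.5, 0.0, 0.5],
    [1.0, 0.0, 0.0],
])

def return_times(P, i, n_max):
    """List every n in 1..n_max with (P^n)[i, i] > 0."""
    A = (P > 0).astype(int)          # only the support of P matters here
    An = np.eye(P.shape[0], dtype=int)
    times = []
    for n in range(1, n_max + 1):
        An = ((An @ A) > 0).astype(int)   # support of P^n
        if An[i, i]:
            times.append(n)
    return times

N_0 = return_times(P, 0, n_max=20)
period = reduce(gcd, N_0)
missing = [n for n in range(1, 21) if n not in N_0]

print("return times of state 0:", N_0)
print("period:", period, "| n <= 20 not in N_0:", missing)
```

State 0 can return in 2 steps (0, 1, 0) and in 3 steps (0, 1, 2, 0), so its period is 1 although a one-step return is impossible; every $n\ge2$ can be written as a sum of 2s and 3s, which is the number-theoretic idea behind the theorem.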

## Example: Aperiodic or periodic Markov chain?
To determine whether a Markov chain is aperiodic or not, there are some criteria that can be applied. 

Step 1. We can draw a transition diagram from the transition matrix if there are not too many states. With the diagram, we can easily tell whether the Markov chain is irreducible or not. 

Step 2. It can be shown that **if a Markov chain is irreducible, all of its states have the same period**. 
![Example 1. Transition diagrams of two Markov chains](https://miro.medium.com/max/2404/1*bDNsx76wQoE9uyWJby1zwQ@2x.png)
Take a look at the two MCs in the figure above. From the transition diagram, both chains are irreducible (as the states can all be reached from each other). 
Therefore, we can look at only one of their states for the full information of periodicity. 

For the chain on the left, state 1 can return to itself only after 2, 4, ... steps, so its period is 2. State 1 is periodic, and hence the whole chain is not aperiodic. Similarly, look at state 1 (in the center) of the chain on the right. It can return to itself only after 3, 6, ... steps, so its period is 3. The MC on the right is not aperiodic either. 
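
Steps 1 and 2 can also be carried out numerically. The sketch below is an illustration under stated assumptions: the two matrices are hypothetical stand-ins with periods 2 and 3 (they are not the chains from the figure), `is_irreducible` and `period` are names introduced here, and `period` only looks at return times up to `n_max`, so its answer is reliable only when `n_max` is large enough for the chain at hand.

```python
import numpy as np
from math import gcd

def is_irreducible(P):
    """True if every state can be reached from every other state."""
    k = P.shape[0]
    A = (P > 0).astype(int)
    # (I + A)^(k-1) has a positive (i, j) entry iff j is reachable
    # from i in at most k-1 steps.
    reach = np.linalg.matrix_power(np.eye(k, dtype=int) + A, k - 1)
    return bool((reach > 0).all())

def period(P, i, n_max=50):
    """gcd of all n <= n_max with (P^n)[i, i] > 0 (0 if no return is seen)."""
    A = (P > 0).astype(int)
    An = np.eye(P.shape[0], dtype=int)
    d = 0
    for n in range(1, n_max + 1):
        An = ((An @ A) > 0).astype(int)   # support of P^n
        if An[i, i]:
            d = gcd(d, n)                 # gcd(0, n) == n
    return d

# Stand-in with period 2: simple random walk on a cycle of 4 states.
walk_on_4_cycle = np.array([
    [0.0, 0.5, 0.0, 0.5],
    [0.5, 0.0, 0.5, 0.0],
    [0.0, 0.5, 0.0, 0.5],
    [0.5, 0.0, 0.5, 0.0],
])

# Stand-in with period 3: deterministic cycle 0 -> 1 -> 2 -> 0.
directed_3_cycle = np.array([
    [0.0, 1.0, 0.0],
    [0.0, 0.0, 1.0],
    [1.0, 0.0, 0.0],
])

for name, P in [("4-cycle walk", walk_on_4_cycle), ("3-cycle", directed_3_cycle)]:
    periods = [period(P, i) for i in range(P.shape[0])]
    print(name, "| irreducible:", is_irreducible(P), "| periods:", periods)
```

In both stand-ins every state reports the same period, in line with the claim in Step 2.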

A useful tip: **If a Markov chain is irreducible and has a state $i$ such that $P_{i,i}>0$, then it is aperiodic.** (adapted from Problem 4.2 in the book)

*Proof*: $P_{i,i} > 0$ means the chain can return to state $i$ from state $i$ in one step, so 1 belongs to $\{n\ge1: (P^n)_{i,i}>0\}$ and the period of $i$, the gcd of this set, is 1. As mentioned in Step 2, all states of an irreducible MC have the same period, and thus the chain is aperiodic. 
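
A quick numerical illustration of the tip, as a sketch only: both matrices are hypothetical, the helper names are introduced here, and return times are inspected only up to `n_max`. A self-loop is added at a single state of a period-3 chain, and the periods of all states are compared before and after.

```python
import numpy as np

# Hypothetical chain: the deterministic cycle 0 -> 1 -> 2 -> 0 (period 3).
cycle = np.array([
    [0.0, 1.0, 0.0],
    [0.0, 0.0, 1.0],
    [1.0, 0.0, 0.0],
])

# Same chain, but state 2 now stays put with probability 1/2 (P[2, 2] > 0).
with_self_loop = np.array([
    [0.0, 1.0, 0.0],
    [0.0, 0.0, 1.0],
    [0.5, 0.0, 0.5],
])

def return_times(P, i, n_max=12):
    """All n in 1..n_max with (P^n)[i, i] > 0."""
    return [n for n in range(1, n_max + 1)
            if np.linalg.matrix_power(P, n)[i, i] > 0]

def period(P, i, n_max=12):
    """gcd of the return times found up to n_max."""
    return int(np.gcd.reduce(return_times(P, i, n_max)))

print([period(cycle, i) for i in range(3)])
print([period(with_self_loop, i) for i in range(3)])
print(return_times(with_self_loop, 0))
```

Note that states 0 and 1 still have no self-loop in the second chain; their period drops to 1 only because they share the period of state 2.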

## Reversible and Stationary
We have seen the definition of "stationary" distribution and "reversible" distribution in the terminology part, but it is not that obvious why a reversible distribution is also a stationary one for the Markov chain. Let's prove it here. 

A distribution $\pi$ being stationary means that the distribution remains the same after a transition. Mathematically, $\pi P=\pi$. We can also write it element-wise: $\sum_i \pi_{i} P_{i,j} = \pi_j$ for every $j$. NOTE: If $\pi$ were not stationary, the right-hand side of the equation would not be $\pi$ but a different distribution. 

If $\pi$ is reversible, we have $\pi_i P_{i,j} = \pi_j P_{j,i}$ for all $i,j$. Summing both sides over $i$ gives $$\sum_i \pi_i P_{i,j} = \sum_i \pi_j P_{j,i} = \pi_j \sum_i P_{j,i} = \pi_j,$$ where the last step uses the fact that each row of $P$ sums to 1. Therefore, $\pi$ is also a stationary distribution. 
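
The argument can be sanity-checked on a concrete chain. The sketch below assumes a random walk on a weighted undirected graph, a standard way to build a reversible chain; the weights `W`, the matrix `Q`, and the helper names are hypothetical and chosen for illustration.

```python
import numpy as np

# Hypothetical symmetric edge weights of an undirected graph on 4 states.
W = np.array([
    [0, 2, 1, 0],
    [2, 0, 3, 1],
    [1, 3, 0, 4],
    [0, 1, 4, 0],
], dtype=float)

degree = W.sum(axis=1)
P = W / degree[:, None]        # step along an edge with prob. proportional to its weight
pi = degree / degree.sum()     # candidate reversible distribution

def is_reversible(pi, P):
    """Check pi_i P_ij == pi_j P_ji for all i, j."""
    flow = pi[:, None] * P     # flow[i, j] = pi_i * P_ij
    return bool(np.allclose(flow, flow.T))

def is_stationary(pi, P):
    """Check pi P == pi."""
    return bool(np.allclose(pi @ P, pi))

print(is_reversible(pi, P), is_stationary(pi, P))

# The converse fails in general: a biased walk around a 3-cycle has the
# uniform distribution as stationary distribution, but it is not reversible.
Q = np.array([
    [0.0, 0.9, 0.1],
    [0.1, 0.0, 0.9],
    [0.9, 0.1, 0.0],
])
uniform = np.full(3, 1 / 3)
print(is_stationary(uniform, Q), is_reversible(uniform, Q))
```

Here `flow` equals `W` divided by the total weight, so its symmetry is inherited directly from the symmetry of `W`.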

```python
import BeautifulSoup
```