# Policy Gradient Methods

A limiting distribution is closely related to a stationary distribution but with a subtle difference. Let's explore the concept:

Definition:
The limiting distribution of a Markov chain is the long-term probability distribution of the chain, regardless of the initial state, as the number of steps approaches infinity.

Mathematically, for a Markov chain with transition matrix P, if the limit exists, the limiting distribution π is defined as:

π = lim(n→∞) μPⁿ

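where μ is the initial probability distribution, written as a row vector. For π to count as the limiting distribution, the limit must be the same for every choice of μ.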
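The definition can be checked numerically by applying μ ← μP many times from different starting distributions. The sketch below is plain Python; the helper names `step` and `distribution_after` and the 3-state matrix are hypothetical choices made for illustration only.

```python
def step(mu, P):
    """Return the row vector mu P for a distribution mu and transition matrix P."""
    n = len(P)
    return [sum(mu[i] * P[i][j] for i in range(n)) for j in range(n)]

def distribution_after(mu, P, n_steps):
    """Return mu P^n by applying one transition n_steps times."""
    for _ in range(n_steps):
        mu = step(mu, P)
    return mu

# Hypothetical 3-state chain (an assumption for this sketch). Every entry is
# positive, so the chain is irreducible and aperiodic.
P = [[0.5, 0.25, 0.25],
     [0.2, 0.6, 0.2],
     [0.1, 0.3, 0.6]]

from_state_0 = distribution_after([1.0, 0.0, 0.0], P, 100)
from_state_2 = distribution_after([0.0, 0.0, 1.0], P, 100)
print(from_state_0)
print(from_state_2)
```

Both starting points should print essentially the same vector, which is what independence from μ means in practice.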

Key points about limiting distributions:

1. Convergence:
   It represents the probabilities of being in each state after a large number of transitions.

2. Relation to stationary distribution:
   - For finite, irreducible, and aperiodic Markov chains, the limiting distribution exists and is equal to the unique stationary distribution.
   - For periodic chains, the limiting distribution may not exist, even when a stationary distribution does.

3. Independence from initial state:
   If it exists, the limiting distribution is the same regardless of the starting state of the Markov chain.

4. Ergodic theorem:
   For ergodic Markov chains, the limiting distribution equals the long-run proportion of time spent in each state.

5. Calculation:
   It can be approximated by taking increasing powers of the transition matrix P: when the limit exists, every row of Pⁿ converges to π.

6. Existence:
   Not all Markov chains have a limiting distribution. For a finite state space, irreducibility together with aperiodicity is sufficient for one to exist.

7. Applications:
   Used in predicting long-term behavior of systems in various fields including economics, biology, and physics.
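The ergodic-theorem point above (long-run proportion of time equals the limiting distribution) can be illustrated with a short simulation. This is a minimal sketch: the function name `simulate_occupancy`, the fixed seed, and the 2-state matrix are assumptions chosen for illustration. For this particular matrix, solving πP = π by hand gives π = [5/6, 1/6].

```python
import random

def simulate_occupancy(P, start, n_steps, seed=0):
    """Simulate the chain and return the fraction of steps spent in each state."""
    rng = random.Random(seed)
    states = range(len(P))
    counts = [0] * len(P)
    state = start
    for _ in range(n_steps):
        counts[state] += 1
        # Draw the next state from row `state` of the transition matrix.
        state = rng.choices(states, weights=P[state])[0]
    return [c / n_steps for c in counts]

# Hypothetical ergodic 2-state chain (an assumption for this sketch).
P = [[0.9, 0.1],
     [0.5, 0.5]]

occupancy = simulate_occupancy(P, start=1, n_steps=200_000)
print(occupancy)
```

The printed fractions should land close to [5/6, 1/6] ≈ [0.833, 0.167], with the gap shrinking as `n_steps` grows.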

Differences from stationary distribution:
1. A stationary distribution always exists for finite, irreducible Markov chains, while a limiting distribution may not (e.g., for periodic chains).
2. The stationary distribution is defined by the equation πP = π, while the limiting distribution is defined as a limit of repeated matrix multiplication.
3. For some chains (like periodic ones), there can be a stationary distribution without a corresponding limiting distribution.
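
One reasoning step ties the two definitions together: whenever the limit exists, it automatically satisfies the stationarity equation. A short derivation, assuming a finite state space so that multiplying by the fixed matrix P commutes with the limit:

```latex
\pi P
  = \Bigl(\lim_{n\to\infty} \mu P^{n}\Bigr) P
  = \lim_{n\to\infty} \mu P^{n+1}
  = \pi
```

So every limiting distribution is also a stationary distribution. The converse can fail, which is exactly the periodic case in point 3.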

Example:
Consider a Markov chain with transition matrix:

P = [[0.7, 0.3],
     [0.4, 0.6]]

As n increases, Pⁿ converges to:

[[0.571, 0.429],
 [0.571, 0.429]]

So the limiting distribution is [0.571, 0.429], which is also the stationary distribution for this chain.
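
The convergence of Pⁿ in this example can be reproduced with a few lines of plain Python. The helpers `matmul` and `matrix_power` are hypothetical names written for this sketch; repeated multiplication is used for clarity rather than efficiency.

```python
def matmul(A, B):
    """Multiply two square matrices given as lists of rows."""
    n = len(A)
    return [[sum(A[i][k] * B[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def matrix_power(P, n):
    """Return P^n by repeated multiplication (fine for small n and small P)."""
    size = len(P)
    # Start from the identity matrix, which equals P^0.
    result = [[1.0 if i == j else 0.0 for j in range(size)] for i in range(size)]
    for _ in range(n):
        result = matmul(result, P)
    return result

P = [[0.7, 0.3],
     [0.4, 0.6]]

for n in (1, 2, 5, 20):
    rows = matrix_power(P, n)
    print(n, [[round(x, 3) for x in row] for row in rows])
```

By n = 20 both rows agree with [0.571, 0.429] to three decimals. The exact limit is [4/7, 3/7], which can be confirmed by checking that it satisfies πP = π.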

The difference between limiting distributions and stationary distributions is subtle but important. Let's compare them:

1. Definition:
   - Stationary distribution: A probability distribution π that satisfies πP = π, where P is the transition matrix.
   - Limiting distribution: The distribution that the Markov chain converges to as the number of steps approaches infinity, if such convergence occurs.

2. Existence:
   - Stationary distribution: Always exists for finite, irreducible Markov chains.
   - Limiting distribution: May not exist for all Markov chains, even when a stationary distribution does.

3. Calculation method:
   - Stationary distribution: Found by solving the equation πP = π, subject to Σπᵢ = 1.
   - Limiting distribution: Computed as lim(n→∞) μPⁿ, where μ is any initial distribution.

4. Periodicity impact:
   - Stationary distribution: Exists even for periodic chains.
   - Limiting distribution: Generally does not exist for periodic chains, because μPⁿ keeps oscillating for most initial distributions μ.

5. Convergence:
   - Stationary distribution: Describes an equilibrium state but doesn't imply convergence.
   - Limiting distribution: Implies convergence of the chain to a specific distribution.

6. Uniqueness:
   - Stationary distribution: Unique for finite, irreducible chains; a reducible chain can have more than one.
   - Limiting distribution: If it exists, it's unique and independent of the initial state.

7. Relation:
   - For finite, irreducible, aperiodic Markov chains, the limiting distribution exists and equals the unique stationary distribution.
   - For periodic chains, the limiting distribution may not exist. For reducible chains, the stationary distribution need not be unique, and any limit of μPⁿ can depend on μ.

8. Interpretation:
   - Stationary distribution: Can be interpreted as a stable state of the system.
   - Limiting distribution: Represents the long-term behavior of the system.
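
The calculation method for the stationary distribution (solve πP = π subject to Σπᵢ = 1) can be sketched as a small linear solve. The function `stationary_distribution` below is a hypothetical helper, not a library routine. It assumes a finite, irreducible chain so that the solution is unique, and it uses exact fractions to avoid rounding. The matrix is the two-state example from earlier in this discussion.

```python
from fractions import Fraction

def stationary_distribution(P):
    """Solve pi P = pi with sum(pi) = 1 by Gauss-Jordan elimination.

    Assumes a finite, irreducible chain, so the solution is unique.
    """
    n = len(P)
    # Row j of (P^T - I) encodes the j-th component of pi P = pi.
    # The last column of each row holds the right-hand side.
    A = [[Fraction(P[i][j]) - (1 if i == j else 0) for i in range(n)] + [Fraction(0)]
         for j in range(n)]
    # The equations are linearly dependent, so replace the last one
    # with the normalization constraint sum(pi) = 1.
    A[-1] = [Fraction(1)] * n + [Fraction(1)]

    for col in range(n):
        # Find a row with a nonzero entry in this column and move it into place.
        pivot = next(r for r in range(col, n) if A[r][col] != 0)
        A[col], A[pivot] = A[pivot], A[col]
        A[col] = [x / A[col][col] for x in A[col]]
        # Eliminate this column from every other row.
        for r in range(n):
            if r != col and A[r][col] != 0:
                factor = A[r][col]
                A[r] = [x - factor * y for x, y in zip(A[r], A[col])]
    return [A[i][n] for i in range(n)]

P = [[Fraction(7, 10), Fraction(3, 10)],
     [Fraction(4, 10), Fraction(6, 10)]]

pi = stationary_distribution(P)
print(pi)  # [Fraction(4, 7), Fraction(3, 7)]
```

The result 4/7 ≈ 0.571 and 3/7 ≈ 0.429 matches the limit of Pⁿ for the same matrix, as expected for an irreducible, aperiodic chain.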

Example illustrating the difference:

Consider a periodic Markov chain with two states and transition matrix:

P = [[0, 1],
     [1, 0]]

This chain alternates between states deterministically.

- Stationary distribution: π = [0.5, 0.5] (exists and satisfies πP = π)
- Limiting distribution: Does not exist. Starting from the first state, μPⁿ alternates between [1, 0] and [0, 1] forever, so it never settles on a single distribution.

This example shows a case where a stationary distribution exists, but a limiting distribution does not.
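
The periodic example can be traced step by step in plain Python. This is a minimal sketch; the helper `step` and the variable names are illustrative assumptions.

```python
def step(mu, P):
    """Return the row vector mu P."""
    n = len(P)
    return [sum(mu[i] * P[i][j] for i in range(n)) for j in range(n)]

P = [[0, 1],
     [1, 0]]

# Start with all probability mass on the first state and take five steps.
mu = [1.0, 0.0]
trajectory = [mu]
for _ in range(5):
    mu = step(mu, P)
    trajectory.append(mu)
print(trajectory)  # alternates between [1.0, 0.0] and [0.0, 1.0]

# The stationary distribution is left unchanged by one step.
pi = [0.5, 0.5]
print(step(pi, P))  # [0.5, 0.5]

# Averaging the six distributions above over time gives [0.5, 0.5].
time_average = [sum(d[j] for d in trajectory) / len(trajectory) for j in range(2)]
print(time_average)  # [0.5, 0.5]
```

The distribution itself never converges, but its time average does. That is consistent with reading π = [0.5, 0.5] as the long-run fraction of time spent in each state.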

In summary, while often identical for well-behaved Markov chains, stationary and limiting distributions can differ in their existence, calculation, and interpretation, especially for chains with special structures like periodicity or reducibility.