# I. Analytical Simplification Techniques

Make the math solvable on paper.

## 1. Independence Assumptions

Factor joint distributions:

$$
p(x_1,\dots,x_n)=\prod_i p(x_i)
$$

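Full independence (the i.i.d. assumption) underlies much of classical statistics; Naive Bayes applies the same factorization conditionally on the class label.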

## 2. Conditional Independence

Use graphical structure to factor the joint into small local terms, cutting the number of parameters.

Foundation of Bayesian Networks, HMMs.

## 3. Sufficient Statistics

Compress data without losing information about parameters.

Exponential families admit sufficient statistics of fixed dimension, regardless of sample size.
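As a minimal sketch of this compression, assume i.i.d. Gaussian data: the triple (count, sum, sum of squares) is enough to recover the maximum-likelihood mean and variance, and batches merge by adding their triples. The function names are illustrative, not from any library.

```python
def gaussian_stats(xs):
    """Summarize a batch by (count, sum, sum of squares)."""
    return len(xs), sum(xs), sum(x * x for x in xs)

def merge_stats(a, b):
    """Combine two batches by adding their statistics component-wise."""
    return tuple(u + v for u, v in zip(a, b))

def gaussian_mle(stats):
    """Maximum-likelihood mean and (biased) variance from the statistics alone."""
    n, s1, s2 = stats
    mean = s1 / n
    return mean, s2 / n - mean * mean

# Hypothetical data split into two batches; the raw values are never revisited.
batch_a = [1.0, 2.0]
batch_b = [3.0, 4.0]
merged = merge_stats(gaussian_stats(batch_a), gaussian_stats(batch_b))
mean, variance = gaussian_mle(merged)
print(merged, mean, variance)
```

The estimates computed from the merged statistics match those computed from the pooled raw data, which is the sense in which nothing about the parameters is lost.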

## 4. Conjugate Priors

Posterior has same functional form as prior.

Enables closed-form Bayesian updates.
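A small sketch of a closed-form update, assuming the standard Beta prior with a Bernoulli likelihood: the posterior is again a Beta whose parameters are the prior parameters plus the observed success and failure counts. The data and the helper name are hypothetical.

```python
def beta_bernoulli_update(alpha, beta, observations):
    """Return posterior Beta parameters after observing 0/1 outcomes."""
    successes = sum(observations)
    failures = len(observations) - successes
    return alpha + successes, beta + failures

# Hypothetical data: 7 successes in 10 trials, starting from a uniform Beta(1, 1) prior.
data = [1, 0, 1, 1, 0, 1, 1, 1, 0, 1]
post_alpha, post_beta = beta_bernoulli_update(1.0, 1.0, data)
posterior_mean = post_alpha / (post_alpha + post_beta)
print(post_alpha, post_beta, posterior_mean)
```

No integral is evaluated at any point; the update is pure counting.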

## 5. Change of Variables

Jacobian tricks to simplify distributions.

Basis of normalizing flows (modern revival).
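For reference, the standard change-of-variables formula, assuming an invertible, differentiable map $y = f(x)$ (the symbols $f$, $p_X$, $p_Y$ are notation introduced here):

$$
p_Y(y) = p_X\big(f^{-1}(y)\big)\,\left|\det \frac{\partial f^{-1}(y)}{\partial y}\right|
$$

Normalizing flows compose several such maps, choosing each so that the Jacobian determinant is cheap to evaluate.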

## 6. Moment Closure

Approximate an infinite hierarchy of coupled moment equations by truncating it at finite order and expressing higher moments through lower ones.

Used in physics, population models.

---

# II. Approximation via Optimization

Replace integration with optimization.

## 7. Maximum Likelihood Estimation (MLE)

Replace integration over parameters with an argmax over the likelihood.
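Written out for i.i.d. data $x_1,\dots,x_n$ and a parameter $\theta$ (notation assumed here), the estimator is:

$$
\hat\theta_{\mathrm{MLE}} = \arg\max_\theta \sum_{i=1}^{n} \log p(x_i \mid \theta)
$$

For a Gaussian with known variance, for example, setting the derivative to zero gives the sample mean.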

## 8. Maximum A Posteriori (MAP)

Bayesian inference → optimization: maximize the posterior (likelihood × prior) instead of integrating over it.

## 9. Laplace Approximation

Approximate posterior by a Gaussian around the mode.

One of the oldest asymptotic tools.
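In symbols, with $\hat\theta$ the posterior mode and $H$ the negative Hessian of the log-posterior at that mode (notation assumed here):

$$
p(\theta \mid x) \approx \mathcal{N}\big(\theta;\ \hat\theta,\ H^{-1}\big),
\qquad
H = -\nabla_\theta^2 \log p(\theta \mid x)\,\Big|_{\theta=\hat\theta}
$$

This is a second-order Taylor expansion of the log-posterior, so it is accurate only where the posterior is close to unimodal and locally quadratic.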

## 10. Saddle-Point Approximation

High-precision asymptotics for integrals.

Widely used in statistical physics.

---

# III. Sampling-Based Techniques (Monte Carlo World)

Replace integrals with averages.

## 11. Monte Carlo Integration

Law of Large Numbers turns expectation into arithmetic.
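A minimal sketch, assuming we want $E[X^2]$ for $X$ uniform on $[0, 1]$ (exact value $1/3$): draw samples and average. The helper name and the fixed seed are illustrative choices.

```python
import random

def monte_carlo_expectation(f, sampler, n_samples, rng):
    """Estimate E[f(X)] by averaging f over independent draws of X."""
    total = 0.0
    for _ in range(n_samples):
        total += f(sampler(rng))
    return total / n_samples

rng = random.Random(0)  # fixed seed so the run is repeatable
estimate = monte_carlo_expectation(
    f=lambda x: x * x,
    sampler=lambda r: r.random(),  # X ~ Uniform[0, 1)
    n_samples=100_000,
    rng=rng,
)
print(estimate)  # close to 1/3; error shrinks like 1/sqrt(n_samples)
```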

## 12. Importance Sampling

Reweight samples from easier distributions.
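A sketch of the reweighting, assuming the goal is the rare-event probability $P(X > 4)$ for a standard normal $X$. Sampling from $N(4, 1)$ puts most draws in the tail, and each draw is weighted by the density ratio target/proposal. The proposal choice and function names are assumptions made for illustration.

```python
import math
import random

def normal_logpdf(x, mean, std):
    """Log-density of a univariate normal distribution."""
    z = (x - mean) / std
    return -0.5 * z * z - math.log(std) - 0.5 * math.log(2.0 * math.pi)

def importance_estimate(f, target_logpdf, proposal_logpdf, proposal_sampler, n_samples, rng):
    """Estimate E_target[f(X)] using draws from the proposal, reweighted by target/proposal."""
    total = 0.0
    for _ in range(n_samples):
        x = proposal_sampler(rng)
        weight = math.exp(target_logpdf(x) - proposal_logpdf(x))
        total += weight * f(x)
    return total / n_samples

rng = random.Random(0)
tail_estimate = importance_estimate(
    f=lambda x: 1.0 if x > 4.0 else 0.0,
    target_logpdf=lambda x: normal_logpdf(x, 0.0, 1.0),
    proposal_logpdf=lambda x: normal_logpdf(x, 4.0, 1.0),
    proposal_sampler=lambda r: r.gauss(4.0, 1.0),
    n_samples=100_000,
    rng=rng,
)
exact_tail = 0.5 * math.erfc(4.0 / math.sqrt(2.0))  # closed form for comparison
print(tail_estimate, exact_tail)
```

Plain Monte Carlo with the same budget would see only a handful of samples beyond 4, so its relative error would be far larger.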

## 13. Rejection Sampling

Sample from complex distributions using envelopes.
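A minimal sketch, assuming an unnormalized target $x(1-x)$ on $[0, 1]$ (a Beta(2, 2) shape) and a flat envelope of height 0.25, its maximum: propose uniformly, then keep the point if a uniform height under the envelope falls below the target. Names and the seed are illustrative.

```python
import random

def rejection_sample(unnorm_density, envelope_height, n_samples, rng):
    """Draw samples on [0, 1] from an unnormalized density bounded by envelope_height."""
    samples = []
    while len(samples) < n_samples:
        x = rng.random()                    # proposal: Uniform[0, 1)
        u = rng.random() * envelope_height  # uniform height under the envelope
        if u <= unnorm_density(x):          # keep points that fall under the target curve
            samples.append(x)
    return samples

rng = random.Random(0)
samples = rejection_sample(lambda x: x * (1.0 - x), 0.25, 20_000, rng)
sample_mean = sum(samples) / len(samples)
print(sample_mean)  # Beta(2, 2) has mean 0.5
```

The acceptance rate is the area under the target divided by the area under the envelope, here about 2/3; a loose envelope wastes proposals.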

## 14. Markov Chain Monte Carlo (MCMC)

Sample without knowing normalization constant.

Common variants: Metropolis–Hastings, Gibbs sampling, Hamiltonian Monte Carlo (HMC), Langevin dynamics.
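A sketch of random-walk Metropolis, the simplest Metropolis–Hastings variant, assuming a 1-D target known only up to a constant (here the log-density $-(x-3)^2/2$). The step size, chain length, and burn-in are arbitrary illustrative choices.

```python
import math
import random

def metropolis_hastings(log_unnorm, x0, step, n_steps, rng):
    """Random-walk Metropolis sampler for a 1-D unnormalized log-density."""
    x, log_p = x0, log_unnorm(x0)
    chain = []
    for _ in range(n_steps):
        proposal = x + rng.gauss(0.0, step)  # symmetric Gaussian proposal
        log_p_new = log_unnorm(proposal)
        # The acceptance test uses only a density ratio, so the normalizer cancels.
        if rng.random() < math.exp(min(0.0, log_p_new - log_p)):
            x, log_p = proposal, log_p_new
        chain.append(x)  # a rejected move repeats the current state
    return chain

rng = random.Random(0)
chain = metropolis_hastings(lambda x: -0.5 * (x - 3.0) ** 2, 0.0, 1.0, 50_000, rng)
kept = chain[5_000:]  # discard burn-in
chain_mean = sum(kept) / len(kept)
chain_var = sum((x - chain_mean) ** 2 for x in kept) / len(kept)
print(chain_mean, chain_var)  # target has mean 3 and variance 1
```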

## 15. Sequential Monte Carlo (Particle Filters)

Track time-evolving distributions with weighted particles that are propagated, reweighted, and resampled at each step.

---

# IV. Variational & Functional Approximations

Turn inference into function fitting.

## 16. Variational Inference (VI)

Minimize KL divergence between a tractable approximation and the posterior instead of computing integrals.
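The identity behind this, with latent variables $z$, data $x$, and approximating distribution $q(z)$ (notation assumed here):

$$
\log p(x) = \underbrace{\mathbb{E}_{q(z)}\big[\log p(x, z) - \log q(z)\big]}_{\mathrm{ELBO}(q)} + \mathrm{KL}\big(q(z)\,\|\,p(z \mid x)\big)
$$

Since $\log p(x)$ does not depend on $q$, maximizing the ELBO is the same as minimizing the KL term, and the ELBO needs only the joint $p(x, z)$, never the intractable posterior.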

## 17. Mean-Field Approximation

Break dependencies into independent factors.

## 18. Expectation Propagation (EP)

Local moment matching instead of global KL.

## 19. Free Energy Minimization

Physics → statistics bridge.

The variational free energy is the negative ELBO.

---

# V. Transformational Representations

Change the space so probability becomes easy.

## 20. Fourier / Characteristic Functions

Convolution → multiplication.
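Concretely, with the characteristic function $\varphi_X(t) = \mathbb{E}\big[e^{itX}\big]$ and independent $X$, $Y$ (notation assumed here):

$$
\varphi_{X+Y}(t) = \varphi_X(t)\,\varphi_Y(t)
$$

The density of $X + Y$ is the convolution of the two densities, so a convolution integral becomes a pointwise product in the transformed space.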

## 21. Generating Functions

Encode distributions into algebraic objects.

## 22. Laplace Transforms

Differential equations → algebraic equations.

## 23. Probability Generating Functions

Make discrete distributions tractable: moments come from derivatives at z = 1, sums of independent variables from products.
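A small sketch, assuming a distribution on $0, 1, 2, \dots$ is stored as the coefficient list of its generating function $G(z) = \sum_k p_k z^k$. Multiplying two such polynomials gives the distribution of the sum of independent variables; the two-dice example and function name are illustrative.

```python
from fractions import Fraction

def pgf_product(pmf_a, pmf_b):
    """Multiply two generating functions given as coefficient lists (index = outcome)."""
    out = [Fraction(0)] * (len(pmf_a) + len(pmf_b) - 1)
    for i, a in enumerate(pmf_a):
        for j, b in enumerate(pmf_b):
            out[i + j] += a * b
    return out

# Fair six-sided die: coefficients of z^0 .. z^6 (outcome 0 has probability 0).
die = [Fraction(0)] + [Fraction(1, 6)] * 6
two_dice = pgf_product(die, die)

# The mean is G'(1), i.e. the sum of k * p_k.
mean_total = sum(k * p for k, p in enumerate(two_dice))
print(two_dice[7], mean_total)
```

Exact rational arithmetic is used so the coefficients can be read off without rounding.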

---

# VI. Structural Decomposition

Exploit structure instead of brute force.

## 24. Graphical Models

Factorization via graphs.

Bayesian networks (directed), Markov random fields (undirected).

## 25. Message Passing Algorithms

Belief propagation: sum-product for marginals, max-product for the most probable assignment.

## 26. Dynamic Programming

HMM forward-backward (posterior marginals), Viterbi algorithm (most probable state path).
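A sketch of the shared recursion, assuming a small discrete HMM with made-up parameters. With `sum` as the reduction, the function is the forward pass and returns the likelihood of the observations; with `max` it returns the probability of the best single state path, which is the quantity Viterbi maximizes. The function name and all numbers are hypothetical.

```python
def hmm_dp(init, trans, emit, observations, reduce_fn=sum):
    """Run the HMM recursion over time, reducing over previous states with reduce_fn."""
    n_states = len(init)
    # alpha[s]: reduced probability of all paths that end in state s at the current time.
    alpha = [init[s] * emit[s][observations[0]] for s in range(n_states)]
    for obs in observations[1:]:
        alpha = [
            reduce_fn(alpha[prev] * trans[prev][s] for prev in range(n_states)) * emit[s][obs]
            for s in range(n_states)
        ]
    return reduce_fn(alpha)

# Hypothetical 2-state, 2-symbol model.
init = [0.6, 0.4]
trans = [[0.7, 0.3], [0.4, 0.6]]  # trans[i][j] = P(next = j | current = i)
emit = [[0.9, 0.1], [0.2, 0.8]]   # emit[s][o] = P(observation = o | state = s)
observations = [0, 1, 0]

likelihood = hmm_dp(init, trans, emit, observations, reduce_fn=sum)
best_path_prob = hmm_dp(init, trans, emit, observations, reduce_fn=max)
print(likelihood, best_path_prob)
```

The cost is linear in sequence length and quadratic in the number of states, instead of exponential for enumerating every path.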

---

# VII. Asymptotic & Limit Theorems

Let infinity do the work.

## 27. Law of Large Numbers

Random → deterministic: sample averages converge to expectations.

## 28. Central Limit Theorem

Normalized sums of many independent, finite-variance terms → Gaussian.

## 29. Large Deviations Theory

Exponentially small probabilities become analyzable.
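The prototype result is Cramér's theorem. For i.i.d. $X_1, X_2, \dots$ with a finite log moment generating function and a threshold $a > \mathbb{E}[X_1]$ (notation assumed here):

$$
\lim_{n\to\infty} \frac{1}{n} \log P\!\left(\frac{1}{n}\sum_{i=1}^{n} X_i \ge a\right) = -I(a),
\qquad
I(a) = \sup_{t}\Big[\,t a - \log \mathbb{E}\big[e^{tX_1}\big]\Big]
$$

The tail probability decays like $e^{-nI(a)}$, and the rate $I(a)$ is found by a one-dimensional optimization.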

---

# VIII. Continuous-Time & Differential Methods

Probability as dynamics.

## 30. Stochastic Differential Equations (SDEs)

Replace distributions with stochastic flows.
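A sketch of simulating such a flow with the Euler–Maruyama scheme, assuming the Ornstein–Uhlenbeck SDE $dX = -\theta X\,dt + \sigma\,dW$ as the example. The parameter values, step size, and function name are illustrative.

```python
import math
import random

def euler_maruyama_ou(theta, sigma, x0, dt, n_steps, rng):
    """Simulate one path of dX = -theta * X dt + sigma dW and return its endpoint."""
    x = x0
    for _ in range(n_steps):
        dw = rng.gauss(0.0, math.sqrt(dt))  # Brownian increment over one step
        x += -theta * x * dt + sigma * dw
    return x

rng = random.Random(0)
# 2000 independent paths, each run to time T = 250 * 0.02 = 5.
endpoints = [euler_maruyama_ou(1.0, 1.0, 0.0, 0.02, 250, rng) for _ in range(2000)]
mean_end = sum(endpoints) / len(endpoints)
var_end = sum((x - mean_end) ** 2 for x in endpoints) / len(endpoints)
print(mean_end, var_end)  # stationary law is N(0, sigma^2 / (2 * theta)) = N(0, 0.5)
```

The distribution is never written down; it is represented by the ensemble of simulated endpoints.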

## 31. Fokker–Planck Equations

Track density evolution deterministically.
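In one dimension, for the SDE $dX_t = \mu(X_t, t)\,dt + \sigma(X_t, t)\,dW_t$ with density $p(x, t)$ (notation assumed here), the equation reads:

$$
\frac{\partial p}{\partial t} = -\frac{\partial}{\partial x}\big[\mu(x,t)\,p(x,t)\big] + \frac{1}{2}\frac{\partial^2}{\partial x^2}\big[\sigma^2(x,t)\,p(x,t)\big]
$$

The first term transports the density along the drift and the second spreads it by diffusion.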

## 32. Score Matching

Learn the gradient of the log-density (the score) instead of the density; the normalization constant drops out.
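For a model score $s_\theta(x) \approx \nabla_x \log p(x)$, the classical objective (due to Hyvärinen; notation assumed here) can be written, up to a constant that does not depend on $\theta$, without ever evaluating the data score:

$$
J(\theta) = \mathbb{E}_{p(x)}\Big[\tfrac{1}{2}\,\|s_\theta(x)\|^2 + \operatorname{tr}\big(\nabla_x s_\theta(x)\big)\Big]
$$

Because $\nabla_x \log p(x)$ is unchanged when $p$ is rescaled, an unnormalized model can be trained this way.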

## 33. Diffusion & Reverse-Time Processes

Probability generation via dynamics.

---

# IX. Discretization & Relaxation

Trade exactness for computability.

## 34. Discretization of Continuous Spaces

Grids, bins, quantization.

## 35. Relaxation of Constraints

Replace hard constraints with penalties.

## 36. Surrogate Objectives

Lower bounds, upper bounds.

---

# X. Symmetry & Invariance Exploitation

Reduce degrees of freedom.

## 37. Exchangeability

Leads to the de Finetti representation: an infinite exchangeable sequence is a mixture of i.i.d. sequences.
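In its usual form, for an infinite exchangeable sequence and every $n$, with a mixing measure $\pi$ over a parameter $\theta$ (notation assumed here, regularity conditions omitted):

$$
p(x_1,\dots,x_n) = \int \prod_{i=1}^{n} p(x_i \mid \theta)\,\pi(d\theta)
$$

This is often read as a justification for the prior-plus-likelihood structure of Bayesian models.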

## 38. Stationarity & Ergodicity

Time averages = ensemble averages.

---

# XI. Learning-Based Tractability

Let models learn the probability for you.

## 39. Density Estimation Models

Autoregressive models, normalizing flows, energy-based models.

## 40. Implicit Models

GANs (sampling without likelihoods).

## 41. Score-Based Models

Avoid the normalized density; learn the score vector field ∇ log p(x).

---

# XII. Decision-Theoretic Collapse

Probability becomes action.

## 42. Bayes Risk Minimization

Choose the action that minimizes posterior expected loss; the full distribution collapses to a single decision.

## 43. Proper Scoring Rules

Turn distribution learning into optimization: the expected score is optimized by reporting the true distribution (log score, Brier score).
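A quick numerical check of propriety, assuming a binary outcome with true probability 0.3 and a forecast $q$ searched over a grid: both the expected log loss and the expected Brier score are minimized at $q = 0.3$. The function names and grid are illustrative.

```python
import math

def expected_log_loss(p, q):
    """Expected negative log score when the truth is Bernoulli(p) and the forecast is q."""
    return -(p * math.log(q) + (1.0 - p) * math.log(1.0 - q))

def expected_brier(p, q):
    """Expected Brier score (squared error) under the same setup."""
    return p * (1.0 - q) ** 2 + (1.0 - p) * q ** 2

true_p = 0.3
grid = [k / 100 for k in range(1, 100)]  # candidate forecasts 0.01 .. 0.99
best_log = min(grid, key=lambda q: expected_log_loss(true_p, q))
best_brier = min(grid, key=lambda q: expected_brier(true_p, q))
print(best_log, best_brier)
```

An improper rule such as expected absolute error would instead push the forecast to 0 or 1.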
