Here’s a clear, intuitive, and mathematically precise description of the **multinomial distribution**, grounded in the sources we retrieved.

---

# 🎲 The Multinomial Distribution: A Complete, Intuitive Overview

The **multinomial distribution** is the natural generalization of the **binomial distribution** when you have **more than two possible outcomes** in each trial. Instead of “success/failure,” you might have categories like *red/green/blue*, or *A/B/C/D*, or the faces of a die.

According to standard references, the multinomial distribution models the probability of obtaining specific counts for each of $k$ categories across $n$ independent trials, where each trial results in exactly one category and each category has a fixed probability.

---

## 🧠 When to Use It

Use the multinomial distribution when:

- You repeat the **same experiment** $n$ times  
- Each trial results in exactly one of $k$ **mutually exclusive outcomes**  
- The probability of each outcome is **fixed**  
- Each trial is **independent**  

Examples:

- Rolling a $k$-sided die $n$ times  
- Counting survey responses across multiple categories  
- Tracking colors drawn from an urn *with replacement*  

---

## 📐 Formal Definition

Let:

- $n$ = number of trials  
- $k$ = number of categories  
- $p_1, \dots, p_k$ = probabilities of each category (with $\sum p_i = 1$)  
- $X_1, \dots, X_k$ = counts of each category (with $\sum X_i = n$)

Then the probability of observing counts $x_1, \dots, x_k$ is:

$$
P(X_1 = x_1, \dots, X_k = x_k)
= \frac{n!}{x_1! \cdots x_k!} \, p_1^{x_1} \cdots p_k^{x_k}
$$

This formula appears in multiple sources, including Statology and Wikipedia.
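
As a rough sketch, the formula above can be evaluated directly with the Python standard library. The function name `multinomial_pmf` and the three-category example values are illustrative assumptions, not taken from any of the cited sources.

```python
from math import factorial, prod

def multinomial_pmf(counts, probs):
    """Evaluate the multinomial PMF directly from the formula above."""
    if len(counts) != len(probs):
        raise ValueError("counts and probs must have the same length")
    n = sum(counts)
    # Multinomial coefficient n! / (x_1! * ... * x_k!); each division is exact.
    coefficient = factorial(n)
    for x in counts:
        coefficient //= factorial(x)
    # Product of p_i ** x_i over all categories.
    likelihood = prod(p ** x for p, x in zip(probs, counts))
    return coefficient * likelihood

# Hypothetical example: 3 categories, 4 trials.
# By hand this is 4!/(2! 1! 1!) * 0.5^2 * 0.3 * 0.2 = 12 * 0.015.
print(multinomial_pmf([2, 1, 1], [0.5, 0.3, 0.2]))
```

Keeping the coefficient as an exact integer and multiplying by the floating-point term only at the end avoids rounding error in the factorials.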

---

## 🔗 Relationship to Other Distributions

The multinomial distribution connects to several familiar distributions:

| Special Case | Resulting Distribution |
|--------------|------------------------|
| $k = 2$, $n = 1$ | Bernoulli |
| $k = 2$, $n > 1$ | Binomial |
| $k > 2$, $n = 1$ | Categorical |

This makes the multinomial the “parent” of these common discrete distributions.
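
The $k = 2$ row can be checked numerically. The sketch below uses only the standard library; the helper names and the values $n = 10$, $p = 0.3$ are arbitrary choices for illustration.

```python
from math import comb, factorial

def binomial_pmf(x, n, p):
    """Binomial PMF: C(n, x) * p^x * (1 - p)^(n - x)."""
    return comb(n, x) * p**x * (1 - p) ** (n - x)

def two_category_multinomial_pmf(x1, x2, p1, p2):
    """Multinomial PMF specialised to k = 2 categories."""
    n = x1 + x2
    coefficient = factorial(n) // (factorial(x1) * factorial(x2))
    return coefficient * p1**x1 * p2**x2

n, p = 10, 0.3  # arbitrary illustrative values
# Largest disagreement between the two PMFs over every possible success count.
max_gap = max(
    abs(binomial_pmf(x, n, p) - two_category_multinomial_pmf(x, n - x, p, 1 - p))
    for x in range(n + 1)
)
print(max_gap)  # expected to be zero up to floating-point rounding
```

The two agree because $n!/(x_1!\,x_2!)$ with $x_2 = n - x_1$ is exactly the binomial coefficient.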

---

## 📊 Intuition: A Vector of Counts

Instead of a single number (like in the binomial), the multinomial gives you a **vector**:

$$
(X_1, X_2, \dots, X_k)
$$

This vector tells you how many times each category occurred. Statlect emphasizes that you can think of this vector as the sum of independent “Multinoulli” (categorical) trials.
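
One way to see the "sum of categorical trials" view is to simulate it. This is a minimal sketch using the standard library; `sample_multinomial`, the probabilities, and the seed are assumptions made for illustration.

```python
import random
from collections import Counter

def sample_multinomial(n, probs, rng):
    """Draw one count vector by tallying n independent categorical trials."""
    categories = range(len(probs))
    # Each trial picks exactly one category index with the given probabilities.
    draws = rng.choices(categories, weights=probs, k=n)
    tally = Counter(draws)
    # Report counts in category order, with zeros for categories never drawn.
    return [tally[i] for i in categories]

rng = random.Random(0)  # fixed seed, chosen arbitrarily for reproducibility
counts = sample_multinomial(10, [0.5, 0.3, 0.2], rng)
print(counts, sum(counts))  # the counts always sum to n
```

Because every trial lands in exactly one category, the entries of the vector are constrained to sum to $n$, which is why they are not independent of each other.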

---

## 🧩 Why It Matters

The multinomial distribution is foundational in:

- **Machine learning** (e.g., Naive Bayes classifiers)  
- **Natural language processing** (word counts)  
- **Genetics** (allele frequencies)  
- **Survey statistics**  
- **Bayesian inference** (Dirichlet–multinomial models)  

It’s one of those distributions that quietly powers a huge amount of modern statistical modeling.

---

## 📈 Example

Suppose you roll a fair 6-sided die $n = 10$ times.  
Each face has probability $p_i = \frac{1}{6}$.

What’s the probability of getting:

- 2 ones  
- 1 two  
- 3 threes  
- 0 fours  
- 2 fives  
- 2 sixes  

Plug into the PMF:

$$
\frac{10!}{2!1!3!0!2!2!}
\left(\frac{1}{6}\right)^{10}
$$
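
Working the arithmetic through by hand, the coefficient and the final probability come out as:

$$
\frac{10!}{2!\,1!\,3!\,0!\,2!\,2!} = \frac{3628800}{48} = 75600,
\qquad
\frac{75600}{6^{10}} = \frac{75600}{60466176} \approx 0.00125
$$

So this exact split of faces occurs in roughly 1 out of every 800 sequences of ten rolls.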

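The same value can be checked with SciPy:

```python
import scipy.stats as stats

x = [2, 1, 3, 0, 2, 2]  # observed counts for faces 1 through 6
p = [1/6, 1/6, 1/6, 1/6, 1/6, 1/6]  # fair die: equal probability per face
stats.multinomial.pmf(x, sum(x), p)
# np.float64(0.0012502857796067677)
```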