# 📊 Chapter 03: More Distributions and the Central Limit Theorem

You'll learn about the binomial distribution for modeling the probability of binary outcomes, and about one of the most important distributions in statistics, the normal distribution. You'll see how distributions can be described by their shape (skewness and kurtosis), and meet the Poisson distribution, which gives the probability of a number of events occurring in a fixed interval of time. You'll also gain an understanding of the central limit theorem.

## 🎯 🔸 Binomial Distribution
- **Used For**: Binary outcomes (e.g., coin flips, success/failure).
- **Parameters**:
  - `n`: number of independent trials  
  - `p`: probability of success
- **Expected Value**:  
  $$\text{Expected value} = n \times p$$
- **Key Point**: Requires **independent** trials, each with the same probability of success `p`.
- **Example Applications**:
  - Clinical drug trials (effective or not)
  - Sports betting (win or lose)
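
As a minimal sketch of the parameters above, the snippet below uses `scipy.stats.binom` to compute binomial probabilities and the expected value. The scenario (10 fair coin flips, so `n = 10` and `p = 0.5`) is an assumption chosen for illustration, not something from the course data.

```python
from scipy.stats import binom

# Hypothetical scenario: 10 independent flips of a fair coin
n, p = 10, 0.5

# P(exactly 7 heads)
p_exactly_7 = binom.pmf(7, n, p)

# P(7 or fewer heads)
p_at_most_7 = binom.cdf(7, n, p)

# P(more than 7 heads) is the complement of the CDF
p_more_than_7 = 1 - p_at_most_7

# Expected number of heads, equal to n * p
expected_heads = binom.mean(n, p)

print(f"P(X = 7)  = {p_exactly_7:.4f}")
print(f"P(X <= 7) = {p_at_most_7:.4f}")
print(f"P(X > 7)  = {p_more_than_7:.4f}")
print(f"E[X]      = {expected_heads}")
```

`pmf` answers "exactly k successes", while `cdf` answers "k or fewer"; subtracting the CDF from 1 gives "more than k".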

## 📈 🔸 Normal Distribution
- **Shape**: Symmetrical bell curve  
- **Total Area Under Curve**: 1 
- The curve approaches 0 in both tails but never reaches it
- Fully described by its mean and standard deviation
- **Empirical Rule** (approximate share of values around the mean):
  - 68% within 1 standard deviation
  - 95% within 2 standard deviations
  - 99.7% within 3 standard deviations
- **Importance**:
  - Many real-world datasets approximately follow this pattern.
  - Essential for hypothesis testing and many other statistical methods.
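
The empirical rule can be checked numerically. This is a small sketch using `scipy.stats.norm` on the standard normal distribution (mean 0, standard deviation 1); the variable names are illustrative.

```python
from scipy.stats import norm

# Share of a normal distribution within k standard deviations of the mean.
# norm.cdf defaults to the standard normal (loc=0, scale=1), so k is
# already measured in standard deviations.
within = {}
for k in (1, 2, 3):
    within[k] = norm.cdf(k) - norm.cdf(-k)
    print(f"within {k} standard deviation(s): {within[k]:.4f}")
```

Because any normal distribution can be rescaled to the standard normal, the same shares hold for every mean and standard deviation.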

### 🔹 ⚖️ Skewness
- **Definition**: Measures the **asymmetry** of a distribution.
- A distribution is:
  - **Symmetric** when skewness ≈ 0
  - **Positively skewed** (right tail longer) when skewness > 0

  ![Positive Skewness](./assets/positive.png)
  - **Negatively skewed** (left tail longer) when skewness < 0

  ![Negative Skewness](./assets/negative.png)
- **Example**: Income distribution is often positively skewed.
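
A quick way to see the sign of skewness is to compute it on synthetic samples. The sketch below uses `scipy.stats.skew`; the samples, sizes, and seed are arbitrary assumptions for illustration.

```python
import numpy as np
from scipy.stats import skew

rng = np.random.default_rng(42)  # arbitrary seed for reproducibility

# Synthetic samples (illustrative only)
symmetric = rng.normal(loc=0, scale=1, size=10_000)      # bell-shaped
right_skewed = rng.exponential(scale=1.0, size=10_000)   # long right tail
left_skewed = -right_skewed                              # mirrored: long left tail

skew_sym = skew(symmetric)
skew_right = skew(right_skewed)
skew_left = skew(left_skewed)

print(f"symmetric:    {skew_sym:.2f}")    # close to 0
print(f"right-skewed: {skew_right:.2f}")  # positive
print(f"left-skewed:  {skew_left:.2f}")   # negative
```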

### 🔹 🌋 Kurtosis
- **Definition**: Describes the **"tailedness"** or the frequency of extreme values.
- Types of kurtosis:
  - **Mesokurtic** (normal shape): moderate tails (kurtosis ≈ 3, i.e. excess kurtosis ≈ 0)
  - **Leptokurtic**: fatter tails (kurtosis > 3), more extreme values
  - **Platykurtic**: thinner tails (kurtosis < 3), fewer extremes

  ![Types of kurtosis](./assets/kurtosis.png)
  
- **Useful for**:
  - Identifying potential outliers
  - Understanding how prone a distribution is to extreme values (kurtosis reflects tail weight more than how "peaky" the center looks)
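
The three kurtosis types can be illustrated with synthetic samples. Note that `scipy.stats.kurtosis` returns *excess* kurtosis (normal ≈ 0) by default; passing `fisher=False` gives the scale used in these notes (normal ≈ 3). The choice of Laplace and uniform samples below is an assumption made for illustration.

```python
import numpy as np
from scipy.stats import kurtosis

rng = np.random.default_rng(0)  # arbitrary seed
size = 100_000

# Synthetic samples (illustrative only)
normal_sample = rng.normal(size=size)        # mesokurtic
laplace_sample = rng.laplace(size=size)      # leptokurtic: fatter tails
uniform_sample = rng.uniform(-1, 1, size)    # platykurtic: thinner tails

# fisher=False returns Pearson kurtosis, where a normal distribution scores 3
k_normal = kurtosis(normal_sample, fisher=False)
k_laplace = kurtosis(laplace_sample, fisher=False)
k_uniform = kurtosis(uniform_sample, fisher=False)

print(f"normal:  {k_normal:.2f}")   # close to 3
print(f"laplace: {k_laplace:.2f}")  # above 3
print(f"uniform: {k_uniform:.2f}")  # below 3
```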

## 🔄 🔸 Central Limit Theorem (CLT)
- **Definition**: The sampling distribution of a statistic such as the sample mean becomes closer to the normal distribution as the sample size increases.

  ![Sampling Distribution](./assets/sampling.png)
  
### 🧪 Requirements:
  - Random sampling
  - Independent observations
  - Sufficiently large sample size (n ≥ 30 is a decent rule of thumb)

### 🔄 CLT Applies To:
  - Means of samples
  - Proportions of samples

### 📐 What Gets Normalized?
  - Not your raw data: that can stay skewed, binary, or just plain strange.
  - Instead, it's the sampling distribution of the mean that becomes approximately normal. That's the keyword here: sampling distribution.
  - What matters is not the shape of what you measure, but how large each sample is and whether the samples are drawn randomly and independently.
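
The points above can be seen in a small simulation. This sketch assumes a synthetic, strongly right-skewed population (exponential) and repeatedly draws samples from it; the population, sample sizes, number of repetitions, and the helper `sample_means` are all illustrative choices.

```python
import numpy as np
from scipy.stats import skew

rng = np.random.default_rng(1)  # arbitrary seed

# Synthetic, right-skewed "raw data" (illustrative only)
population = rng.exponential(scale=2.0, size=100_000)

def sample_means(sample_size, n_samples=5_000):
    """Draw n_samples random samples and return the mean of each one."""
    samples = rng.choice(population, size=(n_samples, sample_size), replace=True)
    return samples.mean(axis=1)

means_5 = sample_means(5)
means_30 = sample_means(30)
means_200 = sample_means(200)

# The raw data stays skewed; the sampling distribution of the mean
# gets more symmetric (skewness closer to 0) as the sample size grows.
skew_population = skew(population)
skew_5, skew_30, skew_200 = skew(means_5), skew(means_30), skew(means_200)

print(f"population skewness:       {skew_population:.2f}")
print(f"means of n=5 skewness:     {skew_5:.2f}")
print(f"means of n=30 skewness:    {skew_30:.2f}")
print(f"means of n=200 skewness:   {skew_200:.2f}")
```

Plotting a histogram of each `means_*` array would show the same thing visually: the bell shape emerges in the sample means, not in `population`.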

### ✅ Benefits of the Central Limit Theorem (CLT)

1. **Normality of Sampling Distributions**  
   Sample means (or proportions) tend to form a **normal distribution**, even when the population distribution is not normal — as long as the sample size is large enough.

2. **Enables Inferential Statistics**  
   CLT underpins **confidence intervals** and **hypothesis testing**, allowing valid conclusions from sample data even without full population information.

3. **Simplifies Analysis**  
   Makes it possible to use **normal distribution-based methods** (z-scores, p-values, etc.) for real-world data that might otherwise be messy or complex.

4. **Improves Estimation**  
   Allows accurate estimation of **population parameters** (e.g., mean, proportion) from sample data, with quantifiable uncertainty.

5. **Reduces Random Variability**  
   Averages (sample means) are less variable than individual data points: their standard deviation (the standard error) shrinks as $\sigma / \sqrt{n}$, offering **more consistent and reliable estimates**.

6. **Works Across Many Distributions**  
   The CLT applies broadly, to **binomial**, **Poisson**, **exponential**, and other distributions with finite variance, making it a **widely applicable tool** in statistics.

7. **Effective with Modest Sample Sizes**  
   Often works well with samples of **n ≥ 30**, which is practical for many real-world scenarios; heavily skewed populations may need larger samples.
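
To illustrate the reduced variability of sample means, the sketch below compares the observed spread of simulated sample means with the standard error formula σ/√n. The population (normal with mean 50 and σ = 10), the sample sizes, and the number of repetitions are assumptions chosen for illustration.

```python
import numpy as np

rng = np.random.default_rng(7)  # arbitrary seed
sigma = 10.0                    # hypothetical population standard deviation

observed = {}
predicted = {}
for n in (10, 30, 100):
    # 10,000 samples of size n; one mean per sample
    means = rng.normal(loc=50, scale=sigma, size=(10_000, n)).mean(axis=1)
    observed[n] = means.std(ddof=1)       # spread of the sample means
    predicted[n] = sigma / np.sqrt(n)     # standard error: sigma / sqrt(n)
    print(f"n={n:>3}: observed {observed[n]:.3f}, predicted {predicted[n]:.3f}")
```

The observed spread tracks the prediction closely, and both shrink as `n` grows, which is why larger samples give tighter estimates.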

## ⏱️ 🔸 Poisson Distribution
- **Used For**: Counting the number of events in a fixed time or space.
- **Assumptions**:
  - Events are **independent**
  - Events occur at a **constant average rate**
- **Parameter**:
  - $\lambda$ (lambda): average number of events per interval  
    $$\lambda = \text{expected value}$$
- **Examples**:
  - Website visits per day
  - Animal adoptions per week
  - Customer arrivals per hour
- **Note**: The **CLT can also apply** to Poisson-distributed data with large samples.
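
As a minimal sketch, `scipy.stats.poisson` can answer typical questions about event counts. The rate used here (an average of 8 adoptions per week) is a hypothetical value for illustration.

```python
from scipy.stats import poisson

lam = 8  # hypothetical average number of adoptions per week

# P(exactly 5 adoptions in a week)
p_exactly_5 = poisson.pmf(5, lam)

# P(5 or fewer adoptions in a week)
p_at_most_5 = poisson.cdf(5, lam)

# P(more than 5 adoptions in a week)
p_more_than_5 = 1 - p_at_most_5

# For a Poisson distribution, the mean and the variance both equal lambda
mean_events = poisson.mean(lam)
var_events = poisson.var(lam)

print(f"P(X = 5)  = {p_exactly_5:.4f}")
print(f"P(X <= 5) = {p_at_most_5:.4f}")
print(f"P(X > 5)  = {p_more_than_5:.4f}")
print(f"mean = {mean_events}, variance = {var_events}")
```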
