# Q&A: More Distributions & the Central Limit Theorem

A beginner-friendly notebook of **questions & answers**. Run the setup cell first, then work through each section. We build intuition with simple **simulation**, using only the libraries imported in the setup cell.

## 0) Setup

In [None]:
# Run this cell first
import numpy as np
import pandas as pd
import matplotlib.pyplot as plt
from math import comb, factorial, exp, sqrt
from scipy.stats import norm, binom, poisson, expon

rng = np.random.default_rng(2025)
pd.set_option('display.precision', 3)


## 1) The Normal Distribution
**Q1.** What defines a normal distribution?

**Q2.** Generate 1,000 samples from a normal distribution with mean 20 and SD 3, then plot a histogram.

In [None]:
# To generate samples from a normal distribution you can use norm.rvs(loc=mean, scale=sd, size=n)
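
A minimal sketch of one way to approach Q2, assuming `norm.rvs` is given a seeded generator through its `random_state` argument. The variable name `samples` and the choice of 30 bins are illustrative, not part of the exercise.

```python
import numpy as np
import matplotlib.pyplot as plt
from scipy.stats import norm

rng = np.random.default_rng(2025)  # same seed as the setup cell

# Draw 1,000 values from a normal distribution with mean 20 and SD 3
samples = norm.rvs(loc=20, scale=3, size=1000, random_state=rng)

fig, ax = plt.subplots()
ax.hist(samples, bins=30, edgecolor="white")
ax.axvline(samples.mean(), color="black", linestyle="--", label="sample mean")
ax.set_xlabel("value")
ax.set_ylabel("count")
ax.set_title("1,000 draws from Normal(mean=20, SD=3)")
ax.legend()
plt.show()

print(f"sample mean = {samples.mean():.2f}, sample SD = {samples.std(ddof=1):.2f}")
```

The histogram should look roughly bell-shaped and centered near 20, with most of the mass between about 11 and 29 (three SDs either side).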

**Q3.** About what percentage of values fall within 1 SD of the mean in a normal distribution?
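
As a quick check for Q3, the sketch below estimates the share of simulated values within one SD of the mean and compares it with the exact normal probability, computed here from the error function as erf(1/√2). The sample size of 100,000 and the variable names are assumptions chosen for illustration.

```python
import numpy as np
from math import erf, sqrt

rng = np.random.default_rng(2025)

mu, sigma = 20, 3
x = rng.normal(mu, sigma, size=100_000)

# Fraction of simulated values with |x - mu| <= sigma
within_1sd = np.mean(np.abs(x - mu) <= sigma)

# Exact value for any normal distribution: P(|Z| <= 1) = erf(1 / sqrt(2))
exact = erf(1 / sqrt(2))

print(f"simulated: {within_1sd:.3f}, exact: {exact:.3f}")
```

Both numbers should land close to 0.68, which is the "68" in the 68-95-99.7 rule. The result does not depend on the particular mean and SD chosen.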

## 2) The Central Limit Theorem (CLT)
**Q4.** State the CLT in plain English.

**Q5.** Demonstrate the CLT by sampling means from a **uniform** distribution on [0,1]. Compare histograms of sample means for different sample sizes (n=2, n=5, n=30).

In [1]:
# Sample means from Uniform(0,1)
# I've written this helper function for you, but make sure you understand how it works
def sample_means_from_uniform(n, reps=2000):
    means = []
    for _ in range(reps):
        x = rng.uniform(0, 1, size=n)
        means.append(np.mean(x))
    return np.array(means)
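
One way to finish Q5 is sketched below. To keep the block self-contained it uses a vectorized stand-in, `sample_means_vectorized` (a hypothetical name), in place of the loop-based helper above; both produce `reps` means of `n` uniform draws.

```python
import numpy as np
import matplotlib.pyplot as plt

rng = np.random.default_rng(2025)

def sample_means_vectorized(n, reps=2000):
    """Return `reps` sample means, each computed from n Uniform(0, 1) draws."""
    # One row per repetition; averaging across columns gives one mean per row
    return rng.uniform(0, 1, size=(reps, n)).mean(axis=1)

sizes = [2, 5, 30]
means_by_n = {n: sample_means_vectorized(n) for n in sizes}

# Shared x-axis and bin range so the three panels are directly comparable
fig, axes = plt.subplots(1, len(sizes), figsize=(12, 3), sharex=True)
for ax, n in zip(axes, sizes):
    ax.hist(means_by_n[n], bins=30, range=(0, 1), edgecolor="white")
    ax.set_title(f"n = {n}")
    ax.set_xlabel("sample mean")
axes[0].set_ylabel("count")
plt.tight_layout()
plt.show()
```

With n=2 the histogram is roughly triangular; by n=30 it should look close to a bell curve centered at 0.5 and noticeably narrower.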


**Q6.** What happens to the **spread** of the sampling distribution of the mean as sample size increases?
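
For Q6, theory says the standard deviation of the sample mean is σ/√n, where σ = √(1/12) ≈ 0.289 for Uniform(0,1). The sketch below compares that formula with simulated values; the sample sizes and the 5,000 repetitions are illustrative choices.

```python
import numpy as np

rng = np.random.default_rng(2025)

sigma = np.sqrt(1 / 12)  # SD of a single Uniform(0, 1) draw
reps = 5000

results = {}
for n in [2, 5, 30, 100]:
    # Each row is one sample of size n; take the mean of every row
    means = rng.uniform(0, 1, size=(reps, n)).mean(axis=1)
    observed = means.std(ddof=1)
    theoretical = sigma / np.sqrt(n)
    results[n] = (observed, theoretical)
    print(f"n={n:>3}: observed SD = {observed:.4f}, sigma/sqrt(n) = {theoretical:.4f}")
```

The observed and theoretical columns should agree closely. Because of the square root, cutting the spread in half takes four times as much data.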

## 3) The Poisson Distribution
**Q7.** When is the Poisson distribution a good model?

**Q8.** If λ=8 adoptions per week, simulate 500 weeks of adoptions and plot a histogram.

In [None]:
# If you don't remember the Poisson sampling function, look it up (hint: try rng.poisson)
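
A possible sketch for Q8, assuming NumPy's `Generator.poisson` is used for the draws. The name `weekly` and the integer-centered bins are illustrative choices.

```python
import numpy as np
import matplotlib.pyplot as plt

rng = np.random.default_rng(2025)

lam = 8  # average adoptions per week
weekly = rng.poisson(lam=lam, size=500)

# Bin edges at k - 0.5 and k + 0.5 so each bar covers exactly one integer count
bins = np.arange(weekly.min(), weekly.max() + 2) - 0.5

fig, ax = plt.subplots()
ax.hist(weekly, bins=bins, edgecolor="white")
ax.set_xlabel("adoptions per week")
ax.set_ylabel("number of weeks")
ax.set_title("500 simulated weeks, Poisson(lambda=8)")
plt.show()

print(f"mean = {weekly.mean():.2f}, variance = {weekly.var(ddof=1):.2f}")
```

For a Poisson distribution the mean and the variance are both equal to λ, so both printed values should be near 8.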


**Q9.** Compute P(X=5), P(X≤5), and P(X>5) for X~Poisson(λ=8) **without** SciPy.
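
Q9 can be done with the Poisson formula P(X=k) = e^(−λ) λ^k / k! and nothing beyond the standard `math` module. In the sketch below, `poisson_pmf` is a hypothetical helper name introduced for illustration.

```python
from math import exp, factorial

def poisson_pmf(k, lam):
    """P(X = k) for X ~ Poisson(lam), straight from the formula."""
    return exp(-lam) * lam**k / factorial(k)

lam = 8

p_eq_5 = poisson_pmf(5, lam)
# P(X <= 5) adds up the probabilities for k = 0, 1, ..., 5
p_le_5 = sum(poisson_pmf(k, lam) for k in range(6))
# P(X > 5) is the complement of P(X <= 5)
p_gt_5 = 1 - p_le_5

print(f"P(X=5)  = {p_eq_5:.4f}")   # about 0.0916
print(f"P(X<=5) = {p_le_5:.4f}")   # about 0.1912
print(f"P(X>5)  = {p_gt_5:.4f}")   # about 0.8088
```

Using the complement for P(X>5) avoids summing an infinite tail.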

## 4) The Exponential Distribution
**Q10.** How is the exponential distribution related to the Poisson distribution?
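
A simulation can illustrate the link asked about in Q10: if the gaps between events are exponential with mean 1/λ, the number of events per unit of time behaves like a Poisson(λ) count. The sketch below assumes λ = 8 per week (borrowed from Q8) and 2,000 weeks; all variable names are illustrative.

```python
import numpy as np

rng = np.random.default_rng(2025)

rate = 8        # events per week (lambda)
n_weeks = 2000

# Exponential gaps between events, with mean 1/rate weeks.
# Draw 20% more gaps than expected so the timeline covers all n_weeks.
gaps = rng.exponential(scale=1 / rate, size=int(rate * n_weeks * 1.2))
assert gaps.sum() > n_weeks, "not enough gaps drawn to cover the timeline"

# Event times are the running total of the gaps; keep those inside the window
arrival_times = np.cumsum(gaps)
arrival_times = arrival_times[arrival_times < n_weeks]

# Count how many events fall in each week [0, 1), [1, 2), ...
counts = np.bincount(arrival_times.astype(int), minlength=n_weeks)

print(f"mean count per week     = {counts.mean():.2f}")
print(f"variance of weekly count = {counts.var(ddof=1):.2f}")
```

Both the mean and the variance of the weekly counts should be near 8, which is the Poisson signature. In short, Poisson describes how many events occur in a fixed interval, and exponential describes how long you wait between them.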

**Q11.** If tickets arrive on average every 2 minutes, simulate 1,000 wait times and estimate P(wait ≤ 1 min).

In [None]:
# How do you draw samples from the exponential distribution? (Hint: rng.exponential takes the mean wait as `scale`)
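
One possible sketch for Q11. It assumes "on average every 2 minutes" means a mean wait of 2 minutes, which is what NumPy's `scale` parameter expects, and it compares the simulated estimate with the exact value 1 − e^(−1/2). Variable names are illustrative.

```python
import numpy as np

rng = np.random.default_rng(2025)

mean_wait = 2.0  # minutes between tickets, on average
waits = rng.exponential(scale=mean_wait, size=1000)

# Estimated probability: fraction of simulated waits of at most 1 minute
p_est = np.mean(waits <= 1)

# Exact probability from the exponential CDF: 1 - exp(-t / mean_wait) at t = 1
p_exact = 1 - np.exp(-1 / mean_wait)

print(f"estimated P(wait <= 1) = {p_est:.3f}")
print(f"exact     P(wait <= 1) = {p_exact:.3f}")  # about 0.393
```

A common slip is passing the rate (0.5 per minute) as `scale`; that would simulate a mean wait of half a minute instead of two.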


## 5) The t-Distribution
**Q12.** What makes the t-distribution different from the normal distribution?

**Q13.** Simulate 5,000 draws from a t-like distribution with df=5 and compare its histogram to a standard normal (simulation-based).

In [None]:
# Do you know what a t-like distribution is? (Hint: a standard normal divided by sqrt(chi-square / df))
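
In case "t-like" is unfamiliar: a value with a t-distribution on df degrees of freedom can be built as Z / sqrt(V/df), where Z is standard normal and V is an independent sum of df squared standard normals (a chi-square draw). The sketch below builds 5,000 such values from plain normal draws and compares them with a standard normal sample; the names and the cut-off of 2 are illustrative choices.

```python
import numpy as np
import matplotlib.pyplot as plt

rng = np.random.default_rng(2025)

df, size = 5, 5000

z = rng.standard_normal(size)
# Chi-square(df) draws: sum of df independent squared standard normals per row
v = (rng.standard_normal((size, df)) ** 2).sum(axis=1)
t_like = z / np.sqrt(v / df)

normal = rng.standard_normal(size)

# Same bins for both; the few values beyond +/-6 are left out of the plot
bins = np.linspace(-6, 6, 61)
fig, ax = plt.subplots()
ax.hist(normal, bins=bins, density=True, alpha=0.5, label="standard normal")
ax.hist(t_like, bins=bins, density=True, alpha=0.5, label="t-like, df=5")
ax.set_xlabel("value")
ax.set_ylabel("density")
ax.legend()
plt.show()

# Heavier tails show up as more values far from zero
tail_t = np.mean(np.abs(t_like) > 2)
tail_z = np.mean(np.abs(normal) > 2)
print(f"share beyond +/-2: t-like = {tail_t:.3f}, normal = {tail_z:.3f}")
```

The t-like sample should put roughly twice as much probability beyond ±2 as the normal sample, which is the "heavier tails" behavior the question is getting at.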

## 6) The Mean of Means
**Q14.** Using a strongly **right-skewed** distribution (log-normal), compute sample means for n=30 over 2,000 repetitions. What do you notice about the shape of the distribution of the means?

In [None]:
# Simulate a skewed population (log-normal via exponentiating normal)
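
A sketch for Q14, assuming a log-normal population built by exponentiating standard normal draws (log-scale mean 0 and SD 1, an illustrative choice). The `skewness` helper is hypothetical and uses the simple moment-based definition, with 0 meaning symmetric.

```python
import numpy as np

rng = np.random.default_rng(2025)

n, reps = 30, 2000

# Each row is one sample of size n from a log-normal population
population_draws = np.exp(rng.normal(loc=0.0, scale=1.0, size=(reps, n)))
means = population_draws.mean(axis=1)

def skewness(a):
    """Moment-based skewness: third central moment over SD cubed."""
    d = a - a.mean()
    return np.mean(d**3) / np.mean(d**2) ** 1.5

skew_raw = skewness(population_draws.ravel())
skew_means = skewness(means)

print(f"skewness of raw values   = {skew_raw:.2f}")
print(f"skewness of sample means = {skew_means:.2f}")
print(f"mean of sample means     = {means.mean():.3f}")
```

The sample means should be far less skewed than the raw values and cluster around the population mean, e^0.5 ≈ 1.649. Some right skew usually remains at n=30, a reminder that very skewed populations need larger samples before the CLT approximation becomes tight.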


## 7) Wrap-up
- A normal distribution is fully defined by **μ** and **σ**; many real-world histograms look roughly normal.
- **CLT**: sampling distributions of means tend to normal as **n** grows.
- **Poisson** counts events in a fixed interval; **Exponential** models the wait time between events; **t** accounts for the extra uncertainty in small samples.
- Simulation is your friend for building intuition.