import numpy as np
import scipy.stats as stats
import matplotlib.pyplot as plt

# Markdown explanations embedded in this notebook

"""
# Statistics Advance Part 1 - Solutions

## 1. What is a random variable in probability theory?
A random variable is a function that assigns a numerical value to each possible outcome of a random experiment.
"""

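"""
A minimal sketch of the definition above, assuming two tosses of a fair coin as the experiment. The names `sample_space` and `num_heads` are illustrative and not part of the original notebook.
"""

```python
from itertools import product

# Sample space of two coin tosses: ('H', 'H'), ('H', 'T'), ('T', 'H'), ('T', 'T')
sample_space = list(product("HT", repeat=2))

def num_heads(outcome):
    """Hypothetical random variable X: maps an outcome to its number of heads."""
    return outcome.count("H")

# The random variable assigns one number to every outcome
values = {outcome: num_heads(outcome) for outcome in sample_space}
print(values)
```
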
"""
## 2. What are the types of random variables?
- **Discrete Random Variable**: Takes values from a finite or countably infinite set (e.g., 0, 1, 2, 3, ...)
- **Continuous Random Variable**: Takes any value in an interval of real numbers (e.g., a height or a waiting time)
"""

"""
## 3. Difference between discrete and continuous distributions
| Discrete Distribution                    | Continuous Distribution                      |
|------------------------------------------|----------------------------------------------|
| Countable outcomes (finite or infinite)  | Uncountably many outcomes (an interval)      |
| Example: Binomial                        | Example: Normal Distribution                 |
"""

"""
## 4. What are probability distribution functions (PDF)?
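A probability distribution function describes how probability is spread over the values a random variable can take. Its form depends on the type of variable:
- **Discrete** → Probability Mass Function (PMF): gives P(X = x) for each value x
- **Continuous** → Probability Density Function (PDF): gives a density; probabilities come from integrating it over an interval, and P(X = x) = 0 for any single x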
"""

"""
## 5. Difference between PDF and CDF
- **PDF**: Gives the probability density at a point x (continuous variables). A density is not itself a probability and can exceed 1.
- **CDF**: Gives the probability that the variable is ≤ x, i.e. CDF(x) = P(X ≤ x).

CDF(x) = ∫[−∞ to x] PDF(t) dt
"""

# Example Plot: Normal Distribution PDF and CDF
x = np.linspace(-4, 4, 1000)
pdf = stats.norm.pdf(x)
cdf = stats.norm.cdf(x)

plt.figure(figsize=(12,6))
plt.plot(x, pdf, label='PDF')
plt.plot(x, cdf, label='CDF')
plt.title('PDF and CDF of Standard Normal Distribution')
plt.legend()
plt.grid(True)
plt.show()
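
"""
The integral relation between PDF and CDF can also be checked numerically. This is a rough sketch that assumes the standard normal distribution and an arbitrary evaluation point `x0 = 1.0`.
"""

```python
import numpy as np
from scipy import stats
from scipy.integrate import quad

x0 = 1.0  # arbitrary point chosen for illustration

# Integrate the PDF from -infinity up to x0
area, _abs_err = quad(stats.norm.pdf, -np.inf, x0)

# Compare with the closed-form CDF value
cdf_value = stats.norm.cdf(x0)
print(area, cdf_value)
```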

"""
## 6. What is a discrete uniform distribution?
A discrete uniform distribution assigns **equal probability** to each of a finite number of outcomes.
Example: Rolling a fair die → P(X=x) = 1/6 for x ∈ {1,2,3,4,5,6}
"""

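"""
A small sketch of the fair-die example using `scipy.stats.randint`, whose upper bound is exclusive. The variable names are illustrative.
"""

```python
import numpy as np
from scipy import stats

# Discrete uniform on {1, ..., 6}; `high` is exclusive in scipy.stats.randint
die = stats.randint(low=1, high=7)

faces = np.arange(1, 7)
pmf_values = die.pmf(faces)  # each entry should equal 1/6

print(dict(zip(faces.tolist(), pmf_values.tolist())))
print("mean:", die.mean())
```
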
"""
## 7. Key properties of Bernoulli Distribution
- Only 2 outcomes: Success (1) or Failure (0)
- Parameter: p (probability of success), so P(X=1) = p and P(X=0) = 1 - p
- Mean = p, Variance = p(1 - p)
"""

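"""
A brief sketch of these properties with `scipy.stats.bernoulli`, assuming p = 0.3 purely for illustration.
"""

```python
from scipy import stats

p = 0.3  # assumed success probability
trial = stats.bernoulli(p)

pmf_failure = trial.pmf(0)  # P(X = 0) = 1 - p
pmf_success = trial.pmf(1)  # P(X = 1) = p
mean, var = trial.mean(), trial.var()

print(pmf_failure, pmf_success, mean, var)
```
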
"""
## 8. What is the binomial distribution?
The distribution of the number of successes in n independent Bernoulli trials, each with the same success probability p.
P(X=k) = C(n, k) * p^k * (1-p)^(n-k), for k = 0, 1, ..., n

Example: Tossing a coin 10 times → No. of heads ~ Binomial(n=10, p=0.5)
"""

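"""
The formula above can be evaluated by hand and compared with `scipy.stats.binom`. This sketch uses the coin example (n=10, p=0.5) and picks k=5 as an arbitrary value.
"""

```python
from math import comb
from scipy import stats

n, p = 10, 0.5  # from the coin-toss example
k = 5           # arbitrary number of heads to evaluate

# Direct use of C(n, k) * p^k * (1-p)^(n-k)
manual = comb(n, k) * p**k * (1 - p) ** (n - k)

# Same probability from the library
library = stats.binom.pmf(k, n, p)

print(manual, library)
```
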
"""
## 9. What is the Poisson distribution?
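Models the number of events in a fixed interval of time or space, assuming events occur independently at a constant average rate λ.
P(X=k) = (λ^k * e^(−λ)) / k!, for k = 0, 1, 2, ...
Often used for rare events. Example: Number of calls at a call center in an hour.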
"""

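"""
A short sketch for the call-center example, assuming an average rate of 4 calls per hour. The rate and the thresholds are made up for illustration.
"""

```python
from math import exp, factorial
from scipy import stats

lam = 4.0  # assumed average number of calls per hour
k = 2      # arbitrary count to evaluate

# Poisson PMF written out: lam^k * e^(-lam) / k!
manual = lam**k * exp(-lam) / factorial(k)
library = stats.poisson.pmf(k, mu=lam)

# Probability of more than 6 calls in an hour (survival function = 1 - CDF)
prob_more_than_6 = stats.poisson.sf(6, mu=lam)

print(manual, library, prob_more_than_6)
```
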
"""
## 10. What is a continuous uniform distribution?
Constant probability density over a continuous interval [a, b]: f(x) = 1 / (b − a), so intervals of equal length are equally likely.
Example: Random number between 0 and 1 → Uniform(0,1)
"""

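"""
A minimal sketch of Uniform(0,1) with `scipy.stats.uniform`, where `loc` is the lower bound and `scale` is the interval width. The interval [0.2, 0.5] is an arbitrary choice.
"""

```python
from scipy import stats

u = stats.uniform(loc=0, scale=1)  # Uniform(0, 1)

density = u.pdf(0.3)  # constant density 1 / (1 - 0) inside the interval

# P(0.2 <= X <= 0.5) equals the interval length for Uniform(0, 1)
prob = u.cdf(0.5) - u.cdf(0.2)

print(density, prob)
```
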
"""
## 11. Characteristics of Normal Distribution
- Bell-shaped
- Symmetric about mean (μ)
- Mean = Median = Mode
"""

"""
## 12. What is the standard normal distribution?
Normal distribution with mean = 0 and standard deviation = 1.
Used for standardizing variables (Z-scores).
"""

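"""
As a quick illustration, the standard normal CDF gives the probability of falling within 1, 2, and 3 standard deviations of the mean. The dictionary name `within` is illustrative.
"""

```python
from scipy import stats

# P(-k <= Z <= k) for a standard normal variable Z
within = {k: stats.norm.cdf(k) - stats.norm.cdf(-k) for k in (1, 2, 3)}

for k, prob in within.items():
    print(f"P(|Z| <= {k}) = {prob:.4f}")
```
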
"""
## 13. What is the Central Limit Theorem (CLT)?
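For independent samples from a population with finite variance, the distribution of the **sample mean** approaches a **normal distribution** as the sample size n → ∞, regardless of the shape of the population distribution.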
"""

"""
## 14. How does CLT relate to normal distribution?
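CLT explains why normal-based methods can be applied to sample means even if the population isn’t normal: for large n, the sample mean is approximately normal with mean μ and standard deviation σ/√n.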
"""

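"""
A simulation sketch of the CLT, assuming a right-skewed Exponential(1) population (mean 1, standard deviation 1). The sample size, number of repetitions, and seed are arbitrary choices.
"""

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(0)  # fixed seed so the run is repeatable

n = 50              # observations per sample (assumed)
n_samples = 10_000  # number of repeated samples (assumed)

# Each row is one sample of size n from the skewed population
samples = rng.exponential(scale=1.0, size=(n_samples, n))
sample_means = samples.mean(axis=1)

print("mean of sample means:", sample_means.mean())        # theory: 1
print("std of sample means: ", sample_means.std(ddof=1))   # theory: 1/sqrt(n)
print("skewness of sample means:", stats.skew(sample_means))  # population skewness is 2
```
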
"""
## 15. Application of Z statistics in hypothesis testing
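Z = (X̄ - μ) / (σ / √n), where X̄ is the sample mean, μ the hypothesized population mean, σ the known population standard deviation, and n the sample size.
Used to determine **how many standard errors** a sample mean is from the hypothesized population mean.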
"""

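"""
A sketch of a two-sided one-sample Z-test based on the formula above. All numbers (hypothesized mean 100, σ = 15, n = 36, sample mean 105) are hypothetical.
"""

```python
from math import sqrt
from scipy import stats

mu0 = 100.0     # hypothesized population mean (hypothetical)
sigma = 15.0    # known population standard deviation (hypothetical)
n = 36          # sample size (hypothetical)
x_bar = 105.0   # observed sample mean (hypothetical)

# Z statistic: distance of the sample mean from mu0 in standard errors
z = (x_bar - mu0) / (sigma / sqrt(n))

# Two-sided p-value from the standard normal distribution
p_value = 2 * stats.norm.sf(abs(z))

print(z, p_value)
```
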
"""
## 16. How to calculate a Z-score, and what does it represent?
Z = (X - μ) / σ → The number of standard deviations a value X lies above (Z > 0) or below (Z < 0) the mean. It **standardizes** scores across different distributions.
"""

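"""
A small sketch that computes Z-scores by hand and with `scipy.stats.zscore`. The `scores` array is made-up data, and its own mean and standard deviation are used as μ and σ.
"""

```python
import numpy as np
from scipy import stats

scores = np.array([55.0, 60.0, 70.0, 80.0, 85.0])  # hypothetical data

mu = scores.mean()
sigma = scores.std()  # population standard deviation (ddof=0)

manual_z = (scores - mu) / sigma
library_z = stats.zscore(scores)  # uses ddof=0 by default

print(manual_z)
```
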
"""
## 17. Point estimates vs Interval estimates
- **Point Estimate**: Single value estimate (e.g., sample mean)
- **Interval Estimate**: Range of values → **Confidence Interval**
"""

"""
## 18. Significance of confidence intervals
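Gives a range that is likely to contain the **true population parameter**. A 95% confidence interval comes from a procedure that captures the parameter in about 95% of repeated samples, so its width provides a measure of **uncertainty**.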
"""

"""
## 19. Relationship between Z-score and confidence interval
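Confidence Interval = X̄ ± Z * (σ/√n), where X̄ is the sample mean and Z is the critical value for the chosen confidence level (e.g., Z ≈ 1.96 for 95%).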
"""

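"""
A sketch of a 95% confidence interval for a mean with known σ, following the formula above. The sample figures (mean 105, σ = 15, n = 36) are hypothetical.
"""

```python
from math import sqrt
from scipy import stats

x_bar = 105.0      # sample mean (hypothetical)
sigma = 15.0       # known population standard deviation (hypothetical)
n = 36             # sample size (hypothetical)
confidence = 0.95

# Critical value: leaves (1 - confidence) / 2 in each tail
z_crit = stats.norm.ppf(1 - (1 - confidence) / 2)

margin = z_crit * sigma / sqrt(n)
lower, upper = x_bar - margin, x_bar + margin

print(z_crit, lower, upper)
```
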
"""
## 20. How are Z-scores used to compare different distributions?
Z-scores put values from different distributions on a common scale (standard deviations from their own mean), so the values can be compared directly.
"""

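"""
For instance, scores from two exams with different scales can be compared through their Z-scores. The exam figures below are invented, and `z_score` is a hypothetical helper.
"""

```python
def z_score(x, mu, sigma):
    """Hypothetical helper: standard deviations of x from the mean mu."""
    return (x - mu) / sigma

# Exam A: score 85, mean 70, standard deviation 10 (hypothetical)
z_a = z_score(85, 70, 10)

# Exam B: score 620, mean 500, standard deviation 100 (hypothetical)
z_b = z_score(620, 500, 100)

# The larger Z-score marks the stronger result relative to its own group
better = "A" if z_a > z_b else "B"
print(z_a, z_b, better)
```
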
"""
## 21. Assumptions for applying Central Limit Theorem
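1. Observations are independent and identically distributed (random sampling).
2. The population has finite mean and variance.
3. Sample size is large enough (n ≥ 30 is a common rule of thumb; heavily skewed populations may need more).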
"""

"""
## 22. Expected value in probability distribution
E[X] = Σ [x * P(X=x)] for a discrete variable (E[X] = ∫ x f(x) dx for a continuous one) → Mean of a random variable.
"""

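"""
A sketch of the expected-value formula for a fair die, with a simulation of the long-run average alongside. The seed and number of rolls are arbitrary.
"""

```python
import numpy as np

faces = np.arange(1, 7)
probs = np.full(6, 1 / 6)  # fair die: each face has probability 1/6

# E[X] = sum of x * P(X = x)
expected = np.dot(faces, probs)

# Long-run average from simulated rolls (upper bound is exclusive)
rng = np.random.default_rng(0)
rolls = rng.integers(1, 7, size=100_000)

print(expected, rolls.mean())
```
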
"""
## 23. Probability distribution relation to expected outcome
Expected outcome = Long-run average of the values drawn from a probability distribution over many repetitions.
Example: Dice → E[X] = (1+2+3+4+5+6)/6 = 3.5
"""