###1. Define the z-statistic and explain its relationship to the standard normal distribution. How is it used in hypothesis testing?

A )
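The z-statistic measures how many standard deviations a data point or sample mean lies from the population mean. For a sample mean it is calculated as:

$$z = \frac{\bar{x} - \mu}{\sigma / \sqrt{n}}$$

where $\bar{x}$ is the sample mean, $\mu$ the population mean, $\sigma$ the population standard deviation, and $n$ the sample size.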

Relationship to standard normal distribution:
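- Under the null hypothesis, the z-statistic follows the standard normal distribution (mean = 0, SD = 1) when the population variance is known and either the population is normal or the sample size is large enough for the Central Limit Theorem to apply.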

Use in hypothesis testing:
- It is used to compare sample results with population claims.
- The z-value is converted into a p-value, which determines whether to reject the null hypothesis.
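The link to the standard normal distribution can be checked with a short simulation. This is a minimal sketch, assuming a hypothetical normal population with μ = 100 and σ = 15 and samples of size 25 (values and the seed are chosen only for illustration):

```python
import numpy as np

rng = np.random.default_rng(0)  # fixed seed, chosen arbitrarily for reproducibility

mu, sigma, n = 100.0, 15.0, 25  # illustrative population parameters and sample size
num_samples = 20_000

# Draw many samples from the population and compute each sample mean.
samples = rng.normal(mu, sigma, size=(num_samples, n))
sample_means = samples.mean(axis=1)

# z-statistic of every sample mean: (x̄ - μ) / (σ / √n)
z_values = (sample_means - mu) / (sigma / np.sqrt(n))

print("Mean of z-statistics:", z_values.mean())
print("SD of z-statistics:", z_values.std())
```

If the z-statistic behaves as described, the printed mean should be close to 0 and the SD close to 1.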

####2. What is a p-value, and how is it used in hypothesis testing? What does it mean if the p-value is very small (e.g., 0.01)?

A )
The p-value is the probability of observing the sample result, or something more extreme, assuming the null hypothesis is true.

Usage:
- If p ≤ significance level (α) → reject the null hypothesis.
- If p > α → fail to reject the null hypothesis.

If p = 0.01:
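- If the null hypothesis were true, a result at least this extreme would occur only about 1% of the time. (It is not the probability that the null hypothesis is true.)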
- Indicates strong evidence against the null hypothesis.
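The decision rule above can be sketched in a few lines. The helper names `two_tailed_p_value` and `decide` are hypothetical, introduced only for this illustration, and the z-value 2.576 is picked because it gives a two-tailed p-value near 0.01:

```python
from scipy.stats import norm


def two_tailed_p_value(z):
    """Probability of a |Z| at least as large as |z| under the standard normal."""
    return 2 * norm.sf(abs(z))  # sf(x) = 1 - cdf(x)


def decide(p_value, alpha=0.05):
    """Apply the rule: reject H0 when p <= alpha, otherwise fail to reject."""
    return "reject H0" if p_value <= alpha else "fail to reject H0"


p_small = two_tailed_p_value(2.576)  # illustrative z-value, p close to 0.01
print("p-value:", p_small)
print("Decision at alpha = 0.05:", decide(p_small, alpha=0.05))
print("Decision for p = 0.20:", decide(0.20, alpha=0.05))
```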

####3. Compare and contrast the binomial and Bernoulli distributions.

A )
| Feature    | **Bernoulli**                  | **Binomial**                            |
| ---------- | ------------------------------ | --------------------------------------- |
| Definition | One trial with success/failure | Repeated n independent Bernoulli trials |
| Outcomes   | 0 or 1                         | 0,1,2,...,n                             |
| Parameters | p                              | n, p                                    |
| Example    | Coin flip once                 | Flip coin 10 times                      |

Binomial = sum of n independent Bernoulli trials with the same p. A Bernoulli distribution is a binomial distribution with n = 1.
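As a quick check that a Bernoulli distribution is the n = 1 case of the binomial, the two PMFs can be compared with `scipy.stats`. This is a small sketch; p = 0.6 is an illustrative value, not taken from the comparison table:

```python
import numpy as np
from scipy.stats import bernoulli, binom

p = 0.6                      # illustrative success probability
outcomes = np.array([0, 1])  # the only possible Bernoulli outcomes

bernoulli_pmf = bernoulli.pmf(outcomes, p)
binomial_n1_pmf = binom.pmf(outcomes, 1, p)  # binomial with a single trial

print("Bernoulli PMF:     ", bernoulli_pmf)
print("Binomial(n=1) PMF: ", binomial_n1_pmf)
print("Identical:", np.allclose(bernoulli_pmf, binomial_n1_pmf))
```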

####4. Under what conditions is the binomial distribution used, and how does it relate to the Bernoulli distribution?

A )
Use a binomial distribution when:
- Fixed number of trials (n)
- Trials independent
- Each trial has two outcomes (success/failure)
- Probability of success (p) is constant

Relation:
- A binomial distribution is the sum of n Bernoulli trials.
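The relation can be illustrated by building binomial counts directly from Bernoulli trials. This is a minimal sketch, assuming n = 10 and p = 0.6 (illustrative values) and an arbitrary seed:

```python
import numpy as np

rng = np.random.default_rng(1)  # arbitrary seed for reproducibility

n, p = 10, 0.6                  # illustrative parameters
num_experiments = 50_000

# Each row is one experiment of n independent Bernoulli(p) trials (True = success).
trials = rng.random((num_experiments, n)) < p

# Summing the trials in each row gives one binomial(n, p) observation.
successes = trials.sum(axis=1)

print("Simulated mean:", successes.mean(), "| expected n*p:", n * p)
print("Simulated variance:", successes.var(), "| expected n*p*(1-p):", n * p * (1 - p))
```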

####5. What are the key properties of the Poisson distribution, and when is it appropriate to use this distribution?

A )
Properties :
- Models number of events in a fixed interval.
- Events occur independently.
- Average event rate = λ (lambda).
- Mean = λ, Variance = λ.

Used when:
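- Counting events that occur independently at a roughly constant average rate over a fixed interval of time or space. It fits rare events especially well (e.g., accidents, calls to a call center, decay events in physics).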
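The property Mean = Variance = λ can be checked by simulation. This is a short sketch, assuming λ = 3 events per interval (an illustrative rate) and an arbitrary seed:

```python
import numpy as np

rng = np.random.default_rng(2)  # arbitrary seed for reproducibility

lam = 3.0                       # illustrative average number of events per interval
counts = rng.poisson(lam, size=100_000)

print("Simulated mean:", counts.mean())
print("Simulated variance:", counts.var())
print("Theoretical mean and variance:", lam)
```

Both printed estimates should land close to λ.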

####6. Define the terms "probability distribution" and "probability density function" (PDF). How does a PDF differ from a probability mass function (PMF)?

A )
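Probability distribution:
- Describes how probability is spread over the possible values of a random variable. A continuous variable is described by a PDF, a discrete one by a PMF.

PDF (Probability Density Function):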
- Used for continuous variables.
- Probability is area under the curve.
- PDF value can exceed 1; probability cannot.

PMF (Probability Mass Function):
- Used for discrete variables.
- Gives probability for specific values.
- Sum of all probabilities = 1.
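The contrast can be made concrete with `scipy.stats`. This is a sketch under illustrative assumptions: a normal distribution with SD 0.1 for the continuous case and a binomial(10, 0.6) for the discrete case:

```python
import numpy as np
from scipy.stats import binom, norm

# Continuous case: a narrow normal distribution has a tall PDF.
narrow = norm(loc=0, scale=0.1)
peak_density = narrow.pdf(0)             # a density value, not a probability
area = narrow.cdf(1) - narrow.cdf(-1)    # area under the PDF on [-1, 1] = P(-1 <= X <= 1)

print("PDF value at 0:", peak_density)
print("P(-1 <= X <= 1):", area)

# Discrete case: each PMF value is itself a probability.
k = np.arange(0, 11)
pmf_values = binom.pmf(k, 10, 0.6)
total = pmf_values.sum()

print("Largest PMF value:", pmf_values.max())
print("Sum of PMF values:", total)
```

The density at 0 is greater than 1, while the area (a probability) and every PMF value stay within [0, 1].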

####7. Explain the Central Limit Theorem (CLT) with example.

A )The Central Limit Theorem states:

- For sufficiently large samples (n ≥ 30 is a common rule of thumb), the sampling distribution of the sample mean is approximately normal, regardless of the shape of the population distribution, provided the population has finite variance.

Example:

Population = exam scores (not normally distributed).

Take repeated samples of size 40.

The means of these samples form a distribution that is approximately normal, centered on the population mean with standard deviation σ/√40.
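The example can be simulated. This is a minimal sketch in which an exponential distribution stands in for the non-normal population (an assumption made for illustration, not actual exam scores); the sample size of 40 follows the example:

```python
import numpy as np
from scipy.stats import skew

rng = np.random.default_rng(3)  # arbitrary seed for reproducibility

n = 40                          # sample size from the example
num_samples = 20_000

# Exponential population with scale 1: mean 1, SD 1, strongly right-skewed.
samples = rng.exponential(scale=1.0, size=(num_samples, n))
sample_means = samples.mean(axis=1)

raw_skew = skew(samples.ravel())
means_skew = skew(sample_means)

print("Skewness of raw draws:", raw_skew)
print("Skewness of sample means:", means_skew)
print("Mean of sample means:", sample_means.mean())
print("SD of sample means:", sample_means.std(), "| sigma/sqrt(n):", 1 / np.sqrt(n))
```

The sample means are far less skewed than the raw draws, and their spread is close to σ/√n.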

####8. Compare z-scores and t-scores. When should you use a z-score, and when should a t-score be applied instead?

A )

---
| Criteria      | **Z-score**     | **T-score**                    |
| ------------- | --------------- | ------------------------------ |
| Population SD | Known           | Unknown                        |
| Sample size   | Large (n > 30)  | Small (n ≤ 30)                 |
| Distribution  | Standard normal | t-distribution (heavier tails) |
---


Use z-score → population variance known.

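Use t-score → population variance unknown (estimated from the sample), especially with a small sample. For large n the t-distribution approaches the standard normal.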
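The heavier tails of the t-distribution show up in its critical values. This is a short sketch comparing two-tailed critical values at α = 0.05 for a few illustrative sample sizes:

```python
from scipy.stats import norm, t

upper_quantile = 0.975  # two-tailed test at alpha = 0.05
z_crit = norm.ppf(upper_quantile)
print("z critical value:", z_crit)

# t critical values for several illustrative sample sizes (df = n - 1).
t_crit_by_n = {}
for n in (5, 10, 30, 100, 1000):
    t_crit_by_n[n] = t.ppf(upper_quantile, df=n - 1)
    print(f"n = {n:4d}: t critical value = {t_crit_by_n[n]:.4f}")
```

The t critical value is always larger than the z critical value and shrinks toward it as n grows.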


####9. Given a sample mean of 105, a population mean of 100, a standard deviation of 15, and a sample size of 25, calculate the z-score and p-value. Based on a significance level of 0.05, do you reject or fail to reject the null hypothesis?

####Task: Write Python code to calculate the z-score and p-value for the given data.

####Objective: Apply the formula for the z-score and interpret the p-value for hypothesis testing.

A )
Given:
- Sample mean = 105
- Population mean = 100
- σ = 15
- n = 25

Calculate z-score & p-value:

$$z = \frac{105 - 100}{15 / \sqrt{25}} = \frac{5}{3} \approx 1.6667$$

Two-tailed p-value:

$$p = 2\,\bigl(1 - \Phi(1.6667)\bigr) \approx 0.0956$$

Decision (α = 0.05, two-tailed):

Since p > 0.05, we fail to reject the null hypothesis.

In [None]:
import math
from scipy.stats import norm

# Given values
sample_mean = 105
population_mean = 100
std_dev = 15
n = 25

# Calculate z-score
z = (sample_mean - population_mean) / (std_dev / math.sqrt(n))

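# Two-tailed p-value (sf = 1 - cdf, numerically more accurate in the tails)
p_value = 2 * norm.sf(abs(z))

print("Z-score:", z)
print("P-value:", p_value)

# Decision rule: reject the null hypothesis when p <= alpha
alpha = 0.05
if p_value <= alpha:
    print("Reject the null hypothesis")
else:
    print("Fail to reject the null hypothesis")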


####10. Simulate a binomial distribution with 10 trials and a probability of success of 0.6 using Python. Generate 1,000 samples and plot the distribution. What is the expected mean and variance?

####Task: Use Python to generate the data, plot the distribution, and calculate the mean and variance.

####Objective: Understand the properties of a binomial distribution and verify them through simulation.

A )
Expected:
- Mean = np = 10 × 0.6 = 6
- Variance = np(1-p) = 10 × 0.6 × 0.4 = 2.4

In [None]:
import numpy as np
import matplotlib.pyplot as plt

# Parameters
n = 10
p = 0.6
samples = 1000

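# Generate binomial samples (seeded generator so the results are reproducible)
rng = np.random.default_rng(42)
data = rng.binomial(n, p, samples)

# Plot distribution (bin edges at k - 0.5 so each bar is centered on an integer count)
plt.hist(data, bins=np.arange(n + 2) - 0.5, edgecolor='black')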
plt.title('Binomial Distribution (n=10, p=0.6)')
plt.xlabel('Number of successes')
plt.ylabel('Frequency')
plt.show()

# Mean & variance
print("Simulated Mean:", np.mean(data))
print("Simulated Variance:", np.var(data))

# Expected
print("Expected Mean:", n*p)
print("Expected Variance:", n*p*(1-p))
