In [None]:
Z-Test Problem Solving

In [None]:
Problem 1: One-Sample Z-Test (Two-Tailed)

In [None]:
Scenario:
A bakery claims its cookies weigh 50g on average (σ = 5g). You randomly sample 36 cookies and find an average weight of 48g.
Test at α = 0.05 if the average weight differs from the claim.

In [None]:
Step 1: State Hypotheses
H₀: μ = 50g (Bakery’s claim is true)

H₁: μ ≠ 50g (Average weight differs)

Step 2: Calculate Z-Score

Z = (x̄ − μ₀) / (σ / √n) = (48 − 50) / (5 / √36) = −2 / 0.8333 ≈ −2.40

In [None]:
Step 3: Determine Critical Value
For α = 0.05 (two-tailed), critical Z = ±1.96.

|Z| = 2.4 > 1.96 → Reject H₀.

Conclusion
There’s significant evidence (p ≈ 0.016 < 0.05) that the average cookie weight differs from 50g.
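The steps in Problem 1 can be checked numerically. The following is a minimal sketch using `scipy.stats.norm`; the variable names are illustrative and not part of the original worked solution.

```python
from math import sqrt
from scipy.stats import norm

# Inputs taken from the Problem 1 scenario
mu0, sigma, n, x_bar, alpha = 50, 5, 36, 48, 0.05

z = (x_bar - mu0) / (sigma / sqrt(n))   # test statistic
z_crit = norm.ppf(1 - alpha / 2)        # two-tailed critical value
p_value = 2 * norm.sf(abs(z))           # two-tailed p-value

print(f"Z = {z:.2f}, critical Z = ±{z_crit:.2f}, p = {p_value:.4f}")
print("Reject H0" if abs(z) > z_crit else "Fail to reject H0")
```

Comparing |Z| with the critical value and comparing the p-value with α always lead to the same decision; the p-value just shows how far past the threshold the result is.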

In [None]:
Problem 2: One-Tailed Z-Test (Right-Tailed)

In [None]:
Scenario:
A school claims its students average 75 on exams (σ = 10). You test 25 students and find an average of 78.
Test at α = 0.01 if scores are higher than claimed.

In [None]:
Step 1: State Hypotheses
H₀: μ ≤ 75 (Mean score is 75 or lower)

H₁: μ > 75 (Scores are higher)

Step 2: Calculate Z-Score

Z = (x̄ − μ₀) / (σ / √n) = (78 − 75) / (10 / √25) = 3 / 2 = 1.50

In [None]:
Step 3: Compare to Critical Value
For α = 0.01 (right-tailed), critical Z = 2.326.

Z = 1.5 < 2.326 → Fail to reject H₀.

Conclusion
No significant evidence (p ≈ 0.067 > 0.01) that scores are higher than 75.
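For a right-tailed test, all of α sits in the upper tail, so the critical value comes from the (1 − α) quantile rather than (1 − α/2). A short sketch of Problem 2 under that convention, again with illustrative variable names:

```python
from math import sqrt
from scipy.stats import norm

# Inputs taken from the Problem 2 scenario
mu0, sigma, n, x_bar, alpha = 75, 10, 25, 78, 0.01

z = (x_bar - mu0) / (sigma / sqrt(n))
z_crit = norm.ppf(1 - alpha)   # right-tailed: whole alpha in the upper tail
p_value = norm.sf(z)           # P(Z >= z) under H0

print(f"Z = {z:.2f}, critical Z = {z_crit:.3f}, p = {p_value:.4f}")
print("Reject H0" if z > z_crit else "Fail to reject H0")
```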

In [None]:
Problem 3: Two-Sample Z-Test

In [None]:
Scenario:
Company A claims its batteries last longer than Company B’s.

Sample A (n=50): Mean = 120h, σ = 10h

Sample B (n=50): Mean = 115h, σ = 12h
Test at α = 0.05.

In [None]:
Step 1: State Hypotheses
H₀: μ_A ≤ μ_B (A’s batteries last ≤ B’s)

H₁: μ_A > μ_B (A’s batteries last longer)

Step 2: Calculate Z-Score

Z = (x̄_A − x̄_B) / √(σ_A²/n_A + σ_B²/n_B) = (120 − 115) / √(10²/50 + 12²/50) = 5 / √4.88 ≈ 2.26

In [None]:
Step 3: Compare to Critical Value
For α = 0.05 (right-tailed), critical Z = 1.645.

Z = 2.26 > 1.645 → Reject H₀.

Conclusion
Company A’s batteries last significantly longer (p < 0.05).
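The two-sample statistic can be recomputed directly from the sample summaries. This is a minimal sketch that treats both σ values as known population standard deviations, as the scenario does; the variable names are assumptions made for illustration.

```python
from math import sqrt
from scipy.stats import norm

# Summaries taken from the Problem 3 scenario
mean_a, sigma_a, n_a = 120, 10, 50
mean_b, sigma_b, n_b = 115, 12, 50
alpha = 0.05

# Standard error of the difference between two independent sample means
se = sqrt(sigma_a**2 / n_a + sigma_b**2 / n_b)
z = (mean_a - mean_b) / se
z_crit = norm.ppf(1 - alpha)   # right-tailed critical value
p_value = norm.sf(z)           # right-tailed p-value

print(f"SE = {se:.3f}, Z = {z:.2f}, critical Z = {z_crit:.3f}, p = {p_value:.4f}")
print("Reject H0" if z > z_crit else "Fail to reject H0")
```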

In [None]:
Key Formulas Cheat Sheet

![image.png](attachment:cbbc2ee9-9147-46ea-95dc-31a89c7b9c27.png)

In [None]:
Practice Problem (Try It!)

In [None]:
Scenario:
A coffee machine should dispense 200ml per cup (σ = 5ml). You test 40 cups and find an average of 198ml.
Test at α = 0.01 if the machine is underfilling.

In [None]:
Steps:

H₀: μ ≥ 200ml

H₁: μ < 200ml (Left-tailed test)

Calculate Z:

Z = (x̄ − μ₀) / (σ / √n) = (198 − 200) / (5 / √40) = −2 / 0.7906 ≈ −2.53





In [None]:
Critical Z (α=0.01, left-tailed): -2.326

Decision: Z (-2.53) < -2.326 → Reject H₀.

Answer: The machine is significantly underfilling (p < 0.01).
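The one-sample problems above differ only in which tail is tested, so they can share one helper. The function below is a hypothetical convenience wrapper (`one_sample_z_test` is not a SciPy function), shown here on the coffee machine data.

```python
from math import sqrt
from scipy.stats import norm

def one_sample_z_test(x_bar, mu0, sigma, n, alternative="two-sided"):
    """Return (z, p_value) for a one-sample Z-test with known sigma."""
    z = (x_bar - mu0) / (sigma / sqrt(n))
    if alternative == "less":          # H1: mu < mu0 (left-tailed)
        p = norm.cdf(z)
    elif alternative == "greater":     # H1: mu > mu0 (right-tailed)
        p = norm.sf(z)
    elif alternative == "two-sided":   # H1: mu != mu0
        p = 2 * norm.sf(abs(z))
    else:
        raise ValueError("alternative must be 'less', 'greater' or 'two-sided'")
    return z, p

# Practice problem: is the machine underfilling?
z, p_value = one_sample_z_test(198, 200, 5, 40, alternative="less")
print(f"Z = {z:.2f}, p = {p_value:.4f}")
print("Reject H0" if p_value < 0.01 else "Fail to reject H0")
```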

In [None]:
T-Test vs. Chi-Squared Test

In [None]:
1. T-Test (Comparing Means)

In [None]:
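When to Use a Z-Test vs. a T-Test?

Use a Z-test when:

Population standard deviation (σ) is known.

Sample size is large (n ≥ 30), or the data is normally distributed, so that the sample mean is approximately normal (CLT).

Use a T-test when σ is unknown and must be estimated from the sample standard deviation (s). The difference matters most for small samples (n < 30).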
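One way to see why the choice between a Z-test and a T-test matters mainly for small samples is to compare their critical values. The sketch below prints the two-tailed 5% cutoffs for a few sample sizes; the sample sizes are arbitrary illustrations.

```python
from scipy.stats import norm, t

alpha = 0.05
z_crit = norm.ppf(1 - alpha / 2)   # same for every sample size

for n in (5, 10, 30, 100):
    # The t distribution has n - 1 degrees of freedom for a one-sample test
    t_crit = t.ppf(1 - alpha / 2, df=n - 1)
    print(f"n = {n:>3}: t critical = {t_crit:.3f}, z critical = {z_crit:.3f}")
```

The t cutoff is always larger than the z cutoff and shrinks toward it as n grows, which is why the two tests give nearly the same answer for large samples.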

In [None]:
Types of T-Tests

| Test Type | Use Case | Python Function (scipy.stats) |
| --- | --- | --- |
| One-Sample | Compare sample mean to a known value | ttest_1samp |
| Independent | Compare means of two unrelated groups | ttest_ind |
| Paired | Compare means of the same group before/after treatment | ttest_rel |


In [None]:
Example: Independent T-Test

In [1]:
from scipy.stats import ttest_ind

group1 = [22, 25, 30, 28, 26]  # e.g., test scores of Group A
group2 = [18, 20, 25, 22, 21]  # Group B

t_stat, p_value = ttest_ind(group1, group2)
print(f"T-statistic: {t_stat:.2f}, P-value: {p_value:.4f}")

T-statistic: 2.80, P-value: 0.0231


In [None]:
Interpretation: If p_value < 0.05, reject H₀ (the group means differ significantly). Here p = 0.0231 < 0.05, so H₀ is rejected.
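By default `ttest_ind` assumes both groups share the same variance. When that assumption is doubtful, passing `equal_var=False` runs Welch's t-test instead. A small sketch on the same data, offered as an illustration rather than a recommendation for this dataset:

```python
from scipy.stats import ttest_ind

group1 = [22, 25, 30, 28, 26]
group2 = [18, 20, 25, 22, 21]

# Student's t-test (pooled variance, the default)
t_pooled, p_pooled = ttest_ind(group1, group2)

# Welch's t-test (does not assume equal variances)
t_welch, p_welch = ttest_ind(group1, group2, equal_var=False)

print(f"Pooled: t = {t_pooled:.2f}, p = {p_pooled:.4f}")
print(f"Welch:  t = {t_welch:.2f}, p = {p_welch:.4f}")
```

With equal group sizes the two t-statistics coincide; only the degrees of freedom, and therefore the p-value, differ.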

In [None]:
2. Chi-Squared Test (Testing Relationships)

In [None]:
When to Use?
Test relationships between categorical variables (e.g., gender vs. voting preference).

Two types:

Chi-Squared Goodness-of-Fit: Compare observed vs. expected frequencies in one categorical variable.

Chi-Squared Test of Independence: Check if two categorical variables are independent.

Key Assumptions
Observations are independent.

Expected frequency in each cell ≥ 5 (otherwise, use Fisher’s Exact Test).

Example: Chi-Squared Test of Independence

In [2]:
from scipy.stats import chi2_contingency

# Contingency table (e.g., gender vs. product preference)
data = [[30, 10],  # 30 men prefer Product A, 10 prefer B
        [20, 40]]   # 20 women prefer A, 40 prefer B

chi2, p, dof, expected = chi2_contingency(data)
print(f"Chi2: {chi2:.2f}, P-value: {p:.4f}")

Chi2: 15.04, P-value: 0.0001


In [None]:
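Interpretation: If p_value < 0.05, reject the hypothesis of independence: the variables are associated (e.g., product preference differs by gender). Here p = 0.0001, so the association is significant. This shows association, not causation.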
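Two details of `chi2_contingency` are worth checking in practice: the expected counts it returns (for the "≥ 5" assumption), and the fact that it applies Yates' continuity correction by default on 2×2 tables. The sketch below shows both on the same table.

```python
import numpy as np
from scipy.stats import chi2_contingency

data = np.array([[30, 10],
                 [20, 40]])

# Default call: Yates' continuity correction is applied for 2x2 tables
chi2_yates, p_yates, dof, expected = chi2_contingency(data)

# Same test without the continuity correction
chi2_raw, p_raw, _, _ = chi2_contingency(data, correction=False)

print("Expected counts:\n", expected)
print("All expected counts >= 5:", bool(expected.min() >= 5))
print(f"With correction:    chi2 = {chi2_yates:.2f}, p = {p_yates:.4f}")
print(f"Without correction: chi2 = {chi2_raw:.2f}, p = {p_raw:.4f}")
```

The corrected statistic is slightly smaller and more conservative; for a hand calculation with Σ(O − E)²/E, the uncorrected value is the one to compare against.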

In [None]:
3. Key Differences

In [None]:
| Feature | T-Test | Chi-Squared Test |
| --- | --- | --- |
| Data Type | Continuous (means) | Categorical (counts/frequencies) |
| Purpose | Compare means | Test relationships/independence |
| Example Use | Compare exam scores of 2 groups | Check if smoking is linked to cancer |
| Assumptions | Normality, equal variance | Expected frequencies ≥ 5 |
| Python Function | scipy.stats.ttest_* | scipy.stats.chi2_contingency |

In [None]:
4. Practical Examples

In [None]:
A) One-Sample T-Test (Is the population mean = 100?)

In [3]:
from scipy.stats import ttest_1samp

data = [98, 102, 105, 97, 100]
t_stat, p_value = ttest_1samp(data, popmean=100)
print(f"P-value: {p_value:.4f}")  # If > 0.05, no significant difference.

P-value: 0.7943


In [None]:
B) Paired T-Test (Before vs. After Treatment)

In [None]:
from scipy.stats import ttest_rel

before = [70, 72, 68, 75]  # Blood pressure before
after = [65, 70, 62, 68]   # Blood pressure after
t_stat, p_value = ttest_rel(before, after)
print(f"P-value: {p_value:.4f}")  # If < 0.05, treatment had an effect.
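A paired t-test is equivalent to a one-sample t-test on the per-subject differences, tested against a mean of zero. The sketch below illustrates that equivalence on the same blood pressure data; converting the lists to NumPy arrays is a choice made here for the subtraction.

```python
import numpy as np
from scipy.stats import ttest_rel, ttest_1samp

before = np.array([70, 72, 68, 75])  # Blood pressure before
after = np.array([65, 70, 62, 68])   # Blood pressure after

diffs = before - after               # per-subject change

t_paired, p_paired = ttest_rel(before, after)
t_diff, p_diff = ttest_1samp(diffs, popmean=0)

print(f"Paired:         t = {t_paired:.3f}, p = {p_paired:.4f}")
print(f"On differences: t = {t_diff:.3f}, p = {p_diff:.4f}")
```

This is also why using an independent t-test on paired data is a mistake: it ignores the pairing and throws away the within-subject information.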

In [None]:
C) Chi-Squared Goodness-of-Fit (Is the die fair?)

In [4]:
from scipy.stats import chisquare

observed = [10, 12, 8, 15, 9, 6]  # Observed counts for faces 1-6 (60 rolls)
expected = [10, 10, 10, 10, 10, 10]  # Expected counts if the die is fair
chi2, p_value = chisquare(observed, f_exp=expected)
print(f"P-value: {p_value:.4f}")  # If < 0.05, the die is biased.

P-value: 0.4159
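The goodness-of-fit statistic is just Σ(O − E)²/E with (number of categories − 1) degrees of freedom. The sketch below computes it by hand for the die data and compares it with `chisquare`; the variable names are illustrative.

```python
import numpy as np
from scipy.stats import chi2, chisquare

observed = np.array([10, 12, 8, 15, 9, 6])
expected = np.full(6, observed.sum() / 6)   # fair die: equal expected counts

# Manual computation of the chi-squared statistic
stat = ((observed - expected) ** 2 / expected).sum()
dof = len(observed) - 1
p_manual = chi2.sf(stat, dof)               # upper-tail probability

# Library computation for comparison
stat_scipy, p_scipy = chisquare(observed, f_exp=expected)

print(f"Manual: chi2 = {stat:.2f}, dof = {dof}, p = {p_manual:.4f}")
print(f"SciPy:  chi2 = {stat_scipy:.2f}, p = {p_scipy:.4f}")
```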


In [None]:
5. Common Pitfalls

In [None]:
T-Test Mistakes
❌ Using independent t-test for paired data.
❌ Ignoring non-normality in small samples (use Mann-Whitney U instead).

Chi-Squared Mistakes
❌ Using with expected counts < 5 (combine categories or use Fisher’s test).
❌ Assuming causation from significant results (only shows association).
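Both fallbacks named above are available in `scipy.stats`. The sketch below runs Mann-Whitney U on the earlier two-group data and Fisher's exact test on a small 2×2 table; the table counts are hypothetical and chosen only so that some expected counts fall below 5.

```python
from scipy.stats import mannwhitneyu, fisher_exact

# Non-parametric alternative to the independent t-test
group1 = [22, 25, 30, 28, 26]
group2 = [18, 20, 25, 22, 21]
u_stat, p_mw = mannwhitneyu(group1, group2, alternative="two-sided")
print(f"Mann-Whitney U: U = {u_stat:.1f}, p = {p_mw:.4f}")

# Exact alternative to chi-squared for a sparse 2x2 table (hypothetical counts)
small_table = [[3, 1],
               [1, 4]]
odds_ratio, p_fisher = fisher_exact(small_table)
print(f"Fisher's exact: odds ratio = {odds_ratio:.1f}, p = {p_fisher:.4f}")
```

Note that `fisher_exact` only handles 2×2 tables; for larger sparse tables, combining categories is the usual workaround.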

In [None]:
Summary

In [None]:
Use T-tests to compare means (continuous data).

Use Chi-Squared tests to analyze categorical relationships.

Always check assumptions before interpreting results.