
### **1. One-Sample T-Test**

**Scenario:**
A nutritionist claims that the average daily sodium intake of a certain population is 2300 mg. To test this claim, she collects a random sample of 10 individuals from the population and records their daily sodium intake as follows (in mg): 2400, 2200, 2250, 2350, 2400, 2500, 2100, 2300, 2450, 2350.

**Objective:**
Determine if the average daily sodium intake of this population is significantly different from the claimed 2300 mg.

**Hypotheses:**
- Null Hypothesis ($H_0$): The average sodium intake is 2300 mg ($\mu = 2300$).
- Alternative Hypothesis ($H_1$): The average sodium intake is not 2300 mg ($\mu \neq 2300$).

**Solution:**

1. **Calculate the sample mean ($\bar{x}$):**

   $$
   \bar{x} = \frac{2400 + 2200 + 2250 + 2350 + 2400 + 2500 + 2100 + 2300 + 2450 + 2350}{10} = 2330 \text{ mg}
   $$

2. **Calculate the sample standard deviation ($s$):**

   $$
   s = \sqrt{\frac{\sum_{i=1}^{n}(x_i - \bar{x})^2}{n-1}} = \sqrt{\frac{(2400 - 2330)^2 + \dots + (2350 - 2330)^2}{10-1}} = \sqrt{\frac{131000}{9}} \approx 120.65 \text{ mg}
   $$

3. **Calculate the t-statistic:**

   $$
   t = \frac{\bar{x} - \mu}{\frac{s}{\sqrt{n}}} = \frac{2330 - 2300}{\frac{120.65}{\sqrt{10}}} \approx 0.79
   $$

4. **Determine the critical t-value:**
   - For a two-tailed test with $n-1 = 9$ degrees of freedom and $\alpha = 0.05$, the critical t-value is approximately 2.262.

5. **Decision:**
   - Since the absolute value of the calculated t-value (0.79) is less than the critical t-value (2.262), we fail to reject the null hypothesis. There is not enough evidence to conclude that the average daily sodium intake differs from 2300 mg.
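
The hand calculation above can be checked step by step. The following is a minimal sketch using only the Python standard library; the variable names (`x_bar`, `sum_sq_dev`, `std_err`, and so on) are illustrative choices, and the critical value is taken from the table lookup in step 4. The results should agree with the `scipy` output reported for this example (s ≈ 120.65, t ≈ 0.79).

```python
import math

# Daily sodium intake (mg) from the scenario above
data = [2400, 2200, 2250, 2350, 2400, 2500, 2100, 2300, 2450, 2350]
mu_0 = 2300  # claimed population mean
n = len(data)

# Step 1: sample mean
x_bar = sum(data) / n

# Step 2: sample standard deviation (divide by n - 1, not n)
sum_sq_dev = sum((x - x_bar) ** 2 for x in data)
s = math.sqrt(sum_sq_dev / (n - 1))

# Step 3: standard error of the mean and t-statistic
std_err = s / math.sqrt(n)
t_stat = (x_bar - mu_0) / std_err

# Steps 4-5: compare |t| with the tabulated critical value (df = 9, alpha = 0.05)
t_crit = 2.262
reject_h0 = abs(t_stat) > t_crit

print(f"mean = {x_bar:.2f}, s = {s:.2f}, t = {t_stat:.3f}, reject H0: {reject_h0}")
```

Printing the intermediate sum of squared deviations (`sum_sq_dev`) is a quick way to locate an arithmetic slip when a hand-computed standard deviation disagrees with the library value.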

---


In [2]:
import numpy as np
from scipy import stats

# Sample data
data = np.array([2400, 2200, 2250, 2350, 2400, 2500, 2100, 2300, 2450, 2350])
claimed_mean = 2300

# Calculate sample mean and standard deviation
sample_mean = np.mean(data)
sample_std = np.std(data, ddof=1)  # ddof=1 for sample standard deviation
sample_size = len(data)

# Perform the one-sample t-test
t_statistic, p_value = stats.ttest_1samp(data, claimed_mean)

print(f"Sample Mean: {sample_mean:.2f}")
print(f"Sample Standard Deviation: {sample_std:.2f}")
print(f"T-Statistic: {t_statistic:.2f}")
print(f"P-Value: {p_value:.4f}")

# Interpret the result
alpha = 0.05  # Significance level
if p_value < alpha:
    print("Reject the null hypothesis. The average daily sodium intake is significantly different from 2300 mg.")
else:
    print("Fail to reject the null hypothesis. There is not enough evidence to suggest that the average daily sodium intake is different from 2300 mg.")


Sample Mean: 2330.00
Sample Standard Deviation: 120.65
T-Statistic: 0.79
P-Value: 0.4519
Fail to reject the null hypothesis. There is not enough evidence to suggest that the average daily sodium intake is different from 2300 mg.


In [None]:
from scipy import stats

# Given values
alpha = 0.05  # significance level
df = 9        # degrees of freedom

# Calculate the t-value for a two-tailed test
t_value = stats.t.ppf(1 - alpha/2, df)

print(f"Critical t-value: {t_value:.3f}")


Critical t-value: 2.262
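
The worked solution decides by comparing the t-statistic with a critical value, while the first code cell decides by comparing the p-value with $\alpha$. A short sketch of how the two rules relate is given below, assuming the rounded t-statistic 0.786 from the one-sample example; `stats.t.sf` is the survival function (1 minus the CDF) of the t-distribution.

```python
from scipy import stats

t_stat = 0.786  # t-statistic from the one-sample example (rounded)
df = 9          # degrees of freedom, n - 1
alpha = 0.05    # significance level

# Two-tailed p-value: probability of a |t| at least this large under H0
p_value = 2 * stats.t.sf(abs(t_stat), df)

# Critical value for the same alpha
t_crit = stats.t.ppf(1 - alpha / 2, df)

# The two decision rules agree: |t| > t_crit exactly when p < alpha
print(f"p-value: {p_value:.4f}")
print(f"|t| > t_crit: {abs(t_stat) > t_crit}, p < alpha: {p_value < alpha}")
```

Because the t-statistic is rounded here, the p-value may differ from the `ttest_1samp` result in the last decimal place.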



### **2. Independent T-Test**

**Scenario:**
A researcher wants to compare the exam scores of two groups of students who were taught using different teaching methods. Group A (n = 15) received traditional instruction, and Group B (n = 15) received online instruction. The exam scores are as follows:

- Group A: 78, 85, 82, 88, 90, 76, 80, 84, 79, 87, 91, 82, 83, 85, 86
- Group B: 81, 79, 80, 83, 85, 77, 75, 78, 82, 80, 86, 83, 81, 79, 77

**Objective:**
Determine if there is a significant difference in exam scores between the two groups.

**Hypotheses:**
- Null Hypothesis ($H_0$): The mean exam scores of the two groups are equal ($\mu_A = \mu_B$).
- Alternative Hypothesis ($H_1$): The mean exam scores of the two groups are not equal ($\mu_A \neq \mu_B$).

**Solution:**

1. **Calculate the means ($\bar{x}_A$ and $\bar{x}_B$):**

   $$
   \bar{x}_A = \frac{78 + 85 + \dots + 86}{15} = \frac{1256}{15} \approx 83.73
   $$
   $$
   \bar{x}_B = \frac{81 + 79 + \dots + 77}{15} = \frac{1206}{15} = 80.40
   $$

2. **Calculate the standard deviations ($s_A$ and $s_B$):**

   $$
   s_A = \sqrt{\frac{264.93}{14}} \approx 4.35
   $$
   $$
   s_B = \sqrt{\frac{131.60}{14}} \approx 3.07
   $$

3. **Calculate the pooled standard deviation ($s_p$):**

   $$
   s_p = \sqrt{\frac{(n_A-1)s_A^2 + (n_B-1)s_B^2}{n_A + n_B - 2}} = \sqrt{\frac{(14)(4.35)^2 + (14)(3.07)^2}{28}} \approx 3.76
   $$

4. **Calculate the t-statistic:**

   $$
   t = \frac{\bar{x}_A - \bar{x}_B}{s_p \sqrt{\frac{1}{n_A} + \frac{1}{n_B}}} = \frac{83.73 - 80.40}{3.76 \times \sqrt{\frac{1}{15} + \frac{1}{15}}} \approx 2.43
   $$

5. **Determine the critical t-value:**
   - For a two-tailed test with $n_A + n_B - 2 = 28$ degrees of freedom and $\alpha = 0.05$, the critical t-value is approximately 2.048.

6. **Decision:**
   - Since the calculated t-value (2.43) is greater than the critical t-value (2.048), we reject the null hypothesis. There is sufficient evidence of a difference in mean exam scores between the two groups at the 5% significance level.
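
The pooled calculation above can be reproduced directly. This is a minimal sketch using the standard-library `statistics` module, whose `stdev` function divides by $n - 1$; the variable names are illustrative. The result should agree with the `scipy.stats.ttest_ind` output reported for this example (t ≈ 2.43).

```python
import math
import statistics

# Exam scores from the scenario above
group_a = [78, 85, 82, 88, 90, 76, 80, 84, 79, 87, 91, 82, 83, 85, 86]
group_b = [81, 79, 80, 83, 85, 77, 75, 78, 82, 80, 86, 83, 81, 79, 77]
n_a, n_b = len(group_a), len(group_b)

# Step 1: group means
mean_a = statistics.mean(group_a)
mean_b = statistics.mean(group_b)

# Step 2: sample standard deviations (n - 1 in the denominator)
s_a = statistics.stdev(group_a)
s_b = statistics.stdev(group_b)

# Step 3: pooled standard deviation
pooled_var = ((n_a - 1) * s_a**2 + (n_b - 1) * s_b**2) / (n_a + n_b - 2)
s_p = math.sqrt(pooled_var)

# Step 4: t-statistic
t_stat = (mean_a - mean_b) / (s_p * math.sqrt(1 / n_a + 1 / n_b))

print(f"means: {mean_a:.2f}, {mean_b:.2f}")
print(f"sds: {s_a:.2f}, {s_b:.2f}, pooled: {s_p:.2f}")
print(f"t = {t_stat:.2f}")
```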

---


In [None]:
import numpy as np
from scipy import stats

# Exam scores for the two groups
group_A = np.array([78, 85, 82, 88, 90, 76, 80, 84, 79, 87, 91, 82, 83, 85, 86])
group_B = np.array([81, 79, 80, 83, 85, 77, 75, 78, 82, 80, 86, 83, 81, 79, 77])

# Perform the independent t-test
t_statistic, p_value = stats.ttest_ind(group_A, group_B)

print(f"T-Statistic: {t_statistic:.2f}")
print(f"P-Value: {p_value:.4f}")

# Interpret the result
alpha = 0.05  # Significance level
if p_value < alpha:
    print("Reject the null hypothesis. There is a significant difference in exam scores between the two groups.")
else:
    print("Fail to reject the null hypothesis. There is no significant difference in exam scores between the two groups.")


T-Statistic: 2.43
P-Value: 0.0220
Reject the null hypothesis. There is a significant difference in exam scores between the two groups.
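
By default, `stats.ttest_ind` assumes equal variances (`equal_var=True`), which corresponds to the pooled formula in the worked solution. As a rough sensitivity check, and not part of the original analysis, the sketch below also runs the test with `equal_var=False` (Welch's t-test), which does not assume equal variances.

```python
import numpy as np
from scipy import stats

group_A = np.array([78, 85, 82, 88, 90, 76, 80, 84, 79, 87, 91, 82, 83, 85, 86])
group_B = np.array([81, 79, 80, 83, 85, 77, 75, 78, 82, 80, 86, 83, 81, 79, 77])

# Pooled-variance test (the default) and Welch's test
pooled = stats.ttest_ind(group_A, group_B, equal_var=True)
welch = stats.ttest_ind(group_A, group_B, equal_var=False)

print(f"Pooled: t = {pooled.statistic:.2f}, p = {pooled.pvalue:.4f}")
print(f"Welch:  t = {welch.statistic:.2f}, p = {welch.pvalue:.4f}")
```

With equal group sizes, as here, the two t-statistics coincide; only the degrees of freedom, and therefore the p-value, differ.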



### **3. Paired T-Test**

**Scenario:**
A fitness coach wants to determine if a new training program is effective in reducing body fat percentage. The coach measures the body fat percentage of 8 participants before and after the 12-week program.

**Data:**

| Participant | Before (%) | After (%) |
|-------------|------------|-----------|
| 1           | 24.0       | 22.5      |
| 2           | 27.0       | 25.0      |
| 3           | 30.0       | 28.0      |
| 4           | 26.0       | 24.5      |
| 5           | 29.0       | 27.5      |
| 6           | 32.0       | 29.0      |
| 7           | 25.0       | 23.0      |
| 8           | 28.0       | 26.0      |

**Objective:**
Determine if the training program leads to a significant reduction in body fat percentage.

**Hypotheses:**
- Null Hypothesis ($H_0$): The mean difference in body fat percentage before and after the program is zero ($\mu_D = 0$).
- Alternative Hypothesis ($H_1$): The mean difference in body fat percentage before and after the program is not zero ($\mu_D \neq 0$).

**Solution:**

1. **Calculate the differences ($D$) and the mean difference ($\bar{D}$):**

   $$
   D = \text{Before} - \text{After}
   $$
   $$
   D = [1.5, 2.0, 2.0, 1.5, 1.5, 3.0, 2.0, 2.0]
   $$
   $$
   \bar{D} = \frac{1.5 + 2.0 + 2.0 + 1.5 + 1.5 + 3.0 + 2.0 + 2.0}{8} = 1.9375
   $$

2. **Calculate the standard deviation of differences ($s_D$):**

   $$
   s_D = \sqrt{\frac{\sum_{i=1}^{n}(D_i - \bar{D})^2}{n-1}} = \sqrt{\frac{(1.5 - 1.9375)^2 + \dots + (2.0 - 1.9375)^2}{7}} = \sqrt{\frac{1.71875}{7}} \approx 0.4955
   $$

3. **Calculate the t-statistic:**

   $$
   t = \frac{\bar{D}}{\frac{s_D}{\sqrt{n}}} = \frac{1.9375}{\frac{0.4955}{\sqrt{8}}} \approx 11.06
   $$

4. **Determine the critical t-value:**
   - For a two-tailed test with $n-1 = 7$ degrees of freedom and $\alpha = 0.05$, the critical t-value is approximately 2.365.

5. **Decision:**
   - Since the calculated t-value (11.06) is much greater than the critical t-value (2.365), we reject the null hypothesis. Because the mean difference $\bar{D}$ (Before minus After) is positive, the evidence indicates that body fat percentage was lower after the training program.
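
The paired calculation reduces to a one-sample calculation on the differences. A minimal standard-library sketch of the steps above follows; the variable names are illustrative. The result should agree with the `scipy.stats.ttest_rel` output reported for this example (t ≈ 11.06).

```python
import math
import statistics

# Body fat percentages from the table above
before = [24.0, 27.0, 30.0, 26.0, 29.0, 32.0, 25.0, 28.0]
after = [22.5, 25.0, 28.0, 24.5, 27.5, 29.0, 23.0, 26.0]

# Step 1: per-participant differences (Before - After) and their mean
diffs = [b - a for b, a in zip(before, after)]
n = len(diffs)
d_bar = statistics.mean(diffs)

# Step 2: sample standard deviation of the differences (n - 1 denominator)
s_d = statistics.stdev(diffs)

# Step 3: t-statistic for H0: mean difference = 0
t_stat = d_bar / (s_d / math.sqrt(n))

print(f"differences: {diffs}")
print(f"mean difference = {d_bar:.4f}, s_D = {s_d:.4f}, t = {t_stat:.2f}")
```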

---

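Together, these three examples show when each t-test applies: the one-sample test compares a sample mean with a claimed value, the independent test compares the means of two unrelated groups, and the paired test compares two measurements taken on the same subjects.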

In [None]:
import numpy as np
from scipy import stats

# Body fat percentages before and after the training program
before = np.array([24.0, 27.0, 30.0, 26.0, 29.0, 32.0, 25.0, 28.0])
after = np.array([22.5, 25.0, 28.0, 24.5, 27.5, 29.0, 23.0, 26.0])

# Perform the paired t-test
t_statistic, p_value = stats.ttest_rel(before, after)

print(f"T-Statistic: {t_statistic:.2f}")
print(f"P-Value: {p_value:.4f}")

# Interpret the result
alpha = 0.05  # Significance level
if p_value < alpha:
    print("Reject the null hypothesis. The training program leads to a significant reduction in body fat percentage.")
else:
    print("Fail to reject the null hypothesis. There is no significant evidence that the training program reduces body fat percentage.")


T-Statistic: 11.06
P-Value: 0.0000
Reject the null hypothesis. The training program leads to a significant reduction in body fat percentage.
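
Two follow-up checks may be useful here; both are sketches rather than part of the original analysis. First, a paired t-test is equivalent to a one-sample t-test on the differences against zero. Second, since the stated objective concerns a *reduction*, a one-sided test is a natural variant; this assumes a SciPy version that supports the `alternative` parameter (1.6 or later). Scientific notation is used because the p-value is too small to show with four decimal places.

```python
import numpy as np
from scipy import stats

before = np.array([24.0, 27.0, 30.0, 26.0, 29.0, 32.0, 25.0, 28.0])
after = np.array([22.5, 25.0, 28.0, 24.5, 27.5, 29.0, 23.0, 26.0])

# Paired test and the equivalent one-sample test on the differences
paired = stats.ttest_rel(before, after)
one_sample = stats.ttest_1samp(before - after, 0.0)

# One-sided variant: H1 is that the mean of (before - after) is greater than 0
one_sided = stats.ttest_rel(before, after, alternative="greater")

print(f"Paired t = {paired.statistic:.2f}, one-sample t = {one_sample.statistic:.2f}")
print(f"Two-sided p-value: {paired.pvalue:.2e}")
print(f"One-sided p-value: {one_sided.pvalue:.2e}")
```

When the observed mean difference is in the hypothesized direction, as it is here, the one-sided p-value is half the two-sided one.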
