# **Paired Samples/Dependent Samples**

Paired samples, also known as dependent samples or matched samples, refer to pairs of observations that are related in some way. This relationship can be due to the same subjects being measured under different conditions or times, or due to the matching of subjects based on specific criteria. The key idea is that the observations within each pair are not independent of each other.


## **Examples of Paired Samples**

1. **Before-and-after Studies**
    - Measuring the same subjects before and after a treatment or intervention. For example, measuring blood pressure before and after administering a new drug to the same group of patients.

2. **Matched Subjects**
    - Matching subjects based on certain characteristics (e.g., age, gender) and then assigning each subject in the pair to different conditions. For example, pairing students based on their pre-test scores and then assigning them to different teaching methods.

3. **Repeated Measures**
    - Measuring the same subjects under different conditions or at different times. For example, measuring the performance of the same employees on different tasks or at different times of the year.




## **Paired Sample t-Test**

A paired sample t-test is used to determine whether the mean difference between paired observations is significantly different from zero. It is often used in before-and-after studies or repeated measures designs.

**Hypotheses**

- Let $D_1, \ldots, D_n$ be a small random sample ($n \le 30$) of the differences within pairs.
    - **Null Hypothesis (H0)**: The mean difference between the paired observations is zero ($\mu_D = 0$).
    - **Alternative Hypothesis (H1)**: The mean difference between the paired observations is not zero ($\mu_D \ne 0$).


**Test Statistic**

The test statistic of a paired sample t-test is calculated as: 
$$
t = \frac{\bar{D}}{s_D / \sqrt{n}}
$$
where:
- $\bar{D}$ = mean of the differences between paired observations
- $s_D$ = standard deviation of the differences
- $n$ = number of pairs
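
A worked example may make the formula concrete. It uses the training scores from the code example at the end of this section, with each difference taken as after minus before, giving the seven differences 8, 5, 5, 3, 2, 1, 5. Here $s_D$ is assumed to be the sample standard deviation (divisor $n - 1$), and all values are rounded to three decimals.

$$
\bar{D} = \frac{8 + 5 + 5 + 3 + 2 + 1 + 5}{7} = \frac{29}{7} \approx 4.143
$$

$$
s_D = \sqrt{\frac{\sum_{i=1}^{n} (D_i - \bar{D})^2}{n - 1}} = \sqrt{\frac{32.857}{6}} \approx 2.340
$$

$$
t = \frac{\bar{D}}{s_D / \sqrt{n}} = \frac{4.143}{2.340 / \sqrt{7}} \approx 4.684
$$

Subtracting in the opposite order (before minus after) flips the sign of $t$ but leaves its magnitude unchanged.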

**Steps in a Paired Sample t-Test**

1. State the hypotheses:
     - H0: The mean difference between paired observations is zero.
     - H1: The mean difference between paired observations is not zero.

2. Calculate the difference between each pair of observations.
3. Use the mean and standard deviation of the differences to calculate the t-statistic.
4. Compare the t-statistic to the t-distribution with n - 1 degrees of freedom to find the p-value.
5. Compare the p-value to the significance level (α). If the p-value is less than or equal to α, reject H0.
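
The steps above can be sketched by hand as follows. This is a minimal illustration, not a replacement for a library routine: `paired_t_test` is a hypothetical helper name, the test is assumed to be two-sided, and the differences are assumed to be taken as second sample minus first. The data are the training scores used elsewhere in this section.

```python
import numpy as np
from scipy import stats


def paired_t_test(x, y, alpha=0.05):
    """Two-sided paired t-test on the differences y - x (hypothetical helper)."""
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)

    # Step 2: difference within each pair
    d = y - x
    n = d.size

    # Step 3: t-statistic from the mean and sample standard deviation (ddof=1)
    t_stat = d.mean() / (d.std(ddof=1) / np.sqrt(n))

    # Step 4: two-sided p-value from the t-distribution with n - 1 degrees of freedom
    p_value = 2 * stats.t.sf(abs(t_stat), df=n - 1)

    # Step 5: reject H0 when the p-value is at most alpha
    reject = p_value <= alpha
    return t_stat, p_value, reject


before = [70, 75, 80, 85, 90, 95, 100]
after = [78, 80, 85, 88, 92, 96, 105]

t_stat, p_value, reject = paired_t_test(before, after)
print(f"t = {t_stat:.3f}, p = {p_value:.4f}, reject H0: {reject}")
```

The result should agree with `scipy.stats.ttest_rel(after, before)`, which performs the same calculation in one call.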




In [None]:
from scipy.stats import ttest_rel

# Scores for the same seven subjects before and after training
before = [70, 75, 80, 85, 90, 95, 100]
after = [78, 80, 85, 88, 92, 96, 105]

# Perform a two-sided paired t-test on the differences (after - before),
# so a positive t-statistic indicates higher scores after training
t_stat, p_value = ttest_rel(after, before)
print(f"T-statistic: {t_stat:.3f}")
print(f"P-value: {p_value:.4f}")

# Decision based on significance level
alpha = 0.05
if p_value <= alpha:
    print("Reject the null hypothesis (H0).")
    print("The mean test score differs significantly before and after training.")
else:
    print("Fail to reject the null hypothesis (H0).")
    print("There is not enough evidence that training changes the mean test score.")
