A T-test is a statistical test used to determine whether the difference between two means is statistically significant, either between two groups or between a sample mean and a reference value. The T-test is more appropriate than the Z-test when:

    The sample size is small (typically n < 30).
    The population standard deviation is unknown.
    Data is assumed to be normally distributed.
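
To illustrate why small samples call for the T-test, here is a minimal sketch that compares two-sided 5% critical values from the t-distribution with the corresponding normal (Z) value. The sample sizes are arbitrary illustrative choices, not values from this notebook.

```python
from scipy.stats import norm, t

# Two-sided test at the 5% level: 2.5% in each tail.
z_crit = norm.ppf(0.975)

# Illustrative sample sizes (assumed); degrees of freedom are n - 1.
for n in (5, 15, 30, 100):
    t_crit = t.ppf(0.975, df=n - 1)
    print(f"n={n:>3}: t critical = {t_crit:.3f}, z critical = {z_crit:.3f}")
```

The t critical value is larger than the Z value for small n and approaches it as n grows, which is why the two tests give similar answers for large samples.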

Types of T-tests:

    One-Sample T-Test: Compare the mean of a single sample to a known population mean.
    Two-Sample (Independent) T-Test: Compare the means of two independent groups.
    Paired Sample T-Test: Compare means from the same group at different times (before/after scenarios).

Example 1: One-Sample T-Test
Scenario:

You are analyzing the alcohol content in a sample of white wines. The known average alcohol content in white wine is 10%. You collected a sample of 15 wines; the sample mean is about 10.4% with a sample standard deviation of about 0.21%. You want to test whether this sample mean differs significantly from the known average.

![1_SAMPLE_T_TEST.png](attachment:49849a63-5008-4e55-9154-4246cff29dc5.png)
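
For reference, the one-sample statistic can be written out as follows. The symbols $\bar{x}$ (sample mean), $\mu_0$ (reference mean), $s$ (sample standard deviation), and $n$ (sample size) are notation assumed here, and the plugged-in values are rounded from the 15 measurements used in this example.

$$
t = \frac{\bar{x} - \mu_0}{s / \sqrt{n}} \approx \frac{10.4067 - 10}{0.2052 / \sqrt{15}} \approx 7.68, \qquad \text{df} = n - 1 = 14
$$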

In [1]:
from scipy.stats import ttest_1samp

# Sample data (alcohol content in %)
alcohol_content = [10.5, 10.2, 10.3, 10.6, 10.4, 10.1, 10.7, 10.3, 10.2, 10.5, 10.6, 10.3, 10.4, 10.8, 10.2]

# Perform One-Sample T-Test
t_statistic, p_value = ttest_1samp(alcohol_content, popmean=10)

print(f"T-statistic: {t_statistic}")
print(f"P-value: {p_value}")


T-statistic: 7.676578942041402
P-value: 2.206574993135812e-06


Interpretation:

    T-statistic: The T-statistic is about 7.68, meaning the sample mean lies about 7.68 standard errors above the population mean of 10%.

    P-value: The p-value is about 2.2e-06 (0.0000022), which is far below 0.05.

    Conclusion: Since the p-value is very small, we reject the null hypothesis. This suggests that the alcohol content in the sample is significantly different from the known average of 10%.
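
As a cross-check on the interpretation above, the statistic can be reproduced by hand. This is a minimal sketch assuming a two-sided test with n - 1 degrees of freedom; the variable names are illustrative.

```python
import numpy as np
from scipy.stats import t, ttest_1samp

alcohol_content = [10.5, 10.2, 10.3, 10.6, 10.4, 10.1, 10.7, 10.3,
                   10.2, 10.5, 10.6, 10.3, 10.4, 10.8, 10.2]
popmean = 10

n = len(alcohol_content)
sample_mean = np.mean(alcohol_content)
sample_sd = np.std(alcohol_content, ddof=1)   # sample SD (n - 1 denominator)
standard_error = sample_sd / np.sqrt(n)

# Number of standard errors between the sample mean and the reference mean.
t_manual = (sample_mean - popmean) / standard_error
# Two-sided p-value from the t-distribution with n - 1 degrees of freedom.
p_manual = 2 * t.sf(abs(t_manual), df=n - 1)

result = ttest_1samp(alcohol_content, popmean=popmean)
print(f"manual: t = {t_manual:.4f}, p = {p_manual:.3e}")
print(f"scipy:  t = {result.statistic:.4f}, p = {result.pvalue:.3e}")
```

Both lines should agree, which confirms that the reported T-statistic is the distance between the sample mean and 10% measured in standard errors.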

Example 2: Two-Sample (Independent) T-Test
Scenario:

You want to compare the alcohol content between white wines and red wines to see if there's a significant difference.

    White Wine Alcohol Content:
    [10.5, 10.2, 10.3, 10.6, 10.4, 10.1, 10.7, 10.3, 10.2, 10.5]

    Red Wine Alcohol Content:
    [9.8, 9.9, 9.7, 9.5, 9.6, 9.8, 9.9, 9.7, 9.6, 9.5]

![T_TEST_2.png](attachment:7e2ca077-a6fd-438a-8dc2-8b8dd8ab6f40.png)
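
For reference, `ttest_ind` assumes equal variances by default (`equal_var=True`) and uses a pooled variance. A sketch of that statistic, with notation assumed here ($\bar{x}_i$, $s_i^2$, and $n_i$ are the mean, sample variance, and size of group $i$) and values rounded from the two lists above:

$$
s_p^2 = \frac{(n_1 - 1)s_1^2 + (n_2 - 1)s_2^2}{n_1 + n_2 - 2} \approx \frac{9(0.0373) + 9(0.0222)}{18} \approx 0.0298
$$

$$
t = \frac{\bar{x}_1 - \bar{x}_2}{\sqrt{s_p^2 \left( \frac{1}{n_1} + \frac{1}{n_2} \right)}} \approx \frac{10.38 - 9.70}{\sqrt{0.0298 \times 0.2}} \approx 8.81, \qquad \text{df} = n_1 + n_2 - 2 = 18
$$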

In [2]:
from scipy.stats import ttest_ind

# Data for white and red wine alcohol content
white_wine = [10.5, 10.2, 10.3, 10.6, 10.4, 10.1, 10.7, 10.3, 10.2, 10.5]
red_wine = [9.8, 9.9, 9.7, 9.5, 9.6, 9.8, 9.9, 9.7, 9.6, 9.5]

# Perform Two-Sample T-Test
t_statistic, p_value = ttest_ind(white_wine, red_wine)

print(f"T-statistic: {t_statistic}")
print(f"P-value: {p_value}")


T-statistic: 8.811457940929865
P-value: 6.029265420881908e-08


Interpretation:

    T-statistic: The T-statistic is about 8.81, indicating that the difference between the group means is large relative to the variability within the groups.

    P-value: The p-value is about 6.03e-08 (0.0000000603), much smaller than 0.05.

    Conclusion: Since the p-value is extremely small, we reject the null hypothesis. This suggests that there is a significant difference in the alcohol content between white and red wines.
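
The call above uses the default `equal_var=True`, which assumes both groups share the same variance. If that assumption is in doubt, Welch's T-test (`equal_var=False`) is a common alternative. The sketch below runs both on the same data for comparison; it is an illustration and was not part of the original analysis.

```python
from scipy.stats import ttest_ind

white_wine = [10.5, 10.2, 10.3, 10.6, 10.4, 10.1, 10.7, 10.3, 10.2, 10.5]
red_wine = [9.8, 9.9, 9.7, 9.5, 9.6, 9.8, 9.9, 9.7, 9.6, 9.5]

# Pooled-variance (Student) test: scipy's default.
pooled = ttest_ind(white_wine, red_wine)
# Welch's test: does not assume equal variances.
welch = ttest_ind(white_wine, red_wine, equal_var=False)

print(f"pooled: t = {pooled.statistic:.4f}, p = {pooled.pvalue:.3e}")
print(f"welch:  t = {welch.statistic:.4f}, p = {welch.pvalue:.3e}")
```

Because both groups have the same size, the two T-statistics coincide. Only the degrees of freedom differ, and with them the p-value, so the conclusion is the same here.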

Example 3: Paired Sample T-Test
Scenario:

You measured the acidity levels of the same batch of wines before and after a new fermentation process.

    Before Fermentation:
    [3.4, 3.5, 3.3, 3.6, 3.4, 3.5, 3.3, 3.7, 3.4, 3.6]

    After Fermentation:
    [3.2, 3.4, 3.1, 3.5, 3.2, 3.3, 3.0, 3.6, 3.3, 3.4]

In [4]:
from scipy.stats import ttest_rel

# Acidity levels before and after fermentation
before = [3.4, 3.5, 3.3, 3.6, 3.4, 3.5, 3.3, 3.7, 3.4, 3.6]
after = [3.2, 3.4, 3.1, 3.5, 3.2, 3.3, 3.0, 3.6, 3.3, 3.4]

# Perform Paired Sample T-Test
t_statistic, p_value = ttest_rel(before, after)

print(f"T-statistic: {t_statistic}")
print(f"P-value: {p_value}")


T-statistic: 7.964861856318921
P-value: 2.292672679818986e-05


Interpretation:

    T-statistic: The T-statistic is about 7.96, indicating a notable change in acidity after fermentation.

    P-value: The p-value is about 2.29e-05 (0.0000229), which is much smaller than 0.05.

    Conclusion: We reject the null hypothesis, indicating a significant difference in acidity levels before and after fermentation.
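
A paired T-test is equivalent to a one-sample T-test on the per-wine differences, tested against a mean difference of 0. The following sketch checks that equivalence on the same data; the name `differences` is illustrative.

```python
import numpy as np
from scipy.stats import ttest_1samp, ttest_rel

before = [3.4, 3.5, 3.3, 3.6, 3.4, 3.5, 3.3, 3.7, 3.4, 3.6]
after = [3.2, 3.4, 3.1, 3.5, 3.2, 3.3, 3.0, 3.6, 3.3, 3.4]

# Per-wine change in acidity (before minus after).
differences = np.array(before) - np.array(after)

paired = ttest_rel(before, after)
one_sample = ttest_1samp(differences, popmean=0)

print(f"mean difference: {differences.mean():.2f}")
print(f"paired:     t = {paired.statistic:.4f}, p = {paired.pvalue:.3e}")
print(f"one-sample: t = {one_sample.statistic:.4f}, p = {one_sample.pvalue:.3e}")
```

The positive mean difference shows the direction of the change: acidity readings are lower after fermentation than before.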