# Probability and Statistics for Machine Learning: p-values and Significance Testing

## 7. p-values and Significance Testing


### What is a p-value?

The p-value is the probability, computed under the assumption that the null hypothesis is true, of observing a test statistic at least as extreme as the one actually observed. It measures the strength of the evidence against the null hypothesis; it is not the probability that the null hypothesis is true.

- A small p-value (conventionally below 0.05) means the observed data would be unlikely if the null hypothesis were true, so we reject the null hypothesis.
- A large p-value means the data are compatible with the null hypothesis, so we fail to reject it. Failing to reject is not the same as showing that the null hypothesis is true.
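
To make the definition concrete, here is a minimal sketch that computes an exact two-sided p-value by enumeration. The scenario (15 heads in 20 flips, with the null hypothesis that the coin is fair) and the function name `binomial_two_sided_p_value` are illustrative assumptions, not part of the original example.

```python
from math import comb


def binomial_two_sided_p_value(heads: int, flips: int) -> float:
    """Probability, under a fair coin, of a head count at least as far
    from flips / 2 as the observed one (in either direction)."""
    expected = flips / 2
    observed_distance = abs(heads - expected)
    # Under H0 every sequence of flips is equally likely.
    total_outcomes = 2 ** flips
    # Count the sequences whose head count is at least as extreme.
    extreme_outcomes = sum(
        comb(flips, k)
        for k in range(flips + 1)
        if abs(k - expected) >= observed_distance
    )
    return extreme_outcomes / total_outcomes


p_value = binomial_two_sided_p_value(heads=15, flips=20)
print(f"p-value = {p_value:.4f}")  # p-value = 0.0414
```

The "at least as extreme" outcomes here are 0 to 5 heads and 15 to 20 heads, so the p-value is the total probability of those outcomes under a fair coin.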

### Significance Level (\( \alpha \))

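The significance level, denoted by \( \alpha \), is the threshold that the p-value must not exceed for us to reject the null hypothesis. It is chosen before looking at the data and equals the probability of rejecting a null hypothesis that is actually true (a Type I error). Common choices for \( \alpha \) are 0.05, 0.01, and 0.10.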
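
One way to see what the significance level controls is to simulate many experiments in which the null hypothesis is true and count how often the test rejects it. The sketch below assumes normally distributed data; the population parameters, sample size, number of experiments, and seed are arbitrary choices for illustration.

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(seed=0)
alpha = 0.05
n_experiments, sample_size = 5000, 20

# Every experiment samples from a population whose true mean is 170,
# so the null hypothesis "mean == 170" is true in all of them.
samples = rng.normal(loc=170.0, scale=3.0, size=(n_experiments, sample_size))

# One two-sided t-test per row (axis=1 treats each row as one experiment).
_, p_values = stats.ttest_1samp(samples, popmean=170.0, axis=1)

# Fraction of experiments in which a true H0 was rejected (Type I errors).
false_positive_rate = np.mean(p_values <= alpha)
print(f"Rejected a true H0 in {false_positive_rate:.1%} of experiments")
```

The rejection rate should land close to the chosen significance level, which is why a smaller value makes false alarms rarer at the cost of needing stronger evidence to reject.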

### Example: p-value in a t-test

A one-sample t-test checks whether the mean of a sample differs significantly from a hypothesized population mean. In the example below, the null hypothesis is that the population mean is 170.
    

In [None]:

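# Example: calculating a p-value with a one-sample t-test
import numpy as np
from scipy import stats

sample_data = np.array([168, 172, 169, 171, 170, 173, 174])
population_mean = 170  # hypothesized mean under the null hypothesis

# Perform a two-sided one-sample t-test of H0: mean == population_mean
t_statistic, p_value = stats.ttest_1samp(sample_data, population_mean)
p_value  # displayed as the cell output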
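
The result of `stats.ttest_1samp` can be reproduced step by step, which shows where the p-value comes from. This is a sketch of the standard two-sided calculation using the same data; the variable names `t_manual` and `p_manual` are introduced here for illustration.

```python
import numpy as np
from scipy import stats

sample_data = np.array([168, 172, 169, 171, 170, 173, 174])
population_mean = 170

n = sample_data.size
sample_mean = sample_data.mean()
# ddof=1 gives the sample standard deviation (divides by n - 1).
standard_error = sample_data.std(ddof=1) / np.sqrt(n)

# t statistic: distance of the sample mean from the hypothesized mean,
# measured in standard errors.
t_manual = (sample_mean - population_mean) / standard_error

# Two-sided p-value: probability in both tails beyond |t| for a
# t distribution with n - 1 degrees of freedom.
p_manual = 2 * stats.t.sf(abs(t_manual), df=n - 1)

t_scipy, p_scipy = stats.ttest_1samp(sample_data, population_mean)
print(f"manual: t = {t_manual:.4f}, p = {p_manual:.4f}")
print(f"scipy:  t = {t_scipy:.4f}, p = {p_scipy:.4f}")
```

For this sample the mean is 171 and the t statistic is about 1.22. The resulting p-value is well above 0.05, so at that significance level we fail to reject the hypothesis that the population mean is 170.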
    


### Interpreting the p-value

- If \( p \leq \alpha \), we reject the null hypothesis \( H_0 \).
- If \( p > \alpha \), we fail to reject the null hypothesis \( H_0 \).

### Example

If the p-value is 0.03 and the significance level \( \alpha \) is 0.05, we reject the null hypothesis because \( 0.03 \leq 0.05 \). At the 5% level, the data provide enough evidence to favor the alternative hypothesis. With a stricter level of \( \alpha = 0.01 \), the same p-value would not lead to rejection.
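
The decision rule is simple enough to write as a small helper. The function name `decide` and its return strings are hypothetical, chosen only to illustrate the comparison between the p-value and the significance level.

```python
def decide(p_value: float, alpha: float = 0.05) -> str:
    """Apply the rule: reject H0 when p_value <= alpha."""
    if not 0.0 <= p_value <= 1.0:
        raise ValueError("p_value must lie in [0, 1]")
    if not 0.0 < alpha < 1.0:
        raise ValueError("alpha must lie in (0, 1)")
    return "reject H0" if p_value <= alpha else "fail to reject H0"


print(decide(0.03, alpha=0.05))  # reject H0
print(decide(0.03, alpha=0.01))  # fail to reject H0
```

The same p-value leads to different decisions under different significance levels, which is why the level should be fixed before the test is run.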

### Applications in Machine Learning

p-values are used in machine learning to:
- Test the significance of model coefficients.
- Compare models to determine whether their differences in performance are statistically significant.
- Check modeling assumptions, such as normality of residuals or homoscedasticity.
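
The three uses listed above can each be sketched with `scipy.stats`. Everything in this example is an assumption made for illustration: the regression data are synthetic, the per-fold accuracies are made-up numbers rather than real results, and the Shapiro-Wilk and paired t-tests are just one possible choice of test for each task.

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(seed=42)

# 1. Significance of a coefficient: simple linear regression on
#    synthetic data generated with a true slope of 2.
x = np.linspace(0.0, 10.0, 50)
y = 2.0 * x + 1.0 + rng.normal(scale=1.0, size=x.size)
fit = stats.linregress(x, y)
print(f"slope = {fit.slope:.3f}, p-value for H0 'slope == 0': {fit.pvalue:.3g}")

# 2. Assumption check: Shapiro-Wilk test, whose null hypothesis is
#    that the residuals come from a normal distribution.
residuals = y - (fit.intercept + fit.slope * x)
shapiro_stat, shapiro_p = stats.shapiro(residuals)
print(f"Shapiro-Wilk p-value: {shapiro_p:.3f}")

# 3. Model comparison: paired t-test on accuracies measured on the
#    same folds. These scores are hypothetical, not real results.
model_a_scores = np.array([0.81, 0.79, 0.84, 0.80, 0.83])
model_b_scores = np.array([0.78, 0.77, 0.80, 0.79, 0.80])
t_stat, paired_p = stats.ttest_rel(model_a_scores, model_b_scores)
print(f"paired t-test p-value: {paired_p:.4f}")
```

Note the direction of each test: a small p-value for the slope suggests the coefficient is nonzero, whereas a small Shapiro-Wilk p-value suggests the normality assumption is violated.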

    