
## 1. Properties of the F-distribution
- The F-distribution is not symmetric; it is positively skewed.
- It is defined only for positive values (greater than zero).
- The shape depends on the degrees of freedom of the numerator (ν1) and the denominator (ν2).
- As ν1 and ν2 both increase, the distribution becomes less skewed, concentrates around 1, and is increasingly well approximated by a normal distribution.
- The mean of the F-distribution is ν2/(ν2 - 2) for ν2 > 2.
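The properties above can be checked numerically. The following is a minimal sketch using `scipy.stats.f`; the degrees of freedom (5 and 10) are arbitrary values chosen for illustration and do not come from any data set in these notes.

```python
from scipy import stats

# Hypothetical degrees of freedom, chosen only for illustration
dfn, dfd = 5, 10  # nu1 (numerator), nu2 (denominator)

f_dist = stats.f(dfn, dfd)

# The mean should equal nu2 / (nu2 - 2) when nu2 > 2
mean_scipy = f_dist.mean()
mean_formula = dfd / (dfd - 2)

# Skewness exists only for nu2 > 6; a positive value means a right-skewed shape
skewness = float(f_dist.stats(moments="s"))

# The distribution has no probability mass at or below zero
cdf_at_zero = f_dist.cdf(0.0)

print(f"mean (scipy): {mean_scipy}, mean (formula): {mean_formula}")
print(f"skewness: {skewness}")
print(f"P(F <= 0): {cdf_at_zero}")
```

Changing `dfn` and `dfd` to larger values should show the skewness shrinking toward zero, in line with the approach to normality.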

## 2. Statistical Tests Using the F-distribution
- **ANOVA (Analysis of Variance)**: Used to compare the means of multiple groups by comparing the variance between groups with the variance within groups.
- **F-test for Equality of Variances**: Checks if two populations have the same variance.
- The F-distribution is appropriate for these tests because each test statistic is a ratio of two variance estimates, and that ratio follows an F-distribution under the null hypothesis.

## 3. Key Assumptions for an F-test
- Data must be independent and randomly sampled.
- Populations should follow a normal distribution.
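- For ANOVA, the populations should have equal variances (homogeneity of variance). For the F-test of equality of variances, equal variances is the null hypothesis being tested, not an assumption.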
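One possible way to screen the normality and equal-variance assumptions before an ANOVA is sketched below. It uses the regional height data from the one-way ANOVA example in these notes. The choice of Shapiro-Wilk and Levene's test is an assumption of this sketch, not something the notes prescribe, and with only five observations per group both tests have little power.

```python
import numpy as np
from scipy import stats

# Heights for three regions (same values as the one-way ANOVA example)
groups = {
    "region_a": np.array([160, 162, 165, 158, 164]),
    "region_b": np.array([172, 175, 170, 168, 174]),
    "region_c": np.array([180, 182, 179, 185, 183]),
}

# Normality: Shapiro-Wilk test per group (null hypothesis: data are normal)
shapiro_p = {}
for name, values in groups.items():
    _, p = stats.shapiro(values)
    shapiro_p[name] = p
    print(f"{name}: Shapiro-Wilk p-value = {p:.3f}")

# Equal variances: Levene's test across all groups (null hypothesis: equal variances)
_, levene_p = stats.levene(*groups.values())
print(f"Levene p-value = {levene_p:.3f}")
```

A small p-value from either test suggests the corresponding assumption may be violated. Independence and random sampling cannot be checked this way; they depend on how the data were collected.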

## 4. Purpose of ANOVA vs. t-test
- **ANOVA**: Compares means of three or more groups.
- **t-test**: Compares means of two groups.
- A single ANOVA keeps the overall Type I error rate at the chosen significance level, whereas running several t-tests inflates it.

## 5. Using One-Way ANOVA Instead of Multiple t-tests
- One-way ANOVA is used to compare more than two groups while controlling the overall Type I error rate.
- A single ANOVA is more efficient and more valid than running a separate t-test for every pair of groups.
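The inflation of the Type I error rate can be illustrated with a short calculation. The sketch below assumes the pairwise tests are independent, which pairwise t-tests on shared groups are not, so the result is an approximation rather than an exact rate. The function name `familywise_error_rate` is hypothetical.

```python
from math import comb

def familywise_error_rate(n_groups, alpha=0.05):
    """Approximate chance of at least one false positive across all pairwise tests.

    Assumes the tests are independent (a simplification for illustration).
    """
    n_tests = comb(n_groups, 2)  # number of pairwise comparisons
    return n_tests, 1 - (1 - alpha) ** n_tests

for k in (3, 4, 5):
    n_tests, fwer = familywise_error_rate(k)
    print(f"{k} groups -> {n_tests} t-tests, approximate familywise error rate = {fwer:.3f}")
```

With three groups the approximate rate is already about 0.14 instead of 0.05, which is why one overall test is preferred.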

## 6. Partitioning Variance in ANOVA
- **Between-group variance**: Variance due to group mean differences.
- **Within-group variance**: Variance within each group.
- The F-statistic is the ratio of between-group variance to within-group variance (each a sum of squares divided by its degrees of freedom). A large ratio indicates that at least one group mean differs from the others.
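The partition can be computed by hand and compared with `scipy.stats.f_oneway`. This sketch uses the regional height data from the one-way ANOVA example in these notes; the variable names are illustrative.

```python
import numpy as np
from scipy import stats

# Heights for three regions (same values as the one-way ANOVA example)
groups = [
    np.array([160, 162, 165, 158, 164]),
    np.array([172, 175, 170, 168, 174]),
    np.array([180, 182, 179, 185, 183]),
]

all_values = np.concatenate(groups)
grand_mean = all_values.mean()
k = len(groups)            # number of groups
n_total = all_values.size  # total number of observations

# Between-group sum of squares: group means around the grand mean
ss_between = sum(g.size * (g.mean() - grand_mean) ** 2 for g in groups)
# Within-group sum of squares: observations around their own group mean
ss_within = sum(((g - g.mean()) ** 2).sum() for g in groups)

# Mean squares are sums of squares divided by their degrees of freedom
ms_between = ss_between / (k - 1)
ms_within = ss_within / (n_total - k)

f_manual = ms_between / ms_within
p_manual = stats.f.sf(f_manual, k - 1, n_total - k)

f_scipy, p_scipy = stats.f_oneway(*groups)
print(f"manual: F = {f_manual:.4f}, p = {p_manual:.3e}")
print(f"scipy:  F = {f_scipy:.4f}, p = {p_scipy:.3e}")
```

Both routes should give the same F-statistic, which shows that `f_oneway` is this ratio of mean squares.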

## 7. Frequentist vs. Bayesian ANOVA
- **Frequentist ANOVA**: Tests the null hypothesis of equal group means using the F-statistic and its p-value.
- **Bayesian ANOVA**: Produces posterior probability distributions for the model parameters, so uncertainty is expressed directly in terms of those parameters.
- **Key difference**: Bayesian ANOVA incorporates prior beliefs through a prior distribution; frequentist ANOVA does not.







## 8. F-test for Incomes of Two Professions
import numpy as np
from scipy import stats

# Data for the two professions
profession_a = np.array([48, 52, 55, 60, 62])
profession_b = np.array([45, 50, 55, 52, 47])

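# Test for equality of variances.
# Note: stats.levene runs Levene's test (median-centered by default), a robust
# alternative to the classical variance-ratio F-test. Its statistic is
# compared against an F-distribution.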
f_statistic, p_value = stats.levene(profession_a, profession_b)
print(f"F-statistic: {f_statistic}, p-value: {p_value}")

F-statistic: 0.7368421052631583, p-value: 0.4156507222081854


### Interpretation
- If the p-value is below the significance level (e.g., 0.05), reject the null hypothesis of equal variances. Here the p-value is about 0.42, so the null hypothesis is not rejected.
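The cell above calls `stats.levene`, which is Levene's test rather than the classical variance-ratio F-test. For comparison, here is a minimal sketch of the classical test on the same data. It assumes both samples come from normal populations, and the two-sided p-value convention (doubling the smaller tail) is one common choice, not the only one.

```python
import numpy as np
from scipy import stats

# Same data as the example above
profession_a = np.array([48, 52, 55, 60, 62])
profession_b = np.array([45, 50, 55, 52, 47])

# Sample variances (ddof=1 gives the unbiased estimator)
var_a = np.var(profession_a, ddof=1)
var_b = np.var(profession_b, ddof=1)

# The F-ratio and its degrees of freedom
f_ratio = var_a / var_b
dfn = profession_a.size - 1
dfd = profession_b.size - 1

# Upper-tail probability, then a two-sided p-value by doubling the smaller tail
p_upper = stats.f.sf(f_ratio, dfn, dfd)
p_two_sided = 2 * min(p_upper, 1 - p_upper)

print(f"F-ratio: {f_ratio:.4f}, two-sided p-value: {p_two_sided:.4f}")
```

The classical test is sensitive to departures from normality, which is one reason Levene's test is often used instead.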


## 9. One-Way ANOVA for Heights

import numpy as np
from scipy import stats

# Data for the three regions
region_a = np.array([160, 162, 165, 158, 164])
region_b = np.array([172, 175, 170, 168, 174])
region_c = np.array([180, 182, 179, 185, 183])

# Perform the one-way ANOVA
f_statistic, p_value = stats.f_oneway(region_a, region_b, region_c)
print(f"F-statistic: {f_statistic}, p-value: {p_value}")



F-statistic: 67.87330316742101, p-value: 2.8706641879370266e-07


### Interpretation
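- If the p-value is below the significance level (e.g., 0.05), at least one group mean differs significantly from the others. Here the p-value is about 2.9e-07, so the null hypothesis of equal mean heights across the three regions is rejected.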