In [1]:
#1. Properties of the F-Distribution
#Shape: Asymmetric, skewed right, and depends on the degrees of freedom.
#Non-negative values: F-distribution values are always ≥ 0.
#Parameters: Defined by two degrees of freedom:
#df1 (numerator) and df2 (denominator).
#Mean: df2 / (df2 - 2) when df2 > 2, which approaches 1 as df2 grows.
#Asymptotic behavior: As df1 and df2 increase, the F-distribution becomes more symmetric, concentrates around 1, and is approximately normal.
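#A minimal sketch of the mean property, assuming SciPy is installed. The helper name f_mean_formula and the choice df1 = 4 are illustrative assumptions, not part of the original notes.
#It compares the closed-form mean df2 / (df2 - 2) with the value reported by scipy.stats.f.mean.

```python
from scipy import stats


def f_mean_formula(df2):
    """Closed-form mean of an F-distribution; defined only for df2 > 2."""
    return df2 / (df2 - 2)


df1 = 4  # arbitrary choice: the mean does not depend on df1
for df2 in (5, 10, 100):
    scipy_mean = stats.f.mean(df1, df2)
    print(f"df2={df2}: scipy={scipy_mean:.4f}, formula={f_mean_formula(df2):.4f}")
```

#The two columns should agree, and both move toward 1 as df2 increases.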


In [2]:
#2. Statistical Tests Using F-Distribution
#Tests: Used in ANOVA, regression analysis, and comparing variances.
#Appropriate because:
#The test statistic is a ratio of two variance estimates, which follows an F-distribution under the null hypothesis.
#It evaluates whether the observed difference in variances could plausibly be due to chance.

In [3]:
#3. Assumptions for Conducting an F-Test
#Populations follow a normal distribution.
#Samples are independent.
#Random sampling is used.
#Under the null hypothesis, the ratio of the sample variances follows the F-distribution.
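#A rough sketch of checking the normality assumption with the Shapiro-Wilk test, assuming SciPy is installed. The data are the profession samples used in section 8; the dictionary names are illustrative.
#With only five observations per group the test has little power, so a large p-value is weak evidence at best.

```python
from scipy import stats

samples = {
    "profession_A": [48, 52, 55, 60, 62],
    "profession_B": [45, 50, 55, 52, 47],
}

shapiro_p = {}
for name, values in samples.items():
    # H0: the sample was drawn from a normal distribution
    statistic, p = stats.shapiro(values)
    shapiro_p[name] = p
    print(f"{name}: W={statistic:.3f}, p-value={p:.4f}")
```

#A small p-value would suggest the normality assumption is doubtful for that sample.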

In [4]:
#4. Purpose of ANOVA vs. t-Test
#Aspect   | ANOVA                                | t-Test
#Purpose  | Compare means across multiple groups | Compare means of two groups
#Use case | More than two groups                 | Exactly two groups
#Output   | F-statistic and p-value              | t-statistic and p-value
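#A small sketch of how the two tests relate, assuming SciPy is installed and reusing the region_A and region_B heights from section 9.
#For exactly two groups, one-way ANOVA and the equal-variance t-test are equivalent: F equals t squared and the p-values match.

```python
from scipy import stats

region_A = [160, 162, 165, 158, 164]
region_B = [172, 175, 170, 168, 174]

# Student's t-test (equal_var=True is the default in ttest_ind)
t_stat, p_t = stats.ttest_ind(region_A, region_B)

# One-way ANOVA on the same two groups
f_stat, p_f = stats.f_oneway(region_A, region_B)

print(f"t^2 = {t_stat**2:.4f}, F = {f_stat:.4f}")
print(f"t-test p-value = {p_t:.6f}, ANOVA p-value = {p_f:.6f}")
```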


In [5]:
#5. Why Use One-Way ANOVA Instead of Multiple t-Tests?
#Efficiency: A single test replaces the k(k-1)/2 pairwise t-tests needed to compare k groups.
#Control of Type I error: Keeps the overall (family-wise) significance level at the chosen alpha instead of letting it inflate across repeated tests.
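#A minimal sketch of the error accumulation, using only the standard library. The function name familywise_error_rate is illustrative, and the formula assumes the pairwise tests are independent, which is only an approximation in practice.

```python
from math import comb


def familywise_error_rate(n_groups, alpha=0.05):
    """Chance of at least one false positive across all pairwise tests,
    assuming the tests are independent (an approximation)."""
    n_tests = comb(n_groups, 2)  # number of pairwise comparisons
    return 1 - (1 - alpha) ** n_tests


for k in (3, 4, 5):
    print(f"{k} groups: {comb(k, 2)} tests, "
          f"family-wise error rate = {familywise_error_rate(k):.4f}")
```

#Even with three groups the rate is well above the nominal 0.05, which is the inflation one-way ANOVA avoids.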


In [6]:
#6. Variance Partitioning in ANOVA
#Between-group variance: Variance due to differences in group means.
#Within-group variance: Variance within individual groups.
#F-statistic: F = Between-group variance / Within-group variance

#It determines if group means significantly differ.
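#A sketch of the partitioning computed by hand, assuming NumPy and SciPy are installed and reusing the region heights from section 9. Variable names such as ss_between and ms_within are illustrative.
#The between-group and within-group sums of squares are divided by their degrees of freedom (k - 1 and N - k) before taking the ratio.

```python
import numpy as np
from scipy import stats

groups = [
    np.array([160, 162, 165, 158, 164]),  # region_A
    np.array([172, 175, 170, 168, 174]),  # region_B
    np.array([180, 182, 179, 185, 183]),  # region_C
]

k = len(groups)                        # number of groups
n_total = sum(len(g) for g in groups)  # total number of observations
grand_mean = np.concatenate(groups).mean()

# Between-group: spread of group means around the grand mean
ss_between = sum(len(g) * (g.mean() - grand_mean) ** 2 for g in groups)
# Within-group: spread of observations around their own group mean
ss_within = sum(((g - g.mean()) ** 2).sum() for g in groups)

ms_between = ss_between / (k - 1)
ms_within = ss_within / (n_total - k)
f_manual = ms_between / ms_within

f_scipy, _ = stats.f_oneway(*groups)
print(f"manual F = {f_manual:.2f}, scipy F = {f_scipy:.2f}")
```

#The manual value should match the one returned by scipy.stats.f_oneway.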


In [7]:
#7. Classical vs. Bayesian ANOVA
#Aspect               | Classical ANOVA                       | Bayesian ANOVA
#Uncertainty          | Uses p-values; fixed conclusions      | Uses probability distributions
#Parameter estimation | Based on sample data                  | Incorporates prior information
#Hypothesis testing   | Binary reject/fail to reject approach | Provides a range of likely values


In [8]:
#8. F-Test for Comparing Variances (Python Code)

import numpy as np
from scipy.stats import f

# Data
profession_A = [48, 52, 55, 60, 62]
profession_B = [45, 50, 55, 52, 47]

# Sample variances (ddof=1 gives the unbiased estimator)
var_A = np.var(profession_A, ddof=1)
var_B = np.var(profession_B, ddof=1)

# F-statistic
F = var_A / var_B

# Degrees of freedom
df1 = len(profession_A) - 1
df2 = len(profession_B) - 1

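# One-tailed p-value (H1: variance of A > variance of B)
p_value = f.sf(F, df1, df2)  # survival function, equivalent to 1 - f.cdf(F, df1, df2)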

print(f"F-statistic: {F:.2f}, p-value: {p_value:.4f}")


F-statistic: 2.09, p-value: 0.2465
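#The p-value above is one-tailed. A possible two-sided variant, sketched under the assumption that the alternative is simply "the variances differ", doubles the smaller tail probability. The names F_stat, p_one_sided and p_two_sided are illustrative.

```python
import numpy as np
from scipy.stats import f

profession_A = [48, 52, 55, 60, 62]
profession_B = [45, 50, 55, 52, 47]

F_stat = np.var(profession_A, ddof=1) / np.var(profession_B, ddof=1)
df1 = len(profession_A) - 1
df2 = len(profession_B) - 1

# One-tailed: probability of an F at least this large under H0
p_one_sided = f.sf(F_stat, df1, df2)
# Two-sided: double the smaller of the two tail probabilities
p_two_sided = 2 * min(f.cdf(F_stat, df1, df2), f.sf(F_stat, df1, df2))

print(f"one-sided p-value: {p_one_sided:.4f}, two-sided p-value: {p_two_sided:.4f}")
```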


In [9]:
#9. One-Way ANOVA for Heights (Python Code)


import scipy.stats as stats

# Data
region_A = [160, 162, 165, 158, 164]
region_B = [172, 175, 170, 168, 174]
region_C = [180, 182, 179, 185, 183]

# One-way ANOVA
F, p_value = stats.f_oneway(region_A, region_B, region_C)

print(f"F-statistic: {F:.2f}, p-value: {p_value:.4f}")


F-statistic: 67.87, p-value: 0.0000
