#### 1. The F-statistic is a ratio of two variances (or mean squares).
#### 2. The critical value comes from the F-distribution at the chosen significance level and is the threshold for deciding whether the test result is significant.
#### 3. If 𝐹 > critical value, the result is statistically significant (reject 𝐻0); otherwise, fail to reject 𝐻0.
#### 4. The F-distribution is right-skewed, takes only non-negative values, and depends on both the numerator and the denominator degrees of freedom.
#### 5. ANOVA uses an F-test to compare the means of multiple groups, while the basic F-test compares the variances of two groups.
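
#### To make point 5 concrete, here is a minimal sketch, assuming SciPy is installed. It runs a one-way ANOVA on the two worker samples used in this notebook, so it compares group means rather than variances. The variable names are illustrative. With only two groups, the ANOVA F-statistic equals the square of the pooled two-sample t-statistic, which the last line lets you check.

```python
from scipy import stats

worker1 = [18, 19, 22, 25, 27, 28, 41, 45, 51, 55]
worker2 = [14, 15, 15, 17, 18, 22, 25, 25, 27, 34]

# One-way ANOVA: an F-test on the group MEANS (not the variances)
anova_f, anova_p = stats.f_oneway(worker1, worker2)

# Pooled-variance two-sample t-test on the same data
t_stat, t_p = stats.ttest_ind(worker1, worker2, equal_var=True)

print(anova_f, anova_p)
print(t_stat ** 2)  # matches anova_f when there are exactly two groups
```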

In [2]:
import numpy as np

worker1 = [18, 19, 22, 25, 27, 28, 41, 45, 51, 55] # Sample 1 data
worker2 = [14, 15, 15, 17, 18, 22, 25, 25, 27, 34] # Sample 2 data

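# F-statistic: ratio of the two group variances.
# Note: np.var defaults to the population variance (ddof=0). Both samples have
# the same size here, so the ratio equals the ratio of sample variances (ddof=1).
f_test = np.var(worker1) / np.var(worker2)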
print(f_test)

4.387122002085506


In [6]:
# Degrees of freedom

df1 = len(worker1) - 1 # dfn: degrees of freedom for the numerator
df2 = len(worker2) - 1 # dfd: degrees of freedom for the denominator

significance_value = 0.05 # Significance level (alpha)

# Upper-tail (one-sided) critical value from the F-distribution
import scipy.stats as stat
critical_value = stat.f.ppf(q = 1 - significance_value, dfn = df1, dfd = df2)
print(critical_value)

3.178893104458269
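
#### An equivalent way to reach the same decision is to compute a p-value and compare it with alpha. The sketch below assumes the same data and the same one-sided (upper-tail) test. It uses stats.f.sf, the survival function (1 - CDF). The names f_stat and p_value are illustrative.

```python
import numpy as np
from scipy import stats

worker1 = [18, 19, 22, 25, 27, 28, 41, 45, 51, 55]
worker2 = [14, 15, 15, 17, 18, 22, 25, 25, 27, 34]

# Ratio of sample variances (ddof=1)
f_stat = np.var(worker1, ddof=1) / np.var(worker2, ddof=1)

dfn = len(worker1) - 1  # numerator degrees of freedom
dfd = len(worker2) - 1  # denominator degrees of freedom

# Upper-tail p-value: probability of an F value at least this large under H0
p_value = stats.f.sf(f_stat, dfn, dfd)

alpha = 0.05
print(p_value)
print(p_value < alpha)  # True means the same decision as F > critical value
```

#### Comparing p_value with alpha and comparing the F-statistic with the critical value always give the same decision for the same tail and the same alpha.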


#### The expression stat.f.ppf() has three parts, and each one has a specific meaning. Let's break it down:

## 1. stat:
#### stat is the alias under which this notebook imports the scipy.stats module (import scipy.stats as stat). SciPy is a scientific computing library for Python, and scipy.stats provides a large collection of probability distributions and statistical tests, including probability density functions, cumulative distribution functions, and percent point functions.

## 2. f:
#### The f refers to the F-distribution within the scipy.stats module. The F-distribution is a probability distribution that arises frequently in the context of statistical tests, like the F-test and ANOVA (Analysis of Variance).
#### The F-distribution is a family of curves that depends on two parameters: the degrees of freedom for the numerator and the denominator. These degrees of freedom relate to the variability between and within groups in tests like ANOVA.
#### The F-distribution is typically used when comparing two sample variances to test if they are equal. It is also used when testing if several group means are equal (ANOVA).
#### In this notebook, stat.f (that is, scipy.stats.f) gives access to the functions for the F-distribution, such as pdf, cdf, sf, and ppf.
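
#### The dependence on both degrees of freedom can be seen numerically. This is a small sketch with arbitrarily chosen degree-of-freedom pairs (only (9, 9) comes from this notebook). For each pair it prints the mean, the skewness, and the upper 5% critical value.

```python
from scipy import stats

# (numerator df, denominator df) pairs; chosen only for illustration
pairs = [(2, 10), (9, 9), (30, 30)]

rows = []
for dfn, dfd in pairs:
    mean = stats.f.mean(dfn, dfd)                            # dfd / (dfd - 2) for dfd > 2
    skew = float(stats.f.stats(dfn, dfd, moments='s'))       # positive: right-skewed
    crit = stats.f.ppf(0.95, dfn, dfd)                       # upper 5% critical value
    rows.append((dfn, dfd, mean, skew, crit))
    print(dfn, dfd, mean, skew, crit)
```

#### The skewness is positive in every case, and for these pairs the critical value shrinks as the degrees of freedom grow.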

## 3. ppf:
#### ppf stands for Percent Point Function. The percent point function is the inverse of the cumulative distribution function (CDF). It returns the value of the random variable (the quantile) for which the probability of observing a value less than or equal to it equals a specified probability.

#### Mathematically:
#### x = ppf(q)
#### Where:
#### 𝑞 is the cumulative probability (between 0 and 1),
#### 𝑥 is the value (quantile) of the distribution that corresponds to 𝑞.

#### In simpler terms, for the F-distribution:
#### ppf(q) returns the value of the F-statistic that corresponds to a cumulative probability of 𝑞. This means it tells you the value on the F-distribution curve where the area under the curve (up to that point) equals 𝑞.

#### For example:
#### If q = 0.95 and you call ppf(0.95), the function returns the value of the F-statistic such that 95% of the distribution is to the left of this value and 5% is to the right. This is why the critical value above is computed with q = 1 - significance_value.
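
#### The inverse relationship between ppf and cdf can be checked directly. This is a minimal sketch using the degrees of freedom from this notebook (9 and 9). Passing the ppf result back into cdf should recover q up to floating-point error.

```python
from scipy import stats

dfn, dfd = 9, 9  # degrees of freedom used in this notebook
q = 0.95

x = stats.f.ppf(q, dfn, dfd)          # value with 95% of the area to its left
left_area = stats.f.cdf(x, dfn, dfd)  # area to the left of x, approximately q
right_area = stats.f.sf(x, dfn, dfd)  # area to the right of x, approximately 1 - q

print(x)
print(left_area, right_area)
```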

In [7]:
# Compare F-statistic with critical value
# Null hypothesis (H0): the two groups have equal variances

if f_test > critical_value :
    print('Reject the null hypothesis')
else :
    print('Fail to reject the null hypothesis')

Reject the null hypothesis
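
#### The steps in this notebook can be collected into one function. The sketch below is one possible arrangement, not a library API: variance_f_test is a hypothetical helper name, and it assumes an upper-tail test with the first sample's variance in the numerator. It uses sample variances (ddof=1), so it also works when the two samples differ in size.

```python
import numpy as np
from scipy import stats

def variance_f_test(sample_a, sample_b, alpha=0.05):
    """Upper-tail F-test of H0: equal variances (hypothetical helper)."""
    f_stat = np.var(sample_a, ddof=1) / np.var(sample_b, ddof=1)
    dfn = len(sample_a) - 1  # numerator degrees of freedom
    dfd = len(sample_b) - 1  # denominator degrees of freedom
    critical = stats.f.ppf(1 - alpha, dfn, dfd)
    p_value = stats.f.sf(f_stat, dfn, dfd)
    reject = bool(f_stat > critical)
    return f_stat, critical, p_value, reject

worker1 = [18, 19, 22, 25, 27, 28, 41, 45, 51, 55]
worker2 = [14, 15, 15, 17, 18, 22, 25, 25, 27, 34]

f_stat, critical, p_value, reject = variance_f_test(worker1, worker2)
print(f_stat, critical, p_value, reject)
```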
