#### $\bullet$ F test
F Ratio

$F=\dfrac{variability~between~groups}{variability~within~groups}$

An F test of the null hypothesis is based on the notion that if the null hypothesis is true, both the numerator and the denominator of the F ratio would tend to be about the same, but if the null hypothesis is false, the numerator would tend to be larger than the denominator.

#### If Null Hypothesis Is True

$F=\dfrac{random~error}{random~error}$

Except for chance, estimates in both the numerator and the denominator are similar, and generally, F varies about a value of 1.

#### If Null Hypothesis Is False

$F=\dfrac{random~error + treatment~effect}{random~error}$

When the null hypothesis is false, the presence of a treatment effect tends to cause a chain reaction: The observed differences between group means tend to be large, as does the variability between groups. Accordingly, the numerator term tends to exceed the denominator term, producing an F whose value is larger than 1. When the null hypothesis is false because of a large treatment effect, there is an even more pronounced chain reaction, beginning with very large observed differences between group means and ending with an F whose value tends to be considerably larger than 1.
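The contrast between the two cases can be illustrated with a small simulation. The sketch below is only an illustration under stated assumptions: scores are drawn from normal populations with a standard deviation of 1, there are three groups of 10 scores each, and $F$ is computed from mean squares as defined later in this section. The helper names `f_ratio` and `mean_f`, the population means, and the seed are all hypothetical choices, not part of the original text.

```python
import random

def f_ratio(groups):
    """F = MS_between / MS_within for a list of equal-sized groups."""
    k = len(groups)                      # number of groups
    n = len(groups[0])                   # scores per group (assumed equal)
    N = k * n                            # total number of scores
    grand_mean = sum(sum(g) for g in groups) / N
    group_means = [sum(g) / n for g in groups]
    ss_between = n * sum((m - grand_mean) ** 2 for m in group_means)
    ss_within = sum((x - m) ** 2
                    for g, m in zip(groups, group_means) for x in g)
    return (ss_between / (k - 1)) / (ss_within / (N - k))

def mean_f(population_means, n=10, reps=2000, seed=0):
    """Average F over many simulated experiments."""
    rng = random.Random(seed)            # fixed seed for repeatability
    total = 0.0
    for _ in range(reps):
        groups = [[rng.gauss(mu, 1.0) for _ in range(n)]
                  for mu in population_means]
        total += f_ratio(groups)
    return total / reps

# Null hypothesis true: all population means are equal.
mean_f_null = mean_f([0.0, 0.0, 0.0])
# Null hypothesis false: a treatment effect separates the population means.
mean_f_effect = mean_f([0.0, 1.0, 2.0])

print(f'average F, null true:  {mean_f_null:.2f}')
print(f'average F, null false: {mean_f_effect:.2f}')
```

Under these assumptions the average $F$ should sit close to 1 when the null hypothesis is true and well above 1 when a treatment effect is present.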

#### $\bullet$ VARIANCE ESTIMATES

Sum of Squares (sum of squared deviations about their mean)

$SS=\Sigma(X-\overline{X})^2$

Sample Variance, $s^2$

$s^2=\dfrac{SS}{n-1}=\dfrac{SS}{df}$

#### Mean Square (MS) - A variance estimate in ANOVA, referred to as a mean square, consists of some sum of squares divided by its degrees of freedom.

$MS=\dfrac{SS}{df}$
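As a quick numerical check of $s^2=SS/df$, the sketch below computes the sum of squares for a small set of scores and divides it by $n-1$. The scores and variable names are illustrative assumptions; `statistics.variance` from the Python standard library is used only as an independent cross-check.

```python
import math
import statistics

scores = [3, 6, 6]  # illustrative scores

n = len(scores)
mean = sum(scores) / n
ss = sum((x - mean) ** 2 for x in scores)  # SS: squared deviations about the mean
df = n - 1                                 # degrees of freedom
sample_variance = ss / df                  # s^2 = SS / df

print(f'SS = {ss}, df = {df}, s^2 = {sample_variance}')

# The standard library's sample variance should agree with SS / df.
assert math.isclose(sample_variance, statistics.variance(scores))
```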

#### $\bullet$ Sum of Squares (SS): Definitional Formulas

$SS_{between}$ the sum of squares for variability between groups

$SS_{within}$ the sum of squares for variability within groups

$SS_{total}$ the sum of squares for the total of these two; it equals the sum of the squared deviations of all scores about the grand mean.

#### $ * SS_{total}$

$SS_{total}=\Sigma(X-\overline{X}_{grand})^2$

where $X$ represents each score and $\overline{X}_{grand}$ represents the one overall mean for all scores. Although $SS_{total}$ isn’t directly involved in the calculation of the F ratio, it serves as a valuable computational check.

#### $ * SS_{between}$

$SS_{between}=n\Sigma(\overline{X}_{group}-\overline{X}_{grand})^2$

where $n$ represents the number of scores in each group (assumed equal across groups), $\overline{X}_{group}$ is the mean for each group, and $\overline{X}_{grand}$ is the overall mean for all groups. This term contributes to the numerator of the $F$ ratio. The multiplier $n$ in the expression for $SS_{between}$ reflects the fact that the deviation $\overline{X}_{group}-\overline{X}_{grand}$ is the same for each of the $n$ scores in that group.

#### $ * SS_{within}$

$SS_{within}=\Sigma(X-\overline{X}_{group})^2$

where $X$ represents each score and $\overline{X}_{group}$ is the mean for each group. This term contributes to the denominator of the $F$ ratio. Essentially, it requires that we calculate the sum of squares, $SS$, within each group and then add these terms across all groups, in a procedure similar to that used with the two $SS$ terms in the numerator of Formula 14.2 (page 254) for the pooled variance estimate, $s^2_p$. Since $SS_{within}$ always reflects only the pooled variability among subjects treated similarly, it can be referred to, more generally, as the sum of squares for random error and symbolized as $SS_{error}$.
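The three definitional formulas can be applied directly to a small data set. The sketch below uses the same three groups of scores as the code cell at the end of this section and assumes equal group sizes; the variable names are illustrative.

```python
groups = [[0, 4, 2], [3, 6, 6], [6, 8, 10]]  # equal group sizes assumed

n = len(groups[0])                            # scores per group
all_scores = [x for g in groups for x in g]
grand_mean = sum(all_scores) / len(all_scores)
group_means = [sum(g) / len(g) for g in groups]

# SS_total: every score about the grand mean
ss_total = sum((x - grand_mean) ** 2 for x in all_scores)

# SS_between: group means about the grand mean, weighted by n
ss_between = n * sum((m - grand_mean) ** 2 for m in group_means)

# SS_within: every score about its own group mean
ss_within = sum((x - m) ** 2
                for g, m in zip(groups, group_means) for x in g)

print(f'SS_total = {ss_total}')
print(f'SS_between = {ss_between}')
print(f'SS_within = {ss_within}')
```

For these scores the between and within terms add up to the total, which is the computational check that $SS_{total}$ provides.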

#### $\bullet$ Sum of Squares (SS): Computation Formulas

$SS=\Sigma(X-\overline{X})^2=\Sigma{X^2}-\dfrac{(\Sigma{X})^2}{n}$

where the total, ΣX, represents a key component in the conversion from means in the definition formulas to totals in the computation formulas.
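A brief check that the definitional and computation forms give the same result for a single set of scores is sketched below. The scores and variable names are illustrative assumptions.

```python
import math

scores = [6, 8, 10]  # illustrative scores
n = len(scores)

# Definitional form: squared deviations about the mean
mean = sum(scores) / n
ss_definitional = sum((x - mean) ** 2 for x in scores)

# Computation form: sum of squared scores minus (total squared) / n
sum_x = sum(scores)
sum_x_squared = sum(x ** 2 for x in scores)
ss_computation = sum_x_squared - sum_x ** 2 / n

print(f'definitional SS = {ss_definitional}')
print(f'computation SS  = {ss_computation}')

# Compare with a tolerance because both values are floats.
assert math.isclose(ss_definitional, ss_computation)
```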

### $\bullet$ WORD, DEFINITION, AND COMPUTATION FORMULAS FOR SS TERMS

$\bullet$ For the total sum of squares,

$SS_{total}=$ the sum of squared deviations for scores about the grand mean
           
$=\Sigma(X-\overline{X}_{grand})^2$

$SS_{total}=\Sigma{X^2}-\dfrac{G^2}{N}$, where $G$ is the grand total and $N$ is the grand (combined) sample size

$\bullet$ For the between sum of squares,

$SS_{between}=$ the sum of squared deviations for group means about the grand mean

$=n\Sigma(\overline{X}_{group}-\overline{X}_{grand})^2$

$SS_{between}=\Sigma\dfrac{T^2}{n}-\dfrac{G^2}{N}$, where $T$ is the group total and $n$ is its sample size

$\bullet$ For the within sum of squares,

$SS_{within}=$ the sum of squared deviations of scores about their respective group means

$=\Sigma(X-\overline{X}_{group})^2$

$SS_{within}=\Sigma{X^2}-\Sigma\dfrac{T^2}{n}$

#### $\bullet$ REMINDER:

$X=$ raw score

$T=$ group total

$n=$ group sample size

$G=$ grand total

$N=$ grand (combined) sample size

#### $\bullet$ FORMULAS FOR $df$ TERMS

$N$ = number of total scores for all the groups

$k$ = number of groups

$df_{total}=N-1$, that is, the number of all scores − 1

$df_{between}=k-1$, that is, the number of groups − 1

$df_{within}=N-k$, that is, the number of all scores − number of groups

DEGREES OF FREEDOM (ONE FACTOR)

$df_{total}=df_{between}+df_{within}$
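The $df$ formulas can be sketched as a small helper. The function name `df_terms` and the example of three groups with three scores each are assumptions made for illustration.

```python
def df_terms(group_sizes):
    """Return the df terms for a one-factor ANOVA, given each group's size."""
    k = len(group_sizes)   # number of groups
    N = sum(group_sizes)   # number of all scores
    return {'total': N - 1, 'between': k - 1, 'within': N - k}

df = df_terms([3, 3, 3])
print(df)

# The between and within terms should add up to the total.
assert df['total'] == df['between'] + df['within']
```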

In [16]:
# Function for calculating the SS terms (computation formulas)
import math

X1_0 = [0, 4, 2]
X2_24 = [3, 6, 6]
X3_48 = [6, 8, 10]

def SS_terms(a, b, c):
    # Group totals (T)
    total1 = sum(a)
    total2 = sum(b)
    total3 = sum(c)
    print(f'total1 = {total1}\ntotal2 = {total2}\ntotal3 = {total3}')

    # Grand total (G), combined sample size (N), and squared raw scores
    Xtotal = a + b + c
    G = sum(Xtotal)
    N = len(Xtotal)
    squares_of_Xtotal = [m**2 for m in Xtotal]
    print(squares_of_Xtotal)

    # Sum of T^2/n over groups, using each group's own sample size
    sum_T2_over_n = total1**2/len(a) + total2**2/len(b) + total3**2/len(c)

    SS_between = sum_T2_over_n - G**2/N
    print(f'SS_between = {SS_between}')

    SS_within = sum(squares_of_Xtotal) - sum_T2_over_n
    print(f'SS_within = {SS_within}')

    SS_total = sum(squares_of_Xtotal) - G**2/N
    print(f'SS_total = {SS_total}')

    # Compare with a tolerance, since the SS terms are floats
    accuracy_check = math.isclose(SS_between + SS_within, SS_total)
    print(f'accuracy_check = {accuracy_check}')

SS_terms(X1_0, X2_24, X3_48)

total1 = 6
total2 = 15
total3 = 24
[0, 16, 4, 9, 36, 36, 36, 64, 100]
SS_between = 54.0
SS_within = 22.0
SS_total = 76.0
accuracy_check = True
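
The SS terms printed above can be carried one step further, to mean squares and an $F$ ratio. The sketch below assumes that $F$ is formed as $MS_{between}/MS_{within}$, which matches the variability-between over variability-within ratio at the start of this section; the variable names are illustrative.

```python
# SS terms taken from the output above
SS_between = 54.0
SS_within = 22.0

k = 3  # number of groups
N = 9  # number of all scores

# Mean squares: each SS divided by its degrees of freedom
MS_between = SS_between / (k - 1)
MS_within = SS_within / (N - k)

# F ratio: variability between groups over variability within groups
F = MS_between / MS_within

print(f'MS_between = {MS_between}')
print(f'MS_within = {MS_within:.2f}')
print(f'F = {F:.2f}')
```

For these scores the $F$ ratio works out to about 7.36, a value well above 1.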
