# Effect size

## <a name="overview"></a> Overview

In this section we go over the topic of <a href="https://en.wikipedia.org/wiki/Effect_size">effect size</a>. According to Wikipedia, _...an effect size is a value measuring the strength of the relationship between two variables in a population, or a sample-based estimate of that quantity._ In other words, an effect size is an index of how strongly the dependent and independent variables are related.

The indices of effect size are important because they answer questions such as [1]:

- Does the independent variable have a powerful effect on the dependent variable?
- How strong is the relationship between the dependent and independent variables?
- By how much is the prediction error reduced if we use scores on one variable to predict scores on another?

## <a name="sec1"></a> Effect size

There are many indices of effect size [1]: $\eta^2$, $r^2$, $\omega^2$ and <a href="https://en.wikiversity.org/wiki/Cohen%27s_d">Cohen's $d$ statistic</a>, to name a few. These indices differ primarily in the type of data they are appropriate for and the information they provide.

Compared to a null hypothesis significance test, an index of effect size may be less likely to be misinterpreted [1]. This is because the outcome of a significance test depends strongly on the sample size, whereas an index of effect size does not.
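The sample-size dependence can be illustrated with a small sketch. It assumes a two-sample, pooled-variance $t$-test with equal group sizes, for which the test statistic can be written as $t = d\sqrt{n/2}$, where $d$ is the standardized mean difference and $n$ is the number of observations per group. The function name `t_statistic_from_d` is hypothetical and used only for illustration.

```python
import math


def t_statistic_from_d(d: float, n_per_group: int) -> float:
    """t statistic of an equal-n, pooled-variance two-sample t-test.

    Assumes both groups have n_per_group observations, so t = d * sqrt(n / 2).
    """
    return d * math.sqrt(n_per_group / 2)


# The same standardized difference, observed with increasingly large samples.
d = 0.2
for n in (10, 100, 1000):
    print(f"n per group = {n:4d}, d = {d}, t = {t_statistic_from_d(d, n):.2f}")
```

The standardized difference stays at 0.2 throughout, while the test statistic grows with $\sqrt{n}$. A difference that is far from significant with 10 observations per group can therefore become highly significant with 1000, even though its size has not changed.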

We have already mentioned that there are a number of indices of effect size. The majority of them can be classified into one of two categories [1]:

- Standardized difference indices
- Variance accounted for indices

### <a name="subsec1"></a> Standardized difference indices

An index in this category indicates the size of the difference between two means, measured in units of standard deviation [1]. A very common example is Cohen's $d$ statistic. For a study with two independent samples, the $d$ statistic is computed as

$$d = \frac{\bar{x}_1 - \bar{x}_2}{s_p}$$

where $\bar{x}_1$ and $\bar{x}_2$ are the two sample means and $s_p$ is the so-called pooled estimate of the population standard deviation. Other statistics in this category include <a href="https://www.itl.nist.gov/div898/software/dataplot/refman2/auxillar/hedgeg.htm">Hedges' $g$ statistic</a> and Glass's $\Delta$ statistic.
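As a minimal sketch of the computation, the snippet below evaluates $d$ for two small samples. The text does not spell out $s_p$; the code assumes the usual pooled estimate $s_p = \sqrt{\frac{(n_1 - 1)s_1^2 + (n_2 - 1)s_2^2}{n_1 + n_2 - 2}}$, where $s_1^2$ and $s_2^2$ are the sample variances. The function name `cohens_d` and the data are made up for illustration.

```python
import math
import statistics


def cohens_d(sample1, sample2):
    """Cohen's d for two independent samples, using a pooled standard deviation."""
    n1, n2 = len(sample1), len(sample2)
    # statistics.variance returns the sample variance (n - 1 in the denominator).
    var1 = statistics.variance(sample1)
    var2 = statistics.variance(sample2)
    # Assumed definition of the pooled standard deviation s_p.
    pooled_sd = math.sqrt(((n1 - 1) * var1 + (n2 - 1) * var2) / (n1 + n2 - 2))
    return (statistics.mean(sample1) - statistics.mean(sample2)) / pooled_sd


# Illustrative data only.
treatment = [5.0, 6.0, 7.0, 8.0, 9.0]
control = [3.0, 4.0, 5.0, 6.0, 7.0]
print(f"d = {cohens_d(treatment, control):.3f}")
```

Here the means differ by 2 and the pooled standard deviation is about 1.58, so the two groups are roughly 1.26 standard deviations apart.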

### <a name="subsec2"></a> Variance accounted for indices

In this category, an index communicates the proportion of variance in the dependent variable that is accounted for by the independent variables. Recall for example the <a href="https://en.wikipedia.org/wiki/Coefficient_of_determination">**coefficient of determination**</a> $r^2$ from simple linear regression. When the independent variables do a poor job of explaining the dependent variable, an index in this category will typically be close to zero. On the other hand, if the independent variables perfectly account for the scores on the dependent variable, such an index will typically be one, or $100\%$.
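To make the idea concrete, here is a small sketch that computes $r^2$ for a simple linear regression as $1 - SS_{res}/SS_{tot}$, where $SS_{res}$ is the sum of squared residuals of the least-squares line and $SS_{tot}$ is the total sum of squares around the mean. The function name `r_squared` and the data are illustrative assumptions, not taken from [1].

```python
import statistics


def r_squared(x, y):
    """Coefficient of determination for a simple least-squares line y = a + b * x."""
    mean_x, mean_y = statistics.mean(x), statistics.mean(y)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    # Variation left unexplained by the fitted line.
    ss_res = sum((yi - (intercept + slope * xi)) ** 2 for xi, yi in zip(x, y))
    # Total variation of y around its mean.
    ss_tot = sum((yi - mean_y) ** 2 for yi in y)
    return 1 - ss_res / ss_tot


x = [1.0, 2.0, 3.0, 4.0]
print(r_squared(x, [2.0, 4.0, 6.0, 8.0]))  # y is an exact linear function of x
print(r_squared(x, [1.0, 3.0, 2.0, 4.0]))  # y is only partly explained by x
```

The first call gives an $r^2$ of one because the line fits perfectly. In the second, the fitted line accounts for about 64% of the variance in $y$.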

### <a name="subsec3"></a> Interpreting the calculated indices

Suppose we have calculated the index or indices that we see fit for our study. We now need to interpret the results. We won't go into much detail here. For further insights on the topic the reader is referred to [1] and the references therein.

For standardized difference indices, Cohen provided several real-world examples of differences, which he classified as small, medium and large [1]. The size of the effect would be classified as follows [1]:

| d           | Size of the effect |
| ----------- | ------------------ |
| $\pm$ 0.2   | Small              |
| $\pm$ 0.5   | Medium             |
| $\pm$ 0.8   | Large              |

Cohen also gave the following criteria for interpreting the size of the Pearson correlation coefficient [1]

| $\rho$           | Size of the effect |
| ----------- | ------------------ |
| $\pm$ 0.1   | Small              |
| $\pm$ 0.3   | Medium             |
| $\pm$ 0.5   | Large              |
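The two tables can be turned into a small lookup helper. The tables list only benchmark values, so the sketch below makes an assumption: each benchmark is treated as the lower bound of its category, and the sign is ignored. The label `"below small"` for magnitudes under the first benchmark, the function name `classify_effect` and the constant names are all hypothetical.

```python
# Cohen's benchmarks from the tables above, as (lower bound, label) pairs.
COHEN_D = [(0.2, "small"), (0.5, "medium"), (0.8, "large")]
PEARSON_R = [(0.1, "small"), (0.3, "medium"), (0.5, "large")]


def classify_effect(value, thresholds):
    """Return the label of the largest benchmark that |value| reaches.

    Assumption: benchmarks act as lower bounds, and the sign of the effect
    does not matter for its size.
    """
    magnitude = abs(value)
    label = "below small"  # hypothetical label, not part of Cohen's scheme
    for cutoff, name in thresholds:
        if magnitude >= cutoff:
            label = name
    return label


print(classify_effect(-0.6, COHEN_D))    # d = -0.6
print(classify_effect(0.35, PEARSON_R))  # correlation of 0.35
```

Both calls return `medium`. These cut-offs are rules of thumb, so a label obtained this way is a starting point for interpretation rather than a verdict.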

## <a name="sum"></a> Summary

In this section we reviewed the topic of effect size. An effect size is a value measuring the strength of the relationship between two variables in a population [2]. Thus when we think effect size we should think strength [1]. There are many indices of effect size. Most of them fall into one of two categories:

- Standardized difference indices
- Variance accounted for indices

The indices of effect size are important because they answer questions such as:

- Does the independent variable have a powerful effect on the dependent variable?
- How strong is the relationship between the dependent and independent variables?
- By how much is the prediction error reduced if we use scores on one variable to predict scores on another?

## <a name="refs"></a> References

1. Larry Hatcher, _Advanced statistics in research_, Shadow Finch Media.
2. <a href="https://en.wikipedia.org/wiki/Effect_size">Effect size</a>