######  An effect size refers to the size or magnitude of an effect or result as it would be expected to occur in a population. The effect size is estimated from samples of data. 

Two main groups of methods for calculating effect size are:
- Association. Statistical methods for quantifying an association between variables (e.g. correlation).
- Difference. Statistical methods for quantifying the difference between variables (e.g. difference between means).


Three types of calculated result include:
- Standardized Result. The effect size has a standard scale, allowing it to be interpreted generally regardless of application (e.g. Cohen's d calculation).
- Original Units Result. The effect size may use the original units of the variable, which can aid interpretation within the domain (e.g. difference between two sample means).
- Unit Free Result. The effect size may have no units, such as a count or proportion (e.g. a correlation coefficient).
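
As a rough illustration of the first two result types, the sketch below reports the same effect twice for two small made-up samples: once as a raw difference between means (original units) and once divided by the pooled sample standard deviation (standardized). The sample values, the centimetre unit, and the variable names are assumptions for illustration only. The sketch uses only the standard library.

```python
from math import sqrt
from statistics import mean, variance

# hypothetical measurements in centimetres (illustrative values only)
group_a = [172.0, 168.0, 181.0, 175.0, 169.0]
group_b = [165.0, 170.0, 162.0, 171.0, 167.0]

# original units result: difference between the sample means, in centimetres
raw_diff = mean(group_a) - mean(group_b)

# standardized result: the same difference divided by the pooled
# sample standard deviation (statistics.variance uses n - 1)
n_a, n_b = len(group_a), len(group_b)
pooled_sd = sqrt(((n_a - 1) * variance(group_a) + (n_b - 1) * variance(group_b)) / (n_a + n_b - 2))
standardized = raw_diff / pooled_sd

print('Difference in means (cm): %.3f' % raw_diff)
print('Standardized difference: %.3f' % standardized)
```

The raw difference here is 6 cm, which is easy to read within the domain. The standardized value has no units and can be compared across studies that measure on different scales.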

### Association Effect Size

In [1]:
# calculate Pearson's correlation between two variables
from numpy.random import randn
from numpy.random import seed
from scipy.stats import pearsonr
# seed random number generator
seed(1)
# prepare data
data1 = 10 * randn(10000) + 50
data2 = data1 + (10 * randn(10000) + 50)
# calculate Pearson's correlation
corr, _ = pearsonr(data1, data2)
print('Pearsons correlation: %.3f' % corr)

 Pearsons correlation: 0.712 
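
As a sanity check on this value, the expected correlation can be worked out from how the data were built. This is a sketch that assumes the two `randn` draws are independent and writes $X$ for `data1` and $E$ for the added noise, both with variance $\sigma^2 = 10^2$. These symbols are introduced here for illustration.

```latex
\rho(X, X + E)
  = \frac{\operatorname{Cov}(X, X + E)}{\sqrt{\operatorname{Var}(X)\,\operatorname{Var}(X + E)}}
  = \frac{\sigma^2}{\sqrt{\sigma^2 \cdot 2\sigma^2}}
  = \frac{1}{\sqrt{2}}
  \approx 0.707
```

The sample estimate of 0.712 is close to this population value, as expected with 10,000 observations.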


### Difference Effect Size

In [2]:
# calculate Cohen's d between two samples
from numpy.random import randn
from numpy.random import seed
from numpy import mean
from numpy import var
from math import sqrt
# function to calculate Cohen's d for independent samples
def cohend(d1, d2):
    # calculate the size of samples
    n1, n2 = len(d1), len(d2)
    # calculate the variance of the samples
    s1, s2 = var(d1, ddof=1), var(d2, ddof=1)
    # calculate the pooled standard deviation
    s = sqrt(((n1 - 1) * s1 + (n2 - 1) * s2) / (n1 + n2 - 2))
    # calculate the means of the samples
    u1, u2 = mean(d1), mean(d2)
    # calculate the effect size
    return (u1 - u2) / s
# seed random number generator
seed(1)
# prepare data
data1 = 10 * randn(10000) + 60
data2 = 10 * randn(10000) + 55
# calculate Cohen's d
d = cohend(data1, data2)
print('Cohens d: %.3f' % d)

 Cohens d: 0.500 
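
The calculation in `cohend()` can be written out as follows. Here $\bar{x}_1, \bar{x}_2$ are the sample means (`u1`, `u2`), $s_1^2, s_2^2$ are the sample variances (`s1`, `s2` in the code, which hold variances rather than standard deviations), and $s$ is the pooled standard deviation. The symbol names are chosen here for illustration.

```latex
s = \sqrt{\frac{(n_1 - 1)\,s_1^2 + (n_2 - 1)\,s_2^2}{n_1 + n_2 - 2}},
\qquad
d = \frac{\bar{x}_1 - \bar{x}_2}{s}
```

With the population values used to generate the data (means of 60 and 55, standard deviation of 10 in both groups), the expected effect size is $(60 - 55) / 10 = 0.5$, which matches the printed result.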


###### Two other popular methods for quantifying the difference effect size are:
- Odds Ratio. Compares the odds of an outcome occurring under one treatment with the odds under another.
- Relative Risk Ratio. Compares the probability of an outcome occurring under one treatment with the probability under another.
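
Both measures can be computed from a 2x2 table of counts. The sketch below is a minimal illustration, not a complete analysis: the counts are made up, and the helper names `odds_ratio` and `relative_risk` are hypothetical rather than taken from any library.

```python
# hypothetical 2x2 table of outcome counts (illustrative values only)
#               outcome   no outcome
# treatment A      20         80
# treatment B      10         90
a_event, a_no_event = 20, 80
b_event, b_no_event = 10, 90

# odds ratio: odds (events / non-events) in A divided by odds in B
def odds_ratio(e1, n1, e2, n2):
    return (e1 / n1) / (e2 / n2)

# relative risk: probability (events / total) in A divided by probability in B
def relative_risk(e1, n1, e2, n2):
    return (e1 / (e1 + n1)) / (e2 / (e2 + n2))

print('Odds ratio: %.3f' % odds_ratio(a_event, a_no_event, b_event, b_no_event))
print('Relative risk: %.3f' % relative_risk(a_event, a_no_event, b_event, b_no_event))
```

For these counts the odds ratio is 2.25 and the relative risk is 2.0. The two measures differ because odds and probabilities diverge when the outcome is not rare.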