### Effect size

The "effect size" between two groups is a measure of how far apart the means of the two groups are. One common measure of effect size is Cohen's $d$, which is defined as

$d=\frac{\overline{x}_1 - \overline{x}_2}{s}$

where $\overline{x}_i$ is the mean of group $i$ and $s$ is the standard deviation of the dataset. Recall that the standard deviation is the square root of the variance.* 

Cohen's $d$ is one way to tell how far apart the means of the treatment and control groups are. In general, the bigger the effect size, the easier it will be to tell that the difference between groups is unlikely to be observed by chance, i.e. that the difference is "statistically significant" (more on those magic words to come).

*Note: some versions of Cohen's $d$ use a more complex formula for $s$ (for example, a pooled standard deviation). Let's stick with the simple one for now.
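
As a minimal sketch of the formula above, the following computes Cohen's $d$ for two small made-up groups. The function name `cohens_d` and the toy numbers are illustrative assumptions, and $s$ is taken here to be the standard deviation of both groups combined into one dataset, which is one reading of the simple definition.

```python
import numpy as np

def cohens_d(group1, group2):
    """Cohen's d with s taken as the standard deviation of the combined data."""
    group1 = np.asarray(group1, dtype=float)
    group2 = np.asarray(group2, dtype=float)
    # Assumption: "the dataset" means both groups concatenated together
    s = np.concatenate([group1, group2]).std()
    return (group1.mean() - group2.mean()) / s

# Toy data (hypothetical, for illustration only)
a = [2.0, 4.0, 6.0]
b = [1.0, 3.0, 5.0]
d = cohens_d(a, b)
print(d)
```

With these numbers the means differ by 1 and the combined standard deviation is about 1.71, so $d$ comes out near 0.59. Note that NumPy's `std` divides by $n$ by default; passing `ddof=1` would give the sample standard deviation instead.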

### Questions from last time
- What is Cohen's $d$ for years of experience between the white and black groups?
- Why is it important to include $s$ in measuring effect size?
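
One way to explore the second question is to change the units of the measurements and see what happens. The sketch below uses made-up numbers (not the experience data from last time) and a hypothetical helper `cohens_d`, with $s$ assumed to be the standard deviation of the combined data. Converting years to months multiplies the raw difference in means by 12, while $d$ stays the same.

```python
import numpy as np

def cohens_d(group1, group2):
    """Cohen's d with s taken as the standard deviation of the combined data."""
    combined = np.concatenate([group1, group2])
    return (np.mean(group1) - np.mean(group2)) / np.std(combined)

# Made-up measurements in years (illustrative only)
years_a = np.array([1.0, 3.0, 5.0, 7.0])
years_b = np.array([2.0, 4.0, 6.0, 10.0])

# The same measurements expressed in months
months_a = years_a * 12
months_b = years_b * 12

# The raw difference in means depends on the unit of measurement...
raw_years = years_a.mean() - years_b.mean()
raw_months = months_a.mean() - months_b.mean()

# ...but dividing by s removes the unit, so d is identical in both cases
d_years = cohens_d(years_a, years_b)
d_months = cohens_d(months_a, months_b)

print(raw_years, raw_months)
print(d_years, d_months)
```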

In [8]:
import pandas as pd
import altair as alt
import numpy as np

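np.random.seed(42)  # fix the seed so the simulated data are reproducible

# Parameters of the two simulated groups
mu1 = 0      # treatment mean
mu2 = -2     # control mean
std1 = 0.8   # treatment standard deviation
std2 = 1.8   # control standard deviation
N = 1000     # total number of samples, split evenly between the groups

# Generate the data: N/2 normally distributed draws per group
treatment = np.random.normal(mu1, std1, N // 2)
control = np.random.normal(mu2, std2, N // 2)

# Collect both groups in a wide-format DataFrame, one column per group
source = pd.DataFrame({
    'Treatment': treatment,
    'Control': control
})

# Fold the two columns into long format and overlay their histograms
alt.Chart(source).transform_fold(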
    ['Treatment', 'Control'],
    as_=['Experiment', 'Measurement']
).mark_area(
    opacity=0.3,
    interpolate='step'
).encode(
    alt.X('Measurement:Q', bin=alt.Bin(maxbins=100)),
    alt.Y('count()', stack=None),
    alt.Color('Experiment:N')
)

### Questions 

1. Write a function to compute Cohen's d, then use it to compute Cohen's d for the dataset above.

2. If you modified the code above so that `mu1=2`, what would you expect to happen to Cohen's d? Verify your guess by using your Cohen's d function. Make another plot like the one above with the new parameter `mu1=2`. What do you observe?

3. If you modified the code above so that `std1` and `std2` were both 0.01, what would you expect to happen to Cohen's d? Verify your guess by using your Cohen's d function. Make another plot like the one above with the new values of `std1` and `std2`. What do you observe?

4. Say the treatment has no effect. What would you expect the Cohen's d to be?

5. Modify the code above so that `mu1=mu2` and `std1=std2`. Compute Cohen's d. What do you observe? If you were to make a histogram like the one above, what do you think it would look like? Make an actual plot to check your guess.

6. Say the treatment has no effect. In that case, your treatment and control groups would each be draws from the same underlying data distribution (i.e. the distribution of the data, regardless of treatment). What would happen if you simulated data like this and made a histogram like the one above again? What do you think the plot would look like?
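
For the no-effect scenario in the last few questions, the sketch below repeats the simulation many times with both groups drawn from the same distribution and records Cohen's d each time. The helper `cohens_d`, the number of repeats, and the choice of a standard normal distribution are all assumptions made for illustration, with $s$ taken as the standard deviation of the combined data.

```python
import numpy as np

def cohens_d(group1, group2):
    """Cohen's d with s taken as the standard deviation of the combined data."""
    combined = np.concatenate([group1, group2])
    return (np.mean(group1) - np.mean(group2)) / np.std(combined)

rng = np.random.default_rng(0)  # seeded generator so the run is reproducible
n_per_group = 500               # matches N/2 in the code above
n_repeats = 200                 # hypothetical number of repeated experiments

# Each repeat draws both groups from the SAME normal distribution (no treatment effect)
d_values = np.array([
    cohens_d(rng.normal(0, 1, n_per_group), rng.normal(0, 1, n_per_group))
    for _ in range(n_repeats)
])

print("mean d:", d_values.mean())
print("largest |d|:", np.abs(d_values).max())
```

In a run like this the values of d scatter around 0 without being exactly 0, which gives a feel for how much d can vary by chance alone.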