# W8: Confidence Intervals

This week we will focus on understanding confidence intervals. Standard errors give us some sense of how much our estimate could vary due to chance, but they are hard to interpret on their own. Confidence intervals are easier to interpret and allow us to make inferences about our results.

Today, we will focus on: 
- Calculating confidence intervals
- Interpreting confidence intervals
- Drawing inference from confidence intervals

For this, we will use the experimental data from this week's lecture.

## What are confidence intervals?

A 95 percent confidence interval is constructed so that, over repeated samples, 95 percent of the intervals built this way contain the true average treatment effect.
- In every confidence interval, the lower bound indicates the most pessimistic plausible outcome, whereas the upper bound indicates the most optimistic plausible outcome.
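
The "repeated samples" idea can be checked with a small simulation. The sketch below uses made-up numbers (a hypothetical true effect of 0.05, 500 precincts per group, and a 40 percent baseline), not the lecture data, and base R's `t.test()` rather than `estimatr`. It draws many samples, builds a 95 percent confidence interval from each, and counts how often the interval contains the true effect.

```r
# Coverage simulation: how often does a 95% CI contain the true effect?
# All numbers below are hypothetical and chosen only for illustration.
set.seed(8)                 # arbitrary seed so the run is reproducible
true_effect <- 0.05         # assumed true average treatment effect
n_per_group <- 500          # assumed number of precincts per group
n_sims      <- 2000         # number of repeated samples

covered <- logical(n_sims)
for (i in seq_len(n_sims)) {
  # Binary outcome: 1 if at least one woman is selected, 0 otherwise
  control <- rbinom(n_per_group, size = 1, prob = 0.40)
  treated <- rbinom(n_per_group, size = 1, prob = 0.40 + true_effect)

  # t.test(x, y) returns a 95% CI for mean(x) - mean(y)
  ci <- t.test(treated, control)$conf.int
  covered[i] <- ci[1] <= true_effect && true_effect <= ci[2]
}

# Share of simulated intervals that contain the true effect
coverage <- mean(covered)
coverage
```

The share should land close to 0.95. Any single interval either contains the true effect or it does not; the 95 percent describes the procedure across many samples.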

In [1]:
#load package and dataset 
library(estimatr)
women <- read.csv("ps3_week8_electing_women.csv")
head(women)

| | unique_id | treat | prop_sd_fem2014 | sd_onefem2014 | county | pc_male | mormon |
|---|---|---|---|---|---|---|---|
| | `<int>` | `<chr>` | `<dbl>` | `<int>` | `<chr>` | `<int>` | `<int>` |
| 1 | 27215 | supply | 0.0 | 0 | Grand | 1 | 0 |
| 2 | 27386 | control | 0.0 | 0 | Grand | 0 | 0 |
| 3 | 27496 | control | 1.0 | 1 | Grand | 1 | 0 |
| 4 | 16202 | demand | 1.0 | 1 | Daggett | 1 | 1 |
| 5 | 16241 | control | 0.5 | 1 | Daggett | 1 | 0 |
| 6 | 26601 | control | 0.0 | 0 | Emery | 1 | 0 |


As a reminder, here is what each variable means: 

- `unique_id`: Precinct ID
- `treat`: treatment variable
    - `'control'`: control group
    - `'supply'`: supply group; party chair instructed to recruit 2-3 women
    - `'demand'`: demand group; party chair reads letter at precinct convention
    - `'both'`: a fourth group getting both the supply and demand treatments; party chair instructed to read letter *and* to recruit 2-3 women
- `prop_sd_fem2014`: Proportion of 2014 elected state delegates from that precinct who were women
- `sd_onefem2014`: 1 if at least one woman was selected; 0 otherwise
- `county`: County name in Utah
- `pc_male`: 1 if precinct chair is male; 0 otherwise (precinct chair is person who runs precinct meeting, would read letter if assigned to do so, etc.)

Calculate the effect of the `both` treatment condition, relative to the control condition, on at least one woman being elected in 2014 (`sd_onefem2014`).

In [None]:
both.dim <- NULL #your code here
both.dim

Now extract the lower bound and the upper bound of the confidence interval. (Hint: Refer to the cheatsheet if you don't remember how to do this.)

In [None]:
ci.lower <- NULL #your code here
ci.lower

In [None]:
ci.upper <- NULL #your code here
ci.upper
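
To see where such bounds come from, recall the recipe: estimate ± critical value × standard error. The sketch below applies it to simulated toy data (not the `women` dataset) using base R only. The group sizes and probabilities are hypothetical, and the Welch degrees of freedom are an assumption of this sketch, so small differences from what `estimatr` reports are possible.

```r
# Toy data, NOT the women dataset: hypothetical binary outcomes for two groups
set.seed(8)
control <- rbinom(400, size = 1, prob = 0.40)
both    <- rbinom(400, size = 1, prob = 0.47)

# Difference in means and its standard error
estimate <- mean(both) - mean(control)
v_b <- var(both) / length(both)          # squared SE of the 'both' group mean
v_c <- var(control) / length(control)    # squared SE of the control group mean
se  <- sqrt(v_b + v_c)

# Welch-Satterthwaite degrees of freedom and the 95% critical value
df   <- (v_b + v_c)^2 /
  (v_b^2 / (length(both) - 1) + v_c^2 / (length(control) - 1))
crit <- qt(0.975, df)

# Confidence interval: estimate plus or minus critical value times SE
manual_ci <- c(lower = estimate - crit * se, upper = estimate + crit * se)
manual_ci

# Cross-check against base R's Welch t-test
welch_ci <- t.test(both, control)$conf.int
welch_ci
```

The two intervals should agree, which confirms that the bounds are just the estimate shifted down and up by the same margin.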

If you were going to recommend rolling out the letter with both the supply and demand information to 3000 precincts, what is the range of possible outcomes you could achieve? 

In [None]:
best.outcome <- NULL #your code here
best.outcome 

In [None]:
worst.outcome <- NULL #your code here
worst.outcome 
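
One way to approach the rollout question is to scale each bound by the number of precincts. The sketch below uses hypothetical bounds (0.01 and 0.09) that are not the answer from the `women` data; the variable names are also made up for illustration.

```r
# Hypothetical confidence interval bounds, NOT the result from the women data
ci_lower_example <- 0.01   # assumed lower bound on the effect
ci_upper_example <- 0.09   # assumed upper bound on the effect
n_precincts      <- 3000   # number of precincts in the rollout

# Each bound is a change in the probability that a precinct selects at least
# one woman, so multiplying by the number of precincts gives the implied
# number of additional precincts with at least one woman selected.
worst_example <- ci_lower_example * n_precincts
best_example  <- ci_upper_example * n_precincts
c(worst = worst_example, best = best_example)
```

With these made-up bounds, the range runs from about 30 to about 270 additional precincts.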

Interpret your results: 

- What does your estimate mean?
- Does your t-statistic indicate that the estimate is statistically significant?
- Does your p-value indicate that the estimate is statistically significant?
- What does your confidence interval mean?
- How would you describe the range of possible outcomes if you were to roll out the `both` letter to 3000 precincts?