# POLSCI 3

## Week 12, Lecture 1: Reading Regression Tables of Experimental Results


We're done learning new coding!

You now know how to do a lot:

- create two-way tables
- understand why observational data typically can't determine causality
- understand why experiments allow us to determine causality
- calculate an estimate, standard error, $p$-value, and confidence interval and interpret them
    - in an experiment
    - in a descriptive study
- estimate treatment effects among different subsets
- think about how experimental results might generalize to new samples
- understand how we can draw random samples from populations to make inferences about populations
- visualize data to understand relationships between variables
- run bivariate regressions to calculate a line of best fit
- run multivariate regressions to understand relationships between multiple variables and an outcome
- run multivariate regressions to analyze experiments and interpret them

In some situations in other classes and in life, you'll actually be running the code to get numbers from R and interpret them. But far more often, _other people_ will show you the output of a statistical procedure that they ran. And when they do this:
- they usually use regression; regression is by far the most common tool for analyzing data in political science (it is so flexible and can be used for experimental and non-experimental data)
- they usually don't just give you a screenshot of their R (or other statistical software) output; they format the output more nicely in a Table called a **regression table**

This week, we're going to see what **regression tables** look like and learn how to read them. Today we'll use one example from an observational study (our Week 11 dataset) and one example from an experimental study. There aren't really major differences in how the Tables are presented in each; I just think it's good to go through multiple examples.

Once you learn how to read a regression table, you're going to see them _all the time_ in your other classes!

### Reading regression tables

Let's start with some regressions we saw last week and see how they'd look as regression tables.

Here are screenshots of some regressions we made last week, predicting Democratic vote share in US House elections:

<img src="week11_l1_lm.png" width="50%">

-----

<img src="week11_a1_lm.png" width="50%">

As a **regression table** in a paper, they would look like this:

<img src="week11_both_nice_table.png" width="60%">

There are a few things about this table that are common across almost all regression tables:
- Each **column** reports the results of one regression. Multiple columns mean we are looking at the results of multiple different regressions (in this example, two). The **column titles** should tell you what is different about the regressions.
    - Sometimes this will be switching subsets of data (as in this example).
    - Sometimes it will involve switching _outcomes_.
    - Sometimes the outcome and subset will be the same, but new variables will be introduced into the regression.
    - You have to use context and the table notes to be sure what is changing across the columns.
- Each **row** represents the coefficients/estimates on one of the variables in the regression.
    - The **first column** shows the variable that is being estimated.
    - The **last row** of estimates contains the Intercept or Constant term (these are synonymous).
- In each **cell**, the first number is the coefficient, and the number in brackets below it is the standard error. These are the only numbers we need to calculate a t-statistic, confidence interval, and $p$-value.
- The **stars** next to each number indicate if the estimates are statistically significant and, if so, how small the $p$-value is. (The **Table Notes** along the bottom will tell you how to interpret the stars.)
- **Missing cells** mean that this variable was not included in this regression. Sometimes it will just be blank, sometimes there will be a "-" to indicate it is being left blank on purpose.
- The **sample size** ($N$) and a statistic called $R^2$ are printed at the bottom of each column.
    - (I'm not focusing on the $R^2$ statistic in this class much; see the end of the last lecture on correlation if you're interested in this. It measures how much of the variation in the outcome the right-hand-side variables can predict. Social scientists are rarely interested in this.)
- There is a **title** that should tell you what the table is about. (In practice, how helpful the title is varies.)
- There are **notes** at the bottom that help you interpret anything else about the table you need to know. (In practice, how helpful these notes are varies.)
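
To make the point about coefficients and standard errors concrete, here is a worked sketch with made-up numbers: a coefficient of 0.50 with a standard error of 0.20. These values are hypothetical and are not taken from any table in this lecture. Here $\hat{\beta}$ is just shorthand for the coefficient printed in a cell, and the sketch assumes the usual large-sample normal approximation, so the critical value for a 95% confidence interval is about 1.96.

$$
t = \frac{\hat{\beta}}{\text{SE}(\hat{\beta})} = \frac{0.50}{0.20} = 2.5
$$

$$
95\%\ \text{CI} = \hat{\beta} \pm 1.96 \times \text{SE}(\hat{\beta}) = 0.50 \pm 0.392 = [0.108,\ 0.892]
$$

Because $|t| = 2.5$ is larger than 1.96, the two-sided $p$-value is below 0.05. That matches the fact that the confidence interval excludes zero.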

### Example: An experiment on motivating hairdressers to distribute condoms in Africa

Now let's look at an example from a real study.

HIV is a huge problem in Africa, and governments and non-profits have tried to promote condom use to limit the spread of HIV.

In their paper "No margin, no mission? A field experiment on incentives for public service delivery," <a href="https://doi-org.libproxy.berkeley.edu/10.1016/j.jpubeco.2014.06.014">Ashraf et al. (2014)</a> describe the results of a collaboration with a non-profit that recruited hairdressers and barbers to provide information about HIV prevention and sell condoms in their shops.

In this week's lecture and activity notebook, we will examine Tables from this paper to understand its results. This is way easier than downloading the data and analyzing it ourselves, but still takes a bit of work.

#### Experimental Design

There are four groups in the experiment:

- control group
- large financial reward
- small financial reward
- non-financial reward ("star reward")

Here's the paper's description of the groups in the experiment:

> Agents in the control group receive no rewards, while agents in the three treatment groups receive financial margins at the bottom and the top of the feasible range, and non-financial rewards, respectively. The smaller and larger financial-margin treatments pay a 10% and 90% margin on each condom sale, respectively, whereas the non-financial scheme (“star” treatment) gives agents a “thermometer” display, showing condom sales and stamps, with one star stamp for each sale.

####  Table 3

![](AshrafTable3.png)

#### Reading this Table

- The title tells us that we are looking at effects on measures of effort.
- If there is a control group in the experiment, it is usually the omitted category in a regression. That appears to be the case here, as the other groups all show up in the regression but the control group doesn't. So, we can interpret the results as effects relative to the control group.
- As usual in regression tables, each column in the Table above reports a separate regression. What do the columns mean? In this case, each column has a different outcome measure, also known as a "dependent variable".
    - Let's focus on the first column as an example. The outcome for the first column is "Total displays", which the Table note helpfully tells us is the number of posters, brochures, "sold here" signs, flipcharts, condom dispensers, and certificates visible in the shop. This means the first column reports the results of running a regression where "total displays" is the outcome.
- Each of the rows in the main body of the Table shows you a coefficient and, below it, its standard error.
- The stars next to the coefficients tell us whether the results are statistically significant and at what level.
- What is "Controls: Yes"? You sometimes see this: instead of printing all the control variables (since we're not really interested in them and they don't have a causal interpretation -- they are just for precision), in experiments, we save space by just noting controls are included. Elsewhere in the article it says what the controls are. Remember: when analyzing experiments with regression, we don't care about what the coefficients on the control variables are, we just sometimes include them in case they might help increase the precision of our estimates.
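
One way to picture the regression behind each column is the sketch below. The notation is mine rather than the paper's, and it assumes each treatment group enters as a 0/1 indicator variable, with $X_i$ standing in for whatever controls the authors include:

$$
Y_i = \alpha + \beta_1 \, \text{Large}_i + \beta_2 \, \text{Small}_i + \beta_3 \, \text{Star}_i + \gamma' X_i + \varepsilon_i
$$

For an agent in the control group, all three indicators equal 0, so (holding the controls fixed) the predicted outcome is just $\alpha$. For an agent in the star group, it is $\alpha + \beta_3$. The difference between the two is $\beta_3$, which is why each coefficient in the Table can be read as an effect relative to the omitted control group.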