## Coffee and Code 2: Murders and Executions

Amy Binner

Download `countymurders.csv` from ELE.

### Load the data

In [None]:
mydata <- read.csv("countymurders.csv")
head(mydata)

### Plot Murders vs Executions

In [None]:
# Set the plot size to 6 x 4 (width x height)
options(repr.plot.width=6, repr.plot.height=4)

plot(murders~execs, data=mydata, main = "County level murders vs executions in US, 1996")

### Estimate the equation

$$ murders_i = \beta_0 + \beta_1 execs_i + u_i $$

by OLS and report the results

In [None]:
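library(AER)        # also loads sandwich, which provides vcovHC()
library(stargazer)

reg1 <- lm(murders ~ execs, data = mydata)

# Heteroskedasticity-robust (HC1) standard errors
vcov_hc1  <- vcovHC(reg1, type = "HC1")
robust_se <- sqrt(diag(vcov_hc1))

# Same model in both columns: default SEs first, robust SEs second
stargazer(reg1, reg1, type = "text", se = list(NULL, robust_se),
          column.labels = c("Default SE", "Robust SE"))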

### Interpret the coefficients

(double click to enter your answer)

The slope coefficient is positive: each additional execution is associated with an expected increase of 65.47 murders (to 2 decimal places), holding everything else constant.
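
As a rough illustration of how to read the slope and intercept, the sketch below fits the same kind of model to a tiny made-up data frame. The `toy` data and every number in it are invented for illustration and are not the county data. It checks that the gap between the predictions at 0 and 1 executions is the slope, and that the prediction at 0 executions is the intercept.

```r
# Toy data, invented purely for illustration (NOT the county data)
toy <- data.frame(execs   = c(0, 0, 1, 2, 3),
                  murders = c(2, 4, 9, 15, 18))

fit <- lm(murders ~ execs, data = toy)

# Predicted murders at 0 and at 1 executions
pred <- predict(fit, newdata = data.frame(execs = c(0, 1)))

# The difference between the two predictions is the slope coefficient
slope_from_pred <- unname(pred[2] - pred[1])
slope_from_coef <- unname(coef(fit)["execs"])
print(c(from_predictions = slope_from_pred, from_coef = slope_from_coef))

# The prediction at execs = 0 is the intercept
intercept_from_pred <- unname(pred[1])
intercept_from_coef <- unname(coef(fit)["(Intercept)"])
print(c(from_prediction = intercept_from_pred, from_coef = intercept_from_coef))
```

The same reading applies to `reg1`: `coef(reg1)` returns the intercept and slope that the regression table reports.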

### Does the estimated slope coefficient suggest a deterrent effect of capital punishment (execution) on murders? 

In [None]:
# plot the data and the estimated regression line
plot(mydata$murders~mydata$execs, 
     main="County-level execution and murder cases in the US, 1996",
     ylab = "Murder cases",
     xlab = "Executions or capital punishments")
abline(reg1, col = "skyblue", lwd = 2)

In [None]:
# Predicted murders for a county with zero executions (this equals the intercept)
new <- data.frame(execs = 0)
predict(reg1, new)

### Calculate the mean residual for counties with 0 murders

In [None]:
# Residuals for counties with zero murders, then their mean
residm0 <- residuals(reg1)[mydata$murders == 0]
mean(residm0)

### Calculate the mean residual for counties with 0 murders and 0 executions

In [None]:
# Residuals for counties with zero murders and zero executions, then their mean
reside0 <- residuals(reg1)[mydata$murders == 0 & mydata$execs == 0]
mean(reside0)
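
One way to see what these means capture is to write out the residual from the estimated equation, where $\hat{\beta}_0$ and $\hat{\beta}_1$ are the OLS estimates and $\hat{u}_i$ is the residual for county $i$:

$$ \hat{u}_i = murders_i - \hat{\beta}_0 - \hat{\beta}_1 execs_i $$

For a county with $murders_i = 0$ this reduces to $\hat{u}_i = -(\hat{\beta}_0 + \hat{\beta}_1 execs_i)$. If $execs_i = 0$ as well, it reduces further to

$$ \hat{u}_i = -\hat{\beta}_0 $$

So every county with zero murders and zero executions has the same residual, equal to minus the estimated intercept. Whenever $\hat{\beta}_0 > 0$, the regression over-predicts murders for these counties.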

### How many counties had zero murders in 1996? How many had at least one execution?

In [None]:
# Count the counties with zero murders using length() and which()
cat("\nZero murders =", length(which(mydata$murders == 0)))

# Count the counties with at least one execution
cat("\nAt least one execution =", length(which(mydata$execs >= 1)))

### Explain why a simple regression analysis is not well suited for determining whether capital punishment has a deterrent effect on murders.

(double click to enter answer)

- Missing explanatory variables
- Factors influencing both executions and murders
- Little variation in executions (low variance)
- Nonlinear dependent variable (murders is a non-negative count, so a straight line can fit it poorly)
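
The second point can be illustrated with a small simulation. This is a stylised sketch on simulated data, not the county data: `size` is a hypothetical common factor (think of county size), the simulated `execs` is a continuous stand-in for executions, and the coefficients -1 and 3 are arbitrary choices. The true effect of `execs` on `murders` is negative by construction, yet the simple regression that omits `size` estimates a positive slope.

```r
set.seed(1)  # arbitrary seed so the sketch is reproducible
n <- 2000

# 'size' is a hypothetical common factor that raises both variables
size    <- rnorm(n)
execs   <- size + rnorm(n)                    # stylised stand-in for executions
murders <- -1 * execs + 3 * size + rnorm(n)   # true effect of execs is -1 by construction

simple     <- lm(murders ~ execs)             # omits the common factor
controlled <- lm(murders ~ execs + size)      # includes the common factor

slope_simple     <- unname(coef(simple)["execs"])
slope_controlled <- unname(coef(controlled)["execs"])
print(c(simple = slope_simple, controlled = slope_controlled))
```

In this setup the omitted factor pushes the simple-regression slope above zero, while including it recovers a negative slope close to the value built into the simulation. A positive slope in a simple regression therefore says little about deterrence on its own.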

In [None]:
hist(mydata$murders)

# Adding title, axis labels, axis limits, non-default breaks
hist(mydata$murders, breaks = 200, xlim = c(0,100), main = "Histogram: Murders", xlab = "Murders")

In [None]:
library(ggplot2)
ggplot(mydata, aes(x=murders)) + 
    geom_histogram(fill="skyblue", alpha=0.5, bins = 15)+
    ggtitle("Histogram of Murders by US county in 1996") +
    xlim(0,100)+
    ylim(0,10000)+
    theme_minimal()