
State of the State

This folder contains the data behind the story What America’s Governors Are Talking About.

index.csv lists each of the 50 speeches, one per state, along with the name and party of the state's governor and a link to an official source for the speech. If an official government source could not be found, we linked to a news media source that published a transcript of the speech.

The speeches/ folder contains 50 .txt files, one with the text of each speech.
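
As a minimal sketch of how the speech files might be read, the helper below loads every .txt file in a folder into a dictionary keyed by file name. The function name `load_speeches` and the UTF-8 encoding are assumptions for illustration; the file naming scheme inside speeches/ is not described here.

```python
from pathlib import Path

def load_speeches(folder="speeches"):
    """Map each file's stem (name without .txt) to its full text."""
    # Assumption: the files are UTF-8 encoded.
    return {
        path.stem: path.read_text(encoding="utf-8")
        for path in sorted(Path(folder).glob("*.txt"))
    }
```

Calling `load_speeches()` from inside this folder should return one entry per speech file.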

words.csv contains every one-word phrase that was mentioned in at least 10 speeches and every two- or three-word phrase that was mentioned in at least five speeches. Before counting, a list of stop words was removed and the word "healthcare" was replaced with "health care" so that the two were not counted as distinct phrases. The file also contains the results of a chi^2 test for each phrase: the test statistic and its associated p-value.
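
The counting rule described above could be sketched roughly as follows. This is not the code used to build words.csv: the stop-word list, the tokenizer, and the choice to drop stop words before forming phrases are all assumptions, and every function name is hypothetical.

```python
import re
from collections import Counter

# Hypothetical, tiny stop-word list; the list actually used is not given here.
STOP_WORDS = {"the", "a", "an", "of", "and", "to", "in", "is", "our", "we"}

def tokenize(text):
    """Lowercase, normalize 'healthcare' to 'health care', drop stop words."""
    text = text.lower().replace("healthcare", "health care")
    words = re.findall(r"[a-z']+", text)
    return [w for w in words if w not in STOP_WORDS]

def ngrams(words, n):
    """Return the set of distinct n-word phrases in a token list."""
    return {" ".join(words[i:i + n]) for i in range(len(words) - n + 1)}

def speech_counts(speeches, n):
    """Count how many speeches contain each n-gram (not total occurrences)."""
    counts = Counter()
    for text in speeches:
        counts.update(ngrams(tokenize(text), n))
    return counts

def frequent_phrases(speeches, min_one=10, min_multi=5):
    """Keep 1-grams in >= min_one speeches and 2-/3-grams in >= min_multi."""
    kept = {}
    for n in (1, 2, 3):
        threshold = min_one if n == 1 else min_multi
        for phrase, count in speech_counts(speeches, n).items():
            if count >= threshold:
                kept[phrase] = count
    return kept
```

Because each speech contributes a set of phrases, a phrase repeated many times in one speech still counts once, which matches the "mentioned in at least N speeches" wording.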

| Column | Definition |
| --- | --- |
| n-gram | one-, two- or three-word phrase |
| category | thematic categories for n-grams hand-coded by FiveThirtyEight staff: economy/fiscal issues, education, health care, energy/environment, crime/justice, mental health/substance abuse |
| d_speeches | number of Democratic speeches containing the n-gram |
| r_speeches | number of Republican speeches containing the n-gram |
| total | total number of speeches containing the n-gram |
| percent_of_d_speeches | percent of the 23 Democratic speeches containing the phrase |
| percent_of_r_speeches | percent of the 27 Republican speeches containing the phrase |
| chi2 | chi^2 statistic |
| pval | p-value for chi^2 test |
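
To illustrate how the percent and chi^2 columns relate to the speech counts, here is a sketch that computes them for a single phrase. It assumes a plain Pearson chi^2 test on a 2x2 table (party by whether the speech contains the phrase) with no continuity correction; the exact test settings behind words.csv are not stated here, so values in the file may differ. The function names are hypothetical.

```python
import math

D_TOTAL = 23  # Democratic speeches, per the column definitions
R_TOTAL = 27  # Republican speeches, per the column definitions

def percent(count, total):
    """Share of speeches containing the phrase, as a percentage."""
    return 100.0 * count / total

def chi2_2x2(d_speeches, r_speeches, d_total=D_TOTAL, r_total=R_TOTAL):
    """Pearson chi^2 (no continuity correction) for party vs. phrase presence."""
    observed = [
        [d_speeches, d_total - d_speeches],
        [r_speeches, r_total - r_speeches],
    ]
    n = d_total + r_total
    row_sums = [sum(row) for row in observed]
    col_sums = [sum(col) for col in zip(*observed)]
    chi2 = 0.0
    for i in range(2):
        for j in range(2):
            expected = row_sums[i] * col_sums[j] / n
            if expected == 0:
                continue  # phrase in every speech or none: no information
            chi2 += (observed[i][j] - expected) ** 2 / expected
    # With 1 degree of freedom, the chi^2 survival function is erfc(sqrt(x / 2)).
    pval = math.erfc(math.sqrt(chi2 / 2))
    return chi2, pval
```

A phrase used at similar rates by both parties gives a chi^2 statistic near zero and a p-value near 1, while a phrase used almost exclusively by one party gives a large statistic and a small p-value.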