Skip to content

Commit

Permalink
update READMEs
Browse files Browse the repository at this point in the history
  • Loading branch information
dmil committed Feb 9, 2018
1 parent 2200622 commit fd36e47
Show file tree
Hide file tree
Showing 40 changed files with 150 additions and 121 deletions.
4 changes: 4 additions & 0 deletions congress-age/README.md
Original file line number Diff line number Diff line change
@@ -1,3 +1,7 @@
# Congress Age

This folder contains the data behind the story [Both Republicans And Democrats Have an Age Problem](https://fivethirtyeight.com/features/both-republicans-and-democrats-have-an-age-problem/)

`congress-terms.csv` has an entry for every member of congress who served at any point during a particular congress between January 1947 and Februrary 2014.

House membership data is from the [@unitedstates project](http://theunitedstates.io/), with Congress meeting numbers added using code from [GovTrack](https://www.govtrack.us/developers/api):
Expand Down
2 changes: 1 addition & 1 deletion congress-generic-ballot/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -5,4 +5,4 @@ files:
---
# Congress Generic Ballot Polls

This contains the raw data behind "[Are Democrats Winning The Race For Congress?](https://projects.fivethirtyeight.com/congress-generic-ballot-polls/)"
This readme contains links to the data behind [Are Democrats Winning The Race For Congress?](https://projects.fivethirtyeight.com/congress-generic-ballot-polls/). For the latest version of this updating data set, visit the links at the top of this README.
2 changes: 1 addition & 1 deletion congress-resignations/README.md
Original file line number Diff line number Diff line change
@@ -1,6 +1,6 @@
# Congressional Resignations

Data behind the story [We’ve Never Seen Congressional Resignations Like This Before](https://fivethirtyeight.com/features/more-people-are-resigning-from-congress-than-at-any-time-in-recent-history/).
This folder contains data behind the story [We’ve Never Seen Congressional Resignations Like This Before](https://fivethirtyeight.com/features/more-people-are-resigning-from-congress-than-at-any-time-in-recent-history/).

`congressional_resignations.csv` contains information about the 615 members of Congress who resigned or were removed from office from March 4, 1901 (the first day of the 57th Congress) through January 15, 2018, including the resigning member’s party and district, the date they resigned, the reason for their resignation and the source of the information about their resignation.

Expand Down
4 changes: 2 additions & 2 deletions cousin-marriage/README.md
Original file line number Diff line number Diff line change
@@ -1,6 +1,6 @@
### Cousin Marriage Data
# Cousin Marriage

The raw data behind the story [Dear Mona: How Many Americans Are Married To Their Cousins?]
This folder contains data behind the story [Dear Mona: How Many Americans Are Married To Their Cousins?](https://fivethirtyeight.com/features/how-many-americans-are-married-to-their-cousins/).

Header | Definition
---|---------
Expand Down
6 changes: 3 additions & 3 deletions daily-show-guests/README.md
Original file line number Diff line number Diff line change
@@ -1,13 +1,13 @@
### Daily Show Guests
# Daily Show Guests

The raw data behind the story [Every Guest Jon Stewart Ever Had On ‘The Daily Show’](http://fivethirtyeight.com/datalab/every-guest-jon-stewart-ever-had-on-the-daily-show/)
This folder contains data behind the story [Every Guest Jon Stewart Ever Had On ‘The Daily Show’](http://fivethirtyeight.com/datalab/every-guest-jon-stewart-ever-had-on-the-daily-show/).

Header | Definition
---|---------
`YEAR` | The year the episode aired
`GoogleKnowlege_Occupation` | Their occupation or office, according to Google's Knowledge Graph or, if they're not in there, how Stewart introduced them on the program.
`Show` | Air date of episode. Not unique, as some shows had more than one guest
`Group` | A larger group designation for the occupation. For instance, us senators, us presidents, and former presidents are all under "politicians"
`Raw_Guest_List` | The person or list of people who appeared on the show, according to Wikipedia. The GoogleKnowlege_Occupation only refers to one of them in a given row.
`Raw_Guest_List` | The person or list of people who appeared on the show, according to Wikipedia. The GoogleKnowlege_Occupation only refers to one of them in a given row.

Source: Google Knowlege Graph, The Daily Show clip library, Wikipedia.
4 changes: 3 additions & 1 deletion democratic-bench/README.md
Original file line number Diff line number Diff line change
@@ -1,4 +1,6 @@
### Democratic bench
# Democratic bench

This folder contains data behind the story [Some Democrats Who Could Step Up If Hillary Isn’t Ready For Hillary](https://fivethirtyeight.com/features/some-democrats-who-could-step-up-if-hillary-isnt-ready-for-hillary/).

Header | Definition
---|---------
Expand Down
6 changes: 3 additions & 3 deletions drug-use-by-age/README.md
Original file line number Diff line number Diff line change
@@ -1,8 +1,8 @@
### Drug Use By Age
# Drug Use By Age

This directory contains the data behind the story [How Baby Boomers Get High](http://fivethirtyeight.com/datalab/how-baby-boomers-get-high/). It covers 13 drugs across 17 age groups.
This directory contains data behind the story [How Baby Boomers Get High](http://fivethirtyeight.com/datalab/how-baby-boomers-get-high/). It covers 13 drugs across 17 age groups.

Source: [National Survey on Drug Use and Health from the Substance Abuse and Mental Health Data Archive](http://www.icpsr.umich.edu/icpsrweb/content/SAMHDA/index.html).
Source: [National Survey on Drug Use and Health from the Substance Abuse and Mental Health Data Archive](http://www.icpsr.umich.edu/icpsrweb/content/SAMHDA/index.html).

Header | Definition
---|---------
Expand Down
3 changes: 3 additions & 0 deletions early-senate-polls/README.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,3 @@
# Early Senate Polls

This folder contains data behind the story [Early Senate Polls Have Plenty to Tell Us About November](https://fivethirtyeight.com/features/early-senate-polls-have-plenty-to-tell-us-about-november/).
4 changes: 2 additions & 2 deletions elo-blatter/README.md
Original file line number Diff line number Diff line change
@@ -1,6 +1,6 @@
### FIFA teams under Blatter
# FIFA teams under Blatter

The raw data behind the story [Blatter’s Reign At FIFA Hasn’t Helped Soccer’s Poor](http://fivethirtyeight.com/features/blatters-reign-at-fifa-hasnt-helped-soccers-poor/)
This folder contains data behind the story [Blatter’s Reign At FIFA Hasn’t Helped Soccer’s Poor](http://fivethirtyeight.com/features/blatters-reign-at-fifa-hasnt-helped-soccers-poor/).

Header | Definition
---|---------
Expand Down
6 changes: 3 additions & 3 deletions endorsements-june-30/README.md
Original file line number Diff line number Diff line change
@@ -1,8 +1,8 @@
### Endorsements through June 30
# Endorsements through June 30

The raw data behind the story [Pols And Polls Say The Same Thing: Jeb Bush Is A Weak Front-Runner](http://fivethirtyeight.com/features/pols-and-polls-say-the-same-thing-jeb-bush-is-a-weak-front-runner/)
This folder contains data behind the story [Pols And Polls Say The Same Thing: Jeb Bush Is A Weak Front-Runner](http://fivethirtyeight.com/features/pols-and-polls-say-the-same-thing-jeb-bush-is-a-weak-front-runner/).

This data includes something we call "endorsement points," an attempt to quantify the importance of endorsements by weighting each one according to the position held by the endorser: 10 points for each governor, 5 points for each senator and 1 point for each representative
This data includes something we call "endorsement points," an attempt to quantify the importance of endorsements by weighting each one according to the position held by the endorser: 10 points for each governor, 5 points for each senator and 1 point for each representative.

Header | Definition
---|---------
Expand Down
12 changes: 7 additions & 5 deletions fandango/README.md
Original file line number Diff line number Diff line change
@@ -1,17 +1,19 @@
# Fandango

This directory contains the data behind the story [Be Suspicious Of Online Movie Ratings, Especially Fandango’s](http://fivethirtyeight.com/features/fandango-movies-ratings/).

`fandango_score_comparison.csv` contains every film that has a Rotten Tomatoes rating, a RT User rating, a Metacritic score, a Metacritic User score, and IMDb score, and at least 30 fan reviews on Fandango. The data from Fandango was pulled on Aug. 24, 2015.

Column | Definition
--- | -----------
FILM | The film in question
RottenTomatoes | The Rotten Tomatoes Tomatometer score for the film
RottenTomatoes_User | The Rotten Tomatoes user score for the film
RottenTomatoes | The Rotten Tomatoes Tomatometer score for the film
RottenTomatoes_User | The Rotten Tomatoes user score for the film
Metacritic | The Metacritic critic score for the film
Metacritic_User | The Metacritic user score for the film
IMDB | The IMDb user score for the film
Fandango_Stars | The number of stars the film had on its Fandango movie page
Fandango_Ratingvalue | The Fandango ratingValue for the film, as pulled from the HTML of each page. This is the actual average score the movie obtained.
Fandango_Ratingvalue | The Fandango ratingValue for the film, as pulled from the HTML of each page. This is the actual average score the movie obtained.
RT_norm | The Rotten Tomatoes Tomatometer score for the film , normalized to a 0 to 5 point system
RT_user_norm | The Rotten Tomatoes user score for the film , normalized to a 0 to 5 point system
Metacritic_norm | The Metacritic critic score for the film, normalized to a 0 to 5 point system
Expand All @@ -34,5 +36,5 @@ Column | Definiton
--- | ---------
FILM | The movie
STARS | Number of stars presented on Fandango.com
RATING | The Fandango ratingValue for the film, as pulled from the HTML of each page. This is the actual average score the movie obtained.
VOTES | number of people who had reviewed the film at the time we pulled it.
RATING | The Fandango ratingValue for the film, as pulled from the HTML of each page. This is the actual average score the movie obtained.
VOTES | number of people who had reviewed the film at the time we pulled it.
2 changes: 1 addition & 1 deletion fifa/README.md
Original file line number Diff line number Diff line change
@@ -1,4 +1,4 @@
### FIFA
# FIFA

This directory contains the data behind the story [How To Break FIFA](http://fivethirtyeight.com/features/how-to-break-fifa/).

Expand Down
6 changes: 4 additions & 2 deletions flying-etiquette-survey/README.md
Original file line number Diff line number Diff line change
@@ -1,3 +1,5 @@
### Flying Etiquette Survey
# Flying Etiquette Survey

Results of a SurveyMonkey survey commissioned by FiveThirtyEight for the story [41 Percent of Fliers Say It’s Rude To Recline Your Airplane Seat](http://fivethirtyeight.com/datalab/airplane-etiquette-recline-seat)
This folder contains data behind the story [41 Percent of Fliers Say It’s Rude To Recline Your Airplane Seat](http://fivethirtyeight.com/datalab/airplane-etiquette-recline-seat).

`flying-etiquette.csv` contains the results of a SurveyMonkey survey commissioned by FiveThirtyEight for the story.
16 changes: 16 additions & 0 deletions food-world-cup/README.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,16 @@
# Food World Cup

This folder contains data behind the stories:
* [The FiveThirtyEight International Food Association’s 2014 World Cup](https://fivethirtyeight.com/features/the-fivethirtyeight-international-food-associations-2014-world-cup/)
* [What is Americans’ Favorite Global Cuisine?](https://fivethirtyeight.com/features/what-is-americans-favorite-global-cuisine/)

Anwser key for the responses to the "Please rate how much you like the traditional cuisine of X:" questions.

Value | Description
------|--------------
5 | I love this country's traditional cuisine. I think it's one of the best in the world.
4 | I like this country's traditional cuisine. I think it's considerably above average.
3 | I'm OK with this county's traditional cuisine. I think it's about average.
2 | I dislike this country's traditional cuisine. I think it's considerably below average.
1 | I hate this country's traditional cuisine. I think it's one of the worst in the world.
N/A | I'm unfamiliar with this country's traditional cuisine.
8 changes: 0 additions & 8 deletions food-world-cup/readme.txt

This file was deleted.

4 changes: 2 additions & 2 deletions forecast-methodology/README.md
Original file line number Diff line number Diff line change
@@ -1,6 +1,6 @@
### Historical FiveThirtyEight Senate Forecasts
# Historical FiveThirtyEight Senate Forecasts

The data behind the story [How The FiveThirtyEight Senate Forecast Model Works](http://fivethirtyeight.com/features/how-the-fivethirtyeight-senate-forecast-model-works/)
This folder contains the data behind the story [How The FiveThirtyEight Senate Forecast Model Works](http://fivethirtyeight.com/features/how-the-fivethirtyeight-senate-forecast-model-works/).

Header | Definition
---|---------
Expand Down
4 changes: 2 additions & 2 deletions goose/README.md
Original file line number Diff line number Diff line change
@@ -1,6 +1,6 @@
### Goose
# Goose

The raw data behind the stories:
The data behind the stories:
* [The Save Ruined Relief Pitching. The Goose Egg Can Fix It](https://fivethirtyeight.com/features/goose-egg-new-save-stat-relief-pitchers/)
* [Kenley Jansen Is The Model Of A Modern Reliever](https://fivethirtyeight.com/features/kenley-jansen-is-the-model-of-a-modern-reliever/)

Expand Down
4 changes: 2 additions & 2 deletions hate-crimes/README.md
Original file line number Diff line number Diff line change
@@ -1,6 +1,6 @@
### Hate-crimes data
# Hate Crimes

The raw data behind the story [Higher Rates Of Hate Crimes Are Tied To Income Inequality](https://fivethirtyeight.com/features/higher-rates-of-hate-crimes-are-tied-to-income-inequality/)
This folder contains data behind the story [Higher Rates Of Hate Crimes Are Tied To Income Inequality](https://fivethirtyeight.com/features/higher-rates-of-hate-crimes-are-tied-to-income-inequality/).

Header | Definition
---|---------
Expand Down
9 changes: 5 additions & 4 deletions hip-hop-candidate-lyrics/README.md
Original file line number Diff line number Diff line change
@@ -1,17 +1,18 @@
### Every mention of the 2016 primary candidates in hip-hop songs
# Hip Hop Candidate Lyrics

The raw data behind the story [ Hip-Hop Is Turning On Donald Trump](http://projects.fivethirtyeight.com/clinton-trump-hip-hop-lyrics/)
This folder contains data behind the story [ Hip-Hop Is Turning On Donald Trump](http://projects.fivethirtyeight.com/clinton-trump-hip-hop-lyrics/).

`genius_hip_hop_lyrics.csv` contains every mention of the 2016 primary candidates in hip-hop songs.

Header | Definition
---|---------
`candidate` | Candidate referenced
`song` | Song name
`artist` | Artist name
`sentiment` | Positive, negative or neutral
`theme` | Theme of lyric
`theme` | Theme of lyric
`album_release_date` | Date of album release
`line` | Lyrics
`url` | Genius link


Source: [Genius](http://genius.com/)
3 changes: 3 additions & 0 deletions historical-ncaa-forecasts/README.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,3 @@
# NCAA Bracket

This folder contains data behind the story [The NCAA Bracket: Checking Our Work](https://fivethirtyeight.com/datalab/the-ncaa-bracket-checking-our-work).
4 changes: 2 additions & 2 deletions inconvenient-sequel/README.md
Original file line number Diff line number Diff line change
@@ -1,5 +1,5 @@
# An Inconvenient Sequel

Raw data behind the story [Al Gore’s New Movie Exposes The Big Flaw In Online Movie Ratings](https://fivethirtyeight.com/features/al-gores-new-movie-exposes-the-big-flaw-in-online-movie-ratings/)
This folder contains data behind the story [Al Gore’s New Movie Exposes The Big Flaw In Online Movie Ratings](https://fivethirtyeight.com/features/al-gores-new-movie-exposes-the-big-flaw-in-online-movie-ratings/).

Data contains [IMDb ratings](http://www.imdb.com/title/tt6322922/ratings) for the film "An Inconvenient Sequel: Truth to Power" collected daily from July 17 to August 29, 2017.
`ratings.csv` contains [IMDb ratings](http://www.imdb.com/title/tt6322922/ratings) for the film "An Inconvenient Sequel: Truth to Power" collected daily from July 17 to August 29, 2017.
3 changes: 3 additions & 0 deletions infrastructure-jobs/README.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,3 @@
# Infrastructure Jobs

This folder contains data behind the story [Using Infrastructure Jobs as a Measuring Stick For State-Level Spending](https://fivethirtyeight.com/features/using-infrastructure-jobs-as-a-measuring-stick-for-state-level-spending/).
2 changes: 1 addition & 1 deletion librarians/README.md
Original file line number Diff line number Diff line change
@@ -1,3 +1,3 @@
# Librarians

The data behind the story [Where Are America’s Librarians?](https://fivethirtyeight.com/features/where-are-americas-librarians/)
This folder contains data behind the story [Where Are America’s Librarians?](https://fivethirtyeight.com/features/where-are-americas-librarians/).
10 changes: 5 additions & 5 deletions love-actually/readme.md
Original file line number Diff line number Diff line change
@@ -1,10 +1,10 @@
### Love Actually
# Love Actually

This directory contains the data behind the story: [The Definitive Analysis Of ‘Love Actually,’ The Greatest Christmas Movie Of Our Time](https://fivethirtyeight.com/features/the-definitive-analysis-of-love-actually-the-greatest-christmas-movie-of-our-time/)
This directory contains the data behind the story [The Definitive Analysis Of ‘Love Actually,’ The Greatest Christmas Movie Of Our Time](https://fivethirtyeight.com/features/the-definitive-analysis-of-love-actually-the-greatest-christmas-movie-of-our-time/).

There are two data files:

* `love_actually_appearances.csv` - A table of the central actors in "Love Actually" and which scenes they appear in
* `love_actually_adjacencies.csv` - The adjacency matrix of which actors appear in the same scene together
`love_actually_appearances.csv` contains a table of the central actors in "Love Actually" and which scenes they appear in.

`love_actually_adjacencies.csv` contains the adjacency matrix of which actors appear in the same scene together.

You'll notice there are a lot of “Love Actually” actors who we didn’t track in the data. That’s because they rarely cross storylines. When they do, it’s in the company of the actor who we *did* include, the linchpin of that storyline.
2 changes: 1 addition & 1 deletion mad-men/README.md
Original file line number Diff line number Diff line change
@@ -1,4 +1,4 @@
### Mad Men
# Mad Men

This directory contains the data behind the story [‘Mad Men’ Is Ending. What’s Next For The Cast?](http://fivethirtyeight.com/datalab/mad-men-is-ending-whats-next-for-the-cast/).

Expand Down
10 changes: 3 additions & 7 deletions male-flight-attendants/README.md
Original file line number Diff line number Diff line change
@@ -1,12 +1,8 @@
### Male flight attendants
# Male Flight Attendants

This repo contains the data from the article on the gender divide in various U.S. occupations
This folder contains the data behind the story [Dear Mona, How Many Flight Attendants Are Men?](http://fivethirtyeight.com/datalab/dear-mona-how-many-flight-attendants-are-men/).

[Dear Mona, How Many Flight Attendants Are Men?](http://fivethirtyeight.com/datalab/dear-mona-how-many-flight-attendants-are-men/)

`male-flight-attendants.tsv`:

The tab-separated text file contains the percentage of U.S. employees that are male in 320 different job categories.
`male-flight-attendants.tsv` contains the percentage of U.S. employees that are male in 320 different job categories.

Source: [IPUMS](https://usa.ipums.org/usa/), 2012

7 changes: 4 additions & 3 deletions march-madness-predictions-2015/README.md
Original file line number Diff line number Diff line change
@@ -1,4 +1,5 @@
March Madness Predictions 2015
==============================
# March Madness Predictions

Data files for [FiveThirtyEight's 2015 March Madness Predictions](http://fivethirtyeight.com/interactives/march-madness-predictions-2015/), updated each time we calculate new odds.
This folder contains data behind the [2015 March Madness Predictions](http://fivethirtyeight.com/interactives/march-madness-predictions-2015/).

Data was updated each time we calculate new odds.
4 changes: 3 additions & 1 deletion march-madness-predictions/README.md
Original file line number Diff line number Diff line change
@@ -1 +1,3 @@
http://fivethirtyeight.com/interactives/march-madness-predictions/
# March Madness Predictions

This folder contains data behind the [2014 NCAA Tournament Predictions](http://fivethirtyeight.com/interactives/march-madness-predictions/).
8 changes: 6 additions & 2 deletions marriage/README.md
Original file line number Diff line number Diff line change
@@ -1,6 +1,10 @@
These files contain data used in <a href="http://fivethirtyeight.com/features/marriage-isnt-dead-yet/">FiveThirtyEight's story</a> on marriage trends.File names are self-explanatory. Source for all data is Decennial Census (years 1960 to 2000) and American Community Survey (years 2001-2012), via <a href="https://usa.ipums.org/usa/cite.shtml">IPUMS USA</a>.
# Marriage

Except in the divorce file, figures represent share of the relevant population that has never been married (MARST == 6 in the IPUMS data). Note that in the story, charts generally show the share that have <i>ever</i> been married, which is simply 1 - n. In the divorce file, figures are share of the relevant population that is <i>currently</i> divorced, conditional on having ever been married.
This folder contains data behind the story [Marriage Isn’t Dead — Yet](http://fivethirtyeight.com/features/marriage-isnt-dead-yet/).

Source for all data is Decennial Census (years 1960 to 2000) and American Community Survey (years 2001-2012), via [IPUMS USA](https://usa.ipums.org/usa/cite.shtml).

Except in the divorce file, figures represent share of the relevant population that has never been married (MARST == 6 in the IPUMS data). Note that in the story, charts generally show the share that have *ever* been married, which is simply 1 - n. In the divorce file, figures are share of the relevant population that is *currently* divorced, conditional on having ever been married.

Variable names are as follows. Number in variable names are age ranges, so `all_2534` is the marriage rate for everyone ages 25 to 34.

Expand Down
Loading

0 comments on commit fd36e47

Please sign in to comment.