Making a FiveThirtyEight analysis more reproducible with make
R Shell
Switch branches/tags
Nothing to show
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Failed to load latest commit information.
code
data
results
.Rprofile
.gitignore
LICENSE.md
Makefile
Makefile_finalish
README.md
baby_name_analysis.bash
family_report.Rmd

README.md

README

This is a repository that is used in a tutorial on how to use GNU Make. The tutorial is part of my Reproducible Research in Microbial Informatics series. If you aren't interested in the motivation from a microbiology perspective, you can skip ahead to around slide 15. The tutorial focuses on replicating an analysis performed by FiveThirtyEight to predict someone's age using their name.

If it isn't obvious, a significant limtation of using make to make your analysis more reproducible is that it only runs on Mac OS X and Linux. If you have a computer running windows, you'll need to install the Linux Bash window in Windows 10 (I'm not sure if it runs on Cygwin)

Datasets

The analysis draws names from two sources within the Social Security Administration:

Dependencies

  • make, curl, and unzip
  • R and the package zoo (zoo will be installed if it isn't already)