Skip to content

Latest commit

 

History

History
65 lines (47 loc) · 2.65 KB

index.md

File metadata and controls

65 lines (47 loc) · 2.65 KB

Boston Marathon Raw Data and Visualizations

This repository has two parts:

  • all of the data on Boston Marathon finishers available from 2001-2014
  • Visualizations of that data

Visualizations

The Data

Look in the results directory for the data. Do something interesting with it, and make sure you tell me about it!

Format

There are (unfortunately) two different data formats. 2013 and 2014 have more detailed timing data, with splits at 10k, 20k, 25k, half, 30k, 35k, and 40k.

Pre-2013, the data has only the finishing time, but adds the person's standing in their division, gender, and overall.

Caveats

  • The data includes wheelchair racers but not hand cyclists or other special groups... if you're interested in that data please submit a pull request!
  • The data does not include runners who did not finish. There's nothing I can do about that, as far as I can tell that data is unavailable from baa.org
  • The data is certainly missing a few people, but it ought to contain the large majority of runners who finished from each year.
  • The code is ugly. This is just about grinding the results out!

License

MIT License. Use it as you want to, don't feel obligated to give me credit. It's the BAA's data anyway. (Thanks for organizing, BAA)

Downloading The Data

I... already did that for you. Why do you want to do that?

Anyway, if you do, you'll want to run python multidl.py {year}

Viewing the notebook

  1. Install the prerequisites: pip install < requirements.txt
  2. Start the notebook: make notebook
  3. Play!