NYC Citi Bike system data and analysis
Latest commit 614a05a Dec 17, 2016 @toddwschneider committed on GitHub Merge pull request #5 from toddwschneider/taxi_zone_shapefile
Assorted updates

NYC Citi Bike Data

Code in support of this post: A Tale of Twenty-Two Million Citi Bikes: Analyzing the NYC Bike Share System

This repo provides scripts to download, process, and analyze data from NYC's Citi Bike system. The data is stored in a PostgreSQL database, with PostGIS for spatial calculations and R for data analysis.

This is pretty much a copy of the taxi/Uber data repo. At some point the Citi Bike, taxi, and Uber datasets could probably be combined into a single unified NYC transit database...

Instructions

1. Install PostgreSQL and PostGIS

Both are available via Homebrew on Mac OS X
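
A quick way to see what still needs installing is to check for the command-line tools each package provides. This is a minimal sketch, not part of the repo: `missing_tools` is a hypothetical helper, and it assumes the scripts rely on `psql` (PostgreSQL) and `shp2pgsql` (PostGIS).

```shell
# Report which required command-line tools are not yet on the PATH.
# missing_tools is a hypothetical helper, not something shipped in this repo.
missing_tools() {
  for tool in "$@"; do
    command -v "$tool" >/dev/null 2>&1 || echo "$tool"
  done
}

# psql ships with PostgreSQL; shp2pgsql ships with PostGIS.
# On a Mac with Homebrew, both packages can be installed with:
#   brew install postgresql postgis
missing_tools psql shp2pgsql
```

No output means both tools were found.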

2. Download raw Citi Bike data

./download_raw_data.sh
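
The raw files are large, so when re-running a download it can help to skip anything already on disk. The sketch below shows one way to do that; it is not how `download_raw_data.sh` necessarily works, and the `urls.txt` list and `data` directory are hypothetical names.

```shell
# Print the URLs from a list file whose target file is not yet in the data directory.
# Both arguments are hypothetical names, not taken from the repository.
pending_downloads() {
  url_list="$1"
  data_dir="$2"
  while IFS= read -r url; do
    # Skip blank lines in the list.
    [ -n "$url" ] || continue
    # Only report URLs whose file name is missing locally.
    [ -e "$data_dir/$(basename "$url")" ] || echo "$url"
  done < "$url_list"
}

# Example use, assuming a urls.txt file and a data/ directory:
#   pending_downloads urls.txt data | xargs -n 1 wget -P data
```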

3. Initialize database and set up schema

./initialize_database.sh
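
For orientation, setting up a PostGIS-enabled database usually comes down to creating the database and enabling the extension. The sketch below only prints the commands rather than running them; `init_commands` and the database name are hypothetical, and the actual script may do more (such as creating tables and loading shapefiles).

```shell
# Print the commands a PostGIS database setup of this kind would typically run.
# init_commands and the database name are hypothetical placeholders.
init_commands() {
  db="$1"
  echo "createdb $db"
  echo "psql $db -c 'CREATE EXTENSION postgis;'"
}

init_commands citibike_example
```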

4. Import Citi Bike trip data into database and map to census tracts

./import_trips.sh
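
Mapping to census tracts is a point-in-polygon lookup, which PostGIS can express with `ST_Within`. Here is a rough sketch of the kind of query involved. The table and column names (`stations`, `census_tracts`, `geom`, `gid`, `tract_gid`) are made up for illustration and are not taken from `import_trips.sh`.

```shell
# Print an illustrative point-in-polygon query.
# All table and column names below are hypothetical.
tract_join_sql() {
  cat <<'SQL'
UPDATE stations
SET tract_gid = t.gid
FROM census_tracts t
WHERE ST_Within(stations.geom, t.geom);
SQL
}

# To run it against a database (name is a placeholder):
#   psql citibike_example -c "$(tract_join_sql)"
```

Both geometry columns need to use the same spatial reference system for the comparison to be meaningful.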

5. Analysis

Additional Postgres and R scripts for analysis are in the analysis/ folder
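
SQL scripts are normally run with `psql -f` and R scripts with `Rscript`. The helper below is a small sketch that picks the command by file extension. `run_command_for`, the file names, and the database name are hypothetical, and some of the R scripts may be meant for interactive use instead.

```shell
# Print the command that would run a given analysis script.
# run_command_for is a hypothetical helper; $1 is the script path, $2 the database name.
run_command_for() {
  case "$1" in
    *.sql) echo "psql $2 -f $1" ;;
    *.R)   echo "Rscript $1" ;;
    *)     echo "unknown script type: $1" >&2; return 1 ;;
  esac
}

# Example with placeholder names:
#   run_command_for analysis/example.sql citibike_example
```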

Other data sources

These are bundled with the repository, so no need to download separately, but:

Questions/issues/contact

todd@toddwschneider.com, or open a GitHub issue