Denver B-cycle Data Exploration for University of Washington Methods for Data Analysis class

Denver B-Cycle 2014 Ridership

Tyler Byers

August 23, 2015

Project Summary

This is my final project for University of Washington's Methods for Data Analysis class, course #2 of 3 in the Data Science Certificate program. This project looks at public data from the Denver B-cycle program, which is merged with distance data from Google Maps and weather data from

Project Files

The following project files are in this project directory:

  • -- This document, with project description.
  • -- Final project writeup.
  • bcycle_final_script.R -- Production-level final script.
  • exploring_bcycle_data.Rmd -- Contains code for data-set building (some processes are fairly complex and time-intensive and would not make sense to build in a production-level script) and initial data explorations.
  • ./data -- Directory containing data files used in the scripts.
  • ./figures -- Directory with figures loaded into the final project writeup.

Data Sources

Analysis Software

All data analysis was done in R using RStudio. The R packages listed below are required to re-run the final R script and the data-building and exploratory analysis file, exploring_bcycle_data.Rmd. Note that to run exploring_bcycle_data.Rmd in full, you will need your own developer API key, and you will need to run the kiosk_pairs code chunk over the course of at least 3 days, because calls to the Google Distance Matrix API through mapdist are limited to 2500 per day.

library(ggplot2); library(dplyr); library(tidyr)
library(lubridate); library(xml2); library(readxl)
library(ggmap); library(logging); library(jsonlite)
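
The multi-day requirement for the kiosk_pairs chunk follows from the 2500-calls-per-day limit. The sketch below is not the code from exploring_bcycle_data.Rmd; it only illustrates one way to split a table of kiosk pairs into daily batches that each stay under the limit. The kiosk count, the column names, and the batch_day variable are made up for illustration, and the actual mapdist call is left as a comment because it needs an API key.

```r
# Daily quota quoted in the note above.
daily_limit <- 2500

# Hypothetical stand-in for the kiosk-pair table: every ordered pair of
# distinct kiosks. The kiosk count (80) is an assumed value, not the real one.
n_kiosks <- 80
kiosk_pairs <- expand.grid(from = seq_len(n_kiosks), to = seq_len(n_kiosks))
kiosk_pairs <- kiosk_pairs[kiosk_pairs$from != kiosk_pairs$to, ]

# Assign each pair to a day so that no day exceeds the quota.
kiosk_pairs$batch_day <- ceiling(seq_len(nrow(kiosk_pairs)) / daily_limit)
batches <- split(kiosk_pairs, kiosk_pairs$batch_day)

# One batch would then be processed per day, for example:
#   todays_pairs <- batches[["1"]]
#   ggmap::mapdist(from = ..., to = ..., mode = "bicycling")
# with the results saved to disk before moving on to the next batch.
for (day in names(batches)) {
  message("Day ", day, ": ", nrow(batches[[day]]), " pairs to look up")
}
```

Saving each day's results to a file before starting the next batch means an interrupted run does not spend quota on pairs that were already looked up.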

Comparison to B-cycle Annual Report

A tip of the hat to Denver B-cycle's annual report. Because of likely differences in data processing and analysis, some of my conclusions (such as the number of trips and miles ridden) differ from those in the official B-cycle analysis. However, my analysis would not have been possible without first reading theirs and learning about the data, and certainly not without the public availability of their ridership data.