Skip to content

Denver B-cycle Data Exploration for University of Washington Methods for Data Analysis class

Notifications You must be signed in to change notification settings

tybyers/denver_bcycle

Repository files navigation

Denver B-Cycle 2014 Ridership

Tyler Byers

tybyers@gmail.com

August 23, 2015

Project Summary

This is my final project for University of Washington's Methods for Data Analysis class, course #2 of 3 in the Data Science Certificate program. This project looks at public data from the Denver B-cycle program, which is merged with distance data from Google Maps and weather data from forecast.io.

Project Files

The following project files are in this project directory:

  • README.md -- This document, with project description.
  • Denver_B-Cycle_2014.md -- Final project writeup.
  • bcycle_final_script.R -- Production-level final script.
  • exploring_bcycle_data.Rmd -- Contains code for data-set building (some processes are fairly complex and time-intensive and would not make sense to build in a production-level script) and initial data explorations.
  • ./data -- Directory containing data files used in the scripts.
  • ./figures -- Directory with figures loaded into the final project writeup.

Data Sources

Analysis Software

All data analysis was done using R in RStudio. The following R packages are required in order to re-run the final R-script and the Data Building/Exploratory Analysis file exploring_bcycle_data.Rmd. Note that to fully run the exploring_bcycle_data.Rmd file, you will need your own forecast.io developer API key, and will need to run the kiosk_pairs code chunk over the course of at least 3 days (there is a limit of 2500 calls to Google Distance Matrix API using the mapdist code per day).

library(ggplot2); library(dplyr); library(tidyr)
library(lubridate); library(xml2); library(readxl)
library(ggmap); library(logging); library(jsonlite)

Comparison to B-cycle Annual Report

A tip of the hat to Denver B-Cycle's annual report. Due to some likely differences in data processing and analysis, some of our conclusions (such as number of trips and miles ridden) are different than the official B-cycle analysis. However, my analysis couldn't have been possible without first reading their analysis and learning about the data, and certainly would not have been possible without the public availability of their ridership data.

About

Denver B-cycle Data Exploration for University of Washington Methods for Data Analysis class

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages