Skip to content

DataScienceWorks/PredictingBicycleTraffic

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Predicting Bicycle Traffic

Tags

Pandas, Visualization, RollingMean, GroupBy, PivotTable, TimeSeries, LinearRegression

Credits

Objective

Understand Bicylers habits and make predictions

DataSets Source

  • Seattle Gov - Data
    • Transportation : Fremont Bridge - Hourly Bicycle Counts by the month of October 2012 to present
      • The Fremont Bridge Bicycle Counter records the number of bikes that cross the bridge using the pedestrian/bicycle pathways. Inductive loops on the east and west pathways count the passing of bicycles regardless of travel direction. The data consists of a date/time field: Date, east pathway count field: Fremont Bridge NB, and west pathway count field: Fremont Bridge SB. The count fields represent the total bicycles detected during the specified one hour period. Direction of travel is not specified, but in general most traffic in the Fremont Bridge NB field is travelling northbound and most traffic in the Fremont Bridge SB field is travelling southbound.
      • The data we will use here are the hourly bicycle counts on Seattle's Fremont Bridge. These data come from an automated bicycle counter, installed in late 2012, which has inductive sensors under the sidewalks on either side of the bridge. The daily or hourly bicycle counts can be downloaded from http://data.seattle.gov/; here is the direct link to the hourly dataset. To download the data directly, you can uncomment the following curl command:
      • Code: !curl -o FremontBridge.csv# https://data.seattle.gov/api/views/65db-xm6k/rows.csv?accessType=DOWNLOAD
  • NOAA - Climate Data Online Search
    • While the datasets are free, the way you obtain data from this site is by submitting your email-id and as if you are making an online purchase for free.
    • NOAA makes available their daily weather station data (I used station ID USW00024233) and we can easily use Pandas to join the two data sources.

Lessons

  • Learn how to play with Time-Series data.