Skip to content

User Interaction Program that explores US bike-share data provided by Motivate bike-sharing system

Notifications You must be signed in to change notification settings

YashMotwani/US-Bikeshare-Data-Exploration-Program

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

21 Commits
 
 
 
 
 
 

Repository files navigation

US Bikeshare Data Analysis Project

12th Dec'18

Overview:

In this project, Python is used to explore data related to bike share systems for three major cities in the United States — Chicago, New York City, and Washington.

  • The source code takes in raw input from the user to create an interactive experience.
  • According to the input the code will import the data and will provide information by computing descriptive statistics.

Files used:

  • bikeshare_2.py

Credits:

Softwares needed:

  • Python 3, NumPy, and Pandas installed using Anaconda
  • A text editor, like VS Code or Atom.
  • A terminal application (Terminal on Mac and Linux or Cygwin on Windows).

Installation links for softwares:

Links for software tutorials:

Code explained in Detail:

How the program works:

The code developed takes in raw input to create an interactive experience in the terminal that answers questions about the dataset. The experience is interactive because depending on a user's input, the answers to the questions will change! There are four questions that will change the answers:

  • Would you like to see data for Chicago, New York, or Washington?
  • Would you like to filter the data by month, day, or not at all?
  • (If they chose month) Which month - January, February, March, April, May, or June?
  • (If they chose day) Which day - Monday, Tuesday, Wednesday, Thursday, Friday, Saturday, or Sunday?

The answers to the questions above will determine the city and timeframe on which you'll do data analysis. After filtering the dataset, users will see the statistical result of the data, and choose to start again or exit.

The Datasets:

Randomly selected data for the first six months of 2017 are provided for all three cities. All three of the data files contain the same core six (6) columns:

  • Start Time (e.g., 2017-01-01 00:07:57)
  • End Time (e.g., 2017-01-01 00:20:53)
  • Trip Duration (in seconds - e.g., 776)
  • Start Station (e.g., Broadway & Barry Ave)
  • End Station (e.g., Sedgwick St & North Ave)
  • User Type (Subscriber or Customer)

The Chicago and New York City files also have the following two columns:

  • Gender
  • Birth Year

Statistics Computed:

The code helps user to tell about bike share use in Chicago, New York City and Washington by computing a variety of descriptive statistics. In this project, the code output will provide the following information:

  • Popular times of travel (i.e., occurs most often in the start time):

    • most common month
    • most common day of week
    • most common hour of day
  • Popular stations and trip:

    • most common start station
    • most common end station
    • most common trip from start to end (i.e., most frequent combination of start station and end station)
  • Trip duration:

    • total travel time
    • average travel time
  • User info:

    • counts of each user type
    • counts of each gender (only available for NYC and Chicago)
    • earliest, most recent, most common year of birth (only available for NYC and Chicago)

About

User Interaction Program that explores US bike-share data provided by Motivate bike-sharing system

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 100.0%