Monday, January 06, 2020
Data Analysis of US Bikeshare Data from 3 Major Cities
The Bikeshare project uses Python to explore data from bike share systems in three major United States cities: Chicago, New York City, and Washington. A script takes raw input from the user to create an interactive experience in the terminal that presents statistics on that data.
The following software is required for this project:
pip install pandas
pip install numpy
git clone https://github.com/michaelbedson/pdsnd_github.git
A text editor such as Atom.
A terminal application (Terminal on Mac and Linux, or Cygwin on Windows).
The following files were used for the data analysis:-
bikeshare.py, run from the terminal with: python bikeshare.py
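The interactive part of such a script usually comes down to asking a question and repeating it until the answer is valid. The sketch below illustrates that idea for the city choice. It is not taken from bikeshare.py: the names CITIES and ask_city, and the prompt wording, are assumptions made for this example.

```python
# Hypothetical sketch of a validated terminal prompt (not the project's code).
CITIES = ("chicago", "new york city", "washington")


def ask_city(read=input):
    """Keep prompting until the user names one of the three cities.

    `read` defaults to the built-in input(); passing another callable
    makes the function easy to try out without a live terminal.
    """
    while True:
        answer = read("Which city? (Chicago, New York City, Washington): ")
        city = answer.strip().lower()  # ignore case and stray spaces
        if city in CITIES:
            return city
        print(f"'{answer}' is not one of the available cities, please try again.")
```

Calling ask_city() with no arguments reads from the keyboard. Normalising the answer with strip() and lower() means that "Chicago", "chicago" and " CHICAGO " are all accepted.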
Randomly selected data for the first six months of 2017 are provided for all three cities. All three of the data files contain the same core six (6) columns:
- Start Time (e.g., 2017-01-01 00:07:57)
- End Time (e.g., 2017-01-01 00:20:53)
- Trip Duration (in seconds - e.g., 776)
- Start Station (e.g., Broadway & Barry Ave)
- End Station (e.g., Sedgwick St & North Ave)
- User Type (Subscriber or Customer)
The Chicago and New York City files also have the following two columns:
- Gender
- Birth Year
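As an illustration of how these columns can be used, the sketch below loads trip data with pandas and computes a few simple statistics. It is a minimal example under stated assumptions, not the project's actual code: the helper names load_trips and summarize are hypothetical, and the three sample rows are made up for the demonstration (only the first reuses the example values listed above). A real run would pass the path of one of the city data files instead.

```python
import io

import pandas as pd

# Made-up sample in the six-column layout described above (no Gender or
# Birth Year columns, as in the Washington file).
SAMPLE_CSV = (
    "Start Time,End Time,Trip Duration,Start Station,End Station,User Type\n"
    "2017-01-01 00:07:57,2017-01-01 00:20:53,776,Broadway & Barry Ave,Sedgwick St & North Ave,Subscriber\n"
    "2017-01-01 00:30:00,2017-01-01 00:40:00,600,Broadway & Barry Ave,Sedgwick St & North Ave,Customer\n"
    "2017-02-03 08:00:00,2017-02-03 08:05:00,300,Sedgwick St & North Ave,Broadway & Barry Ave,Subscriber\n"
)


def load_trips(source):
    """Read trip data and derive month and hour from the start time."""
    df = pd.read_csv(source, parse_dates=["Start Time", "End Time"])
    df["month"] = df["Start Time"].dt.month
    df["hour"] = df["Start Time"].dt.hour
    return df


def summarize(df):
    """Return a few descriptive statistics as a plain dictionary."""
    summary = {
        "most_common_hour": int(df["hour"].mode()[0]),
        "total_duration_s": int(df["Trip Duration"].sum()),
        "user_types": df["User Type"].value_counts().to_dict(),
    }
    # Gender is only present in some files, so check before using it.
    if "Gender" in df.columns:
        summary["genders"] = df["Gender"].value_counts().to_dict()
    return summary


trips = load_trips(io.StringIO(SAMPLE_CSV))
print(summarize(trips))
```

Checking for the Gender column before using it lets the same function handle all three cities, since only the Chicago and New York City files include it.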