To eBike or Not to eBike?

NYC Data Science Academy R Project (CitiBike)

The purpose of this study was to evaluate the economic feasibility of riding electric bikes within the NYC CitiBike fleet. The full conclusions are written up here:

https://nycdatascience.com/blog/student-works/to-ebike-or-not-to-ebike/

The following Jupyter notebooks have been used in this order to prepare the data for visualization:

CB_Python_Combine_File.ipynb to combine the separate CitiBike .csv files for each month and export to .parquet
CB_Python_Clean_File.ipynb to remove null values, duplicates, and other errata
CB_Python_Feature_Eng_Prelim.ipynb to convert timestamp strings to actual timestamps and extract time data
CB_Python_Geolocation.ipynb to extract distinct station names and locations and tag them with their respective boroughs (or state and county - Hudson County, NJ) and neighborhoods (or city - Hoboken or Jersey City)
CB_Python_Feature_Eng_Final.ipynb to normalize coordinates, perform the associated distance and speed calculations, and attach station borough and neighborhood associations to the main dataframe, completing the final data file used for visualization
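
The timestamp conversion and the distance and speed calculations described in the steps above can be sketched roughly as follows. This is a minimal illustration, not the code from the notebooks: the field names (started_at, ended_at, start_lat, and so on) and the ISO-style timestamp format are assumptions, and the haversine (straight-line) distance is used here as a stand-in because the notebooks' actual distance method is not described in this README.

```python
import math
from datetime import datetime

EARTH_RADIUS_KM = 6371.0088  # mean Earth radius


def haversine_km(lat1, lon1, lat2, lon2):
    """Great-circle distance in km between two (lat, lon) points in degrees."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlmb = math.radians(lon2 - lon1)
    a = math.sin(dphi / 2) ** 2 + math.cos(phi1) * math.cos(phi2) * math.sin(dlmb / 2) ** 2
    return 2 * EARTH_RADIUS_KM * math.asin(math.sqrt(a))


def trip_features(trip):
    """Derive time, distance, and speed features from one trip record.

    `trip` is a dict with hypothetical keys; the real column names may differ.
    """
    started = datetime.fromisoformat(trip["started_at"])  # string -> timestamp
    ended = datetime.fromisoformat(trip["ended_at"])
    duration_h = (ended - started).total_seconds() / 3600
    distance_km = haversine_km(
        trip["start_lat"], trip["start_lng"], trip["end_lat"], trip["end_lng"]
    )
    # Straight-line speed; undefined for zero or negative durations.
    speed_kmh = distance_km / duration_h if duration_h > 0 else None
    return {
        "hour": started.hour,
        "weekday": started.weekday(),  # Monday == 0
        "duration_min": duration_h * 60,
        "distance_km": distance_km,
        "speed_kmh": speed_kmh,
    }


# Illustrative record with made-up coordinates and times.
example = trip_features({
    "started_at": "2023-06-01 08:00:00",
    "ended_at": "2023-06-01 08:30:00",
    "start_lat": 40.75, "start_lng": -73.99,
    "end_lat": 40.72, "end_lng": -74.00,
})
```

Because the distance is point to point rather than along the route actually ridden, the resulting speed is a lower bound on the rider's true average speed.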

Finally, the following notebook has been developed to streamline the whole workflow from the notebooks above:

CB_Full_Service.ipynb

In addition, a duplicate of that notebook specific to electric bikes has been created; it uses a time frame (roughly one month later) that has better data on electric bike usage:

CB_Full_Service-eBike.ipynb

All future work for the capstone project, which is derived from this one, can be found at the following location:

https://github.com/jchatterjee/nycdsa_capstone

The New York City neighborhoods used to map the locations of their respective stations derive from a GeoJSON available here:

https://data.beta.nyc/dataset/pediacities-nyc-neighborhoods/resource/35dd04fb-81b3-479b-a074-a27a37888ce7

Any coordinates that could not be matched to a neighborhood in the aforementioned GeoJSON were assumed to be in Hudson County, NJ and labeled accordingly.
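
The tagging rule above (match a station to a neighborhood polygon, otherwise fall back to Hudson County, NJ) can be sketched as below. This is only an illustration under stated assumptions: it uses a plain ray-casting point-in-polygon test instead of whatever geospatial library the notebooks use, it handles only the outer ring of simple Polygon geometries (no holes, no MultiPolygon), and the property names "borough" and "neighborhood" as well as the toy square feature are hypothetical.

```python
def point_in_ring(lon, lat, ring):
    """Ray-casting test: is (lon, lat) inside the ring of [lon, lat] vertices?"""
    inside = False
    n = len(ring)
    for i in range(n):
        x1, y1 = ring[i]
        x2, y2 = ring[(i + 1) % n]
        # Only edges that straddle the point's latitude can be crossed.
        if (y1 > lat) != (y2 > lat):
            x_cross = x1 + (lat - y1) * (x2 - x1) / (y2 - y1)
            if lon < x_cross:
                inside = not inside
    return inside


def tag_station(lon, lat, features):
    """Return borough/neighborhood labels, defaulting to Hudson County, NJ."""
    for feature in features:
        geometry = feature["geometry"]
        if geometry["type"] != "Polygon":
            continue  # assumption: MultiPolygon and other types are skipped here
        outer_ring = geometry["coordinates"][0]
        if point_in_ring(lon, lat, outer_ring):
            props = feature["properties"]
            return {"borough": props["borough"], "neighborhood": props["neighborhood"]}
    # No polygon matched: assume the station is on the New Jersey side.
    return {"borough": "Hudson County, NJ", "neighborhood": None}


# Toy feature: a made-up square, not a real neighborhood boundary.
toy_features = [{
    "type": "Feature",
    "properties": {"borough": "Manhattan", "neighborhood": "Example Neighborhood"},
    "geometry": {
        "type": "Polygon",
        "coordinates": [[
            [-74.0, 40.7], [-73.9, 40.7], [-73.9, 40.8], [-74.0, 40.8], [-74.0, 40.7],
        ]],
    },
}]

inside_tag = tag_station(-73.95, 40.75, toy_features)
outside_tag = tag_station(-74.03, 40.74, toy_features)
```

Note that a catch-all fallback like this labels any unmatched coordinate as Hudson County, including bad or missing coordinates, so it is worth cleaning the data before tagging.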

The following files were used for the R project (superseded by Python for the capstone):

unzip_files.R was used for unzipping files
combine_file.R was used for combining relevant .csv files into one
clean_file.R was used for cleaning the combined .csv file
JChatterjee_R_Project.Rmd was used for creating the visualizations for the original class presentation, which has been superseded by the aforementioned blog article

For the R project, the New York City neighborhoods used to map the locations of their respective stations derive from a Zillow shapefile available here:

https://www.kaggle.com/datasets/jackcook/neighborhoods-in-new-york

Those for the New Jersey (Hudson County) side were obtained from shapefiles available here:

https://catalog.data.gov/dataset/tiger-line-shapefile-2016-state-new-jersey-current-place-state-based
