I am Emre Okcular, a data science intern at Dictionary.com Currently pursuing Master of Science in Data Science at University of San Francisco.
The main project of this project is to understand what are the characteristics of the data and to reveal key findings. In addition, various regressors and time series models were built for forecasting the daily rides for the transportation department to improve service in the next few years.
The dataset is downloaded from https://data.cityofchicago.org/Transportation/CTA-Ridership-L-Station-Entries-Daily-Totals/5neh-572f.
CTA_-_Ridership_-__L__Station_Entries_-_Daily_Totals.csv
: Raw Datasetchicago-train.ipynb
: Main notebookl_map.png
: L Train Mappopulation_density.png
: Chicago Population Density Mapridershipreadme.txt
: Readme file for the datasetrequirements.txt
: Python packages used in the notebook
To run the notebook in your local, you should run below command and install all required packages from requirement.txt which is located in the same folder.
pip install -r requirements.txt
Main Notebook File: https://github.com/emreokcular/chicago-trains/blob/main/chicago-train.ipynb
In this project, Chicago L Train Ridership data is analyzed. The dataset from 2001 to 2021 loaded and explored to extract characteristics and key findings. Lessons learned from data can be found in the notebook. Different type of forescasting models were built for the transportation department to improve service by forecasting the daily rides in the next few years. The final model were selected by evaluating model performance and considering production usage.