Complete Time-Series Analysis on Traffic Congestion

This is a complete time-series workflow in Python for a univariate response. The dataset has 63 features in total, several of which were manually engineered and are collinear. I handled the collinearity by simply dropping those features and then comparing predictive performance.
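
The snippet below is a minimal sketch of that kind of collinearity treatment, not the code used in this repository. It assumes a purely numeric feature matrix and a correlation threshold of 0.9; the column names (`hour`, `minute_of_day`, `noise`) and the helper `drop_collinear` are hypothetical and exist only for illustration.

```python
import numpy as np

def drop_collinear(X, names, threshold=0.9):
    """Greedily keep a column only if its absolute Pearson correlation
    with every already-kept column is at most `threshold`."""
    corr = np.abs(np.corrcoef(X, rowvar=False))
    kept = []
    for j in range(X.shape[1]):
        if all(corr[j, k] <= threshold for k in kept):
            kept.append(j)
    return X[:, kept], [names[j] for j in kept]

# Synthetic stand-in data (assumption): "minute_of_day" is almost an exact
# rescaling of "hour", so the two columns are strongly collinear.
rng = np.random.default_rng(0)
hour = rng.uniform(0, 24, size=200)
X = np.column_stack([
    hour,
    hour * 60.0 + rng.normal(0, 0.1, size=200),
    rng.normal(size=200),
])
names = ["hour", "minute_of_day", "noise"]

X_reduced, kept_names = drop_collinear(X, names)
print(kept_names)
```

The greedy order matters: the first column of a correlated group is the one that survives, so it may be worth ordering columns by how much you trust them before calling the helper.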

The original focus was on time-series algorithms only. However, since gradient-boosted and some linear methods are also known to perform well on timestamped data, I decided to perform a comparative study of both categories.
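
One way to keep such a comparison fair is to score both categories on the same chronological hold-out. The sketch below is an assumption about how that could look, not the notebook's actual code: it uses a synthetic hourly series with a daily cycle, a seasonal-naive forecast as the time-series baseline, and ordinary least squares on lagged values as the regression-style model. The helpers `make_lag_matrix` and `rmse` are hypothetical.

```python
import numpy as np

def make_lag_matrix(y, n_lags):
    """Column k-1 holds the value k steps back; the target is the current value."""
    X = np.column_stack(
        [y[n_lags - k : len(y) - k] for k in range(1, n_lags + 1)]
    )
    return X, y[n_lags:]

def rmse(actual, predicted):
    return float(np.sqrt(np.mean((actual - predicted) ** 2)))

# Synthetic hourly series with a 24-hour cycle (stand-in for traffic data).
rng = np.random.default_rng(42)
t = np.arange(24 * 30)
y = 50 + 20 * np.sin(2 * np.pi * t / 24) + rng.normal(0, 2, size=t.size)

n_lags = 24
X, target = make_lag_matrix(y, n_lags)

# Chronological split: the last week is held out, with no shuffling.
split = len(target) - 24 * 7
X_train, X_test = X[:split], X[split:]
y_train, y_test = target[:split], target[split:]

# Regression-style model: least squares on the lagged values plus an intercept.
A_train = np.column_stack([np.ones(len(X_train)), X_train])
coef, *_ = np.linalg.lstsq(A_train, y_train, rcond=None)
pred_ols = np.column_stack([np.ones(len(X_test)), X_test]) @ coef

# Time-series baseline: seasonal naive, i.e. the value observed 24 hours earlier.
pred_naive = X_test[:, n_lags - 1]

scores = {
    "lagged_ols": rmse(y_test, pred_ols),
    "seasonal_naive": rmse(y_test, pred_naive),
}
print(scores)
```

Note that the lagged regression here makes one-step-ahead predictions from observed values, while the seasonal-naive forecast only looks 24 hours back, so the two are not forecasting at the same horizon. A stricter comparison would fix the horizon for both.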

The data comes from a competition hosted by IITM earlier this year, in July. I extended my analysis of traffic congestion after participating in the hackathon.

TODO:

Perform cleaning procedures.
Perform comparison of supervised learning methods.
Find out d, p, q terms for ARIMA models.
Implement time-series algorithms (SARIMAX).
Implement an LSTM model.
Compare time-series methods with regression/supervised methods.
Add a fully functional Jupyter notebook.
Debug LSTM model(s) for full functionality.
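
For the d, p, q item above, a common starting point is to look at the sample autocorrelation before and after differencing: an autocorrelation that decays very slowly suggests differencing (d), and the lag at which the differenced series' ACF (or PACF) falls inside the confidence band hints at q (or p). The sketch below illustrates this on a synthetic random walk using NumPy only. It is an assumption about how the step could be approached, not this repository's code; the helper `acf` is hypothetical, and statsmodels provides tested equivalents such as `adfuller`, `plot_acf` and `plot_pacf`.

```python
import numpy as np

def acf(x, n_lags):
    """Sample autocorrelation for lags 0..n_lags."""
    x = np.asarray(x, dtype=float)
    x = x - x.mean()
    denom = np.dot(x, x)
    return np.array(
        [np.dot(x[: len(x) - k], x[k:]) / denom for k in range(n_lags + 1)]
    )

# Synthetic random walk (assumption): non-stationary by construction, and
# a single difference turns it back into white noise.
rng = np.random.default_rng(1)
walk = np.cumsum(rng.normal(size=500))

acf_raw = acf(walk, 10)
acf_diff = acf(np.diff(walk), 10)

# Approximate 95% band for the autocorrelation of white noise.
band = 1.96 / np.sqrt(len(walk) - 1)

print("lag-1 ACF, raw series:        ", round(float(acf_raw[1]), 3))
print("lag-1 ACF, differenced series:", round(float(acf_diff[1]), 3))
print("approx. 95% band:             ", round(float(band), 3))
```

On real traffic data the picture is usually less clean, with seasonal spikes at daily or weekly lags, which is where the seasonal terms of SARIMAX come in.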

Repository contents:

.ipynb_checkpoints
Plots
.gitignore
Readme.md
Sangam_analysis.py
Test.csv
Train.csv
cython_code.pyx
load_libs.py
sample_submission.csv
setup.py
traffic_congestion.ipynb

shivendra90/traffic_time_series

Folders and files

Latest commit

History

Repository files navigation

Complete Time-Series Analysis on Traffic Congestion

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages