Skip to content

Time series analysis for the dataset provided by Sangam hackathon hosted at hackerearth.com

Notifications You must be signed in to change notification settings

shivendra90/traffic_time_series

Repository files navigation

Complete Time-Series Analysis on Traffic Congestion

This is a complete time-series workflow done in Python for a univariate response. There are a total of 63 features out of which several were manually engineered and contain collinearity that I have treated by simply dropping them and then comparing predictive performance.

This was primarily focussed on including only time-series algorithms but since gradient boosted and some linear methods too have been known to perform good on timestamp data, I therefore decided to perorm a comparative study for both categories.

The data used is basically competitive data hosted by IITM earlier this year, in the month of July. I've only extended my analysis on traffic congestion after my participation in the hackathon.

TODO:

  • Perform cleaning procedures.
  • Perform comparison of supervised learning methods.
  • Find out d, p, q terms for ARIMA models.
  • Implement time-series algorithms (SARIMAX).
  • Implement a LSTM model.
  • Compare time-series methods with regression/supervised methods.
  • Add a fully functional jupyter notebook.
  • Debug LSTM model(s) for full functionality.

About

Time series analysis for the dataset provided by Sangam hackathon hosted at hackerearth.com

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published