Skip to content

Latest commit

 

History

History

m4

M4

The M4 competition was held in 2018 and measured forecast accuracy on 100,000 time series of varying lenghts and frequencies. This repository shows the scalecast approach to the problem. It uses only scikit-learn models with automated hyperparameter and input selection. It intentionally avoids classic statistical methods. Currently, the best OWA achieved by any one model is 0.92 (ElasticNet), which is better than the best Naive benchmark, but not quite as good as ARIMA (0.90) or Theta (0.89). The goal is to see scalecast achieve better results than all benchmarks.

The hourly, weekly, and yearly series are all predicted superbly by models in scalecast. The quarterly, monthly, and daily series need more work.

Notebooks

Resources

Ideas for improvement

  • Stacking models
  • Larger hyperparmaeter grids, especially for tree models
  • Nonlinear feature selection