predict_indicator() #2

sebastian-fox · 2017-06-19T09:04:06Z

A function that predicts the next year value of an indicator.
predict_indicator():

IndicatorID
R asks for area type for prediction (eg, UTLA), then:
- Extracts all indicator data for indicators in same profile(s) at the same geography
- Identifies latest year for target indicator
- Subsets dataframe of latest year information for all indicators
- Creates flat, wide table of remaining indicators with variables for each previous year of data available for each indicator (eg, indicator_x_1yr_previous, indicator_x_2yr_previous, … , indicator_x_nyr_previous)
- Trains and tests model on second latest year for target indicator (maybe multiple machine learning methods)
- Uses best model to predict next year of data for indicator
  - Lasso
  - Glm
  - Svm
  - Randomforest

julianflowers · 2017-08-24T14:46:50Z

https://github.com/julianflowers/Data-science/blob/master/scripts/get_sui_data.R
https://github.com/julianflowers/Data-science/blob/master/suicide_prediction2.Rmd

julianflowers · 2017-08-24T14:50:34Z

This is not so much forecasting but prediction (subtle I know) but prediction seems to be about fitting values to unseen data, forecasting about the future. To forecast next year we would need to be able to estimate all the model inputs as well...

julianflowers · 2018-01-08T09:38:04Z

Have been trying a few other models - xgboost, gbm, brnn...

xgboost seems to be very popular - a bit fiddly
brnn is a bayesian neural network which seems quite accurate

sebastian-fox · 2018-01-08T10:40:00Z

This looks really good. I'm starting to think this belongs to a different package. This package has been reviewed by some rOpenSci reviewers and one of the comments is to reduce dependencies on other packages. That is a good suggestion and helps draw the boundaries around the limits of this package. I think we need to start developing the insights package internally...

… loops with lapply

sebastian-fox added the enhancement label Jun 19, 2017

sebastian-fox self-assigned this Jun 19, 2017

sebastian-fox mentioned this issue Jan 8, 2018

Get some simple stats on Fingertips #37

Closed

sebastian-fox pushed a commit that referenced this issue Jan 12, 2018

Minor updates based on rOpenSci reviewer #2. About to replace all for…

601140d

… loops with lapply

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

predict_indicator() #2

predict_indicator() #2

sebastian-fox commented Jun 19, 2017

julianflowers commented Aug 24, 2017

julianflowers commented Aug 24, 2017 •

edited

Loading

julianflowers commented Jan 8, 2018

sebastian-fox commented Jan 8, 2018

predict_indicator() #2

predict_indicator() #2

Comments

sebastian-fox commented Jun 19, 2017

julianflowers commented Aug 24, 2017

julianflowers commented Aug 24, 2017 • edited Loading

julianflowers commented Jan 8, 2018

sebastian-fox commented Jan 8, 2018

julianflowers commented Aug 24, 2017 •

edited

Loading