Integration with Modeltime #2

Closed · mdancho84 opened this issue Jul 10, 2020 · 7 comments
@mdancho84 (Collaborator) commented Jul 10, 2020

Hey @simonpcouch & @topepo

I'd like to open this issue to keep track of how I plan to use stacks within the modeltime forecasting framework. There shouldn't be anything additional required on your part to make the integration happen. On my end, I'll simply allow a model_stack that has been "fitted" (i.e., one that contains a "member_fits" list element) into modeltime_table().
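For reference, a minimal sketch of the kind of check I have in mind (the helper name `is_fitted_stack` is hypothetical, and it assumes fitted stacks keep their trained members in a `member_fits` element):

```r
# Hypothetical helper sketching how modeltime could verify that a stack
# is fitted before accepting it; assumes fitted stacks carry their
# trained members in a "member_fits" element.
is_fitted_stack <- function(object) {
  inherits(object, "model_stack") && !is.null(object[["member_fits"]])
}
```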

Once stacks is released, just be aware that changing argument names or object class names will break modeltime until I can catch up.


Plan

The goal is to integrate model_stack objects into the modeltime forecasting workflow similar to how I integrate workflow objects.

It's quite simple - add the fitted model_stack to a Modeltime Table just like you add a fitted workflow.

[Screenshot: a fitted model stack added to a Modeltime Table alongside fitted workflows]

Then the fitted model stack will follow the same forecasting workflow.

[Screenshot: the standard modeltime forecasting workflow applied to the fitted model stack]
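A rough sketch of the intended usage, assuming the integration lands as planned (`fitted_stack`, `test_tbl`, and `full_tbl` are placeholder objects):

```r
library(dplyr)
library(modeltime)

# A fitted model_stack drops into a Modeltime Table alongside fitted
# workflows (this is the planned integration, not current behavior).
models_tbl <- modeltime_table(fitted_stack)

# From there, the standard calibrate/forecast steps apply unchanged.
models_tbl %>%
  modeltime_calibrate(new_data = test_tbl) %>%
  modeltime_forecast(new_data = test_tbl, actual_data = full_tbl)
```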

To achieve this result, there are only a few requirements: intricacies of time series cross-validation and of the modeltime forecasting workflow that you need to be aware of.

Modeltime Forecasting Workflow Requirements:

  • Objects must be "fitted" stacks, meaning a predict() method can be called on them and works just like calling predict(workflow, new_data). There should therefore be an easy way to determine whether a stack has been fitted, since only a fitted stack can be added to a Modeltime Table. It looks like this can be detected by checking whether the model has a "member_fits" element.
  • For sequential models (e.g. ARIMA, Exponential Smoothing, RNN, LSTM), stacks must preserve the time-based sequence. Note that this is only required for sequential models; non-sequential models like Random Forest can use ordinary cross-validation. rsample and timetk should already take care of this (see the resampling sketch after this list).
    • Cross Validation: rsample::rolling_origin() or timetk::time_series_cv() as the grid tuning strategy
    • Final Evaluation: rsample::initial_time_split() or timetk::time_series_split() as the final training and evaluation sets.
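A sketch of the time-ordered resampling setup for sequential models (`data_tbl`, its `date` column, and the period strings are all placeholders):

```r
library(timetk)

# Grid-tuning resamples that preserve time order, for sequential models;
# the period arguments are illustrative.
resamples <- time_series_cv(
  data       = data_tbl,
  date_var   = date,
  initial    = "12 months",
  assess     = "3 months",
  skip       = "3 months",
  cumulative = TRUE
)

# Final evaluation: an ordered train/test split rather than a random one.
splits <- time_series_split(
  data_tbl,
  date_var = date,
  initial  = "12 months",
  assess   = "3 months"
)
```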
@simonpcouch (Collaborator)

Sounds great! Thanks for all the detail. The features you're relying on feel like they should be pretty stable right now, but I'll make sure to let you know if any of them change.

@mdancho84 (Collaborator, Author)

The only thing I need is to get the predict.model_stack() function working. Once that happens, I can begin testing with modeltime.
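For context, the call I need mirrors the workflows interface (`fitted_stack` and `future_tbl` are placeholders):

```r
# Assuming predict.model_stack() ends up mirroring predict() on a
# fitted workflow, modeltime would call it the same way:
preds <- predict(fitted_stack, new_data = future_tbl)
```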

@simonpcouch (Collaborator)

Yeah, that's up for the next week or two! I'll drop a note here when we get the basics working.

@mdancho84 (Collaborator, Author)

Ok, that would be great. I'll then work on modeltime integration. I'm excited about this!

@mdancho84 (Collaborator, Author)

Once we get the naming conventions down in #13, I’m going to begin working on the Modeltime integration.

One concern I have is the butcher integration (#10). I have a modeltime_refit() method that retrains models on new data. I'm thinking the stack's member models will need to be retrained to refit the stack on the full time series. My plan is to use the member models, so I'm hoping butcher won't chop those out.
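For reference, the refit step in question looks roughly like this (`calibration_tbl` and `full_tbl` are placeholders for a calibrated Modeltime Table and the complete series):

```r
library(dplyr)
library(modeltime)

# Retrain every model in the table on the complete time series. For a
# stack, this is where the member models would still need to be present
# (i.e., not removed by butcher).
refit_tbl <- calibration_tbl %>%
  modeltime_refit(data = full_tbl)
```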

@simonpcouch (Collaborator)

Fair warning that development will probably slow down quite a bit for the next month or so and then pick back up. I don't imagine we'd undergo any changes re: #13 beyond finding and replacing function names, so the API should otherwise remain stable in that sense. :-)

Still need to spend more time on the butcher methods before I'll have a good sense of which operations will still be able to be carried out. Thinking about what a refit would look like, though: if that method uses a new training set, you'd probably need to start back up at the tuning-candidates step, since the data stack is made up of the collated assessment-set predictions from the tune results.
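To make that concrete, a sketch of rebuilding the stack under a new training set (function names per the current plan in #13, so they may shift; `model_res_1` and `model_res_2` stand in for tuning results computed on the new resamples):

```r
library(dplyr)
library(stacks)

# The data stack is built from out-of-sample (assessment set)
# predictions collected during tuning, so new training data means
# re-tuning the candidates before the stack can be re-blended.
model_stack <- stacks() %>%
  add_candidates(model_res_1) %>%
  add_candidates(model_res_2) %>%
  blend_predictions() %>%  # re-estimate the stacking coefficients
  fit_members()            # refit member models on the new training data
```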

github-actions bot commented Mar 6, 2021

This issue has been automatically locked. If you believe you have found a related problem, please file a new issue (with a reprex: https://reprex.tidyverse.org) and link to this issue.

github-actions bot locked and limited conversation to collaborators on Mar 6, 2021