Staged prediction/incremental fitting support #304

Open
pkhokhlov opened this issue Nov 29, 2021 · 4 comments
Labels
enhancement New feature or request

Comments

@pkhokhlov

Hi @interpret-ml

  1. Do you plan on adding staged prediction similar to XGBoost's ntree_limit/iteration_range?
  2. Do you plan on adding warm starts similar to scikit-learn's GBM warm_start?
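
For reference, here's a minimal sketch of the two patterns I mean, as they exist today in xgboost and scikit-learn (illustration of the request only, not interpret code):

```python
# Sketch of the two APIs referenced above (xgboost and scikit-learn, not interpret).
import numpy as np
import xgboost as xgb
from sklearn.ensemble import GradientBoostingClassifier

rng = np.random.default_rng(0)
X = rng.random((200, 5))
y = (X[:, 0] > 0.5).astype(int)

# 1. Staged prediction: one fitted model, predictions from only the first k rounds.
bst = xgb.XGBClassifier(n_estimators=100).fit(X, y)
p_first_50 = bst.predict_proba(X, iteration_range=(0, 50))  # use only the first 50 trees

# 2. Warm start: keep the existing trees and add more on the next fit() call.
gbm = GradientBoostingClassifier(n_estimators=50, warm_start=True).fit(X, y)
gbm.n_estimators = 100
gbm.fit(X, y)  # continues boosting from round 50 to round 100
```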

Thanks for your great work on this library.

@interpret-ml
Collaborator

Hi @pkhokhlov --

We've talked internally about ways to expose more customization in the boosting stages, mostly in order to give the caller better ways to control pair selection. Allowing for something that looks like scikit-learn's warm_start is one option for handling that. We don't have immediate plans to work on this given other priorities, although we do view it as important in the medium term.

Staged prediction isn't something we've considered. Our model class does a lot more boosting than XGBoost; reaching into the millions of boosting steps is a typical scenario, so the extra storage required to preserve the per-iteration information for an iteration_range feature would be considerable. We have considered using algorithms internally that would require preserving a window of the last N boosting steps, but even there the plan has been to throw that information away once the model is complete.

Can you give us some details on how you'd like to use these features?

-InterpretML team

@pkhokhlov
Author

@interpret-ml Rather than training N separate models to K, 2K, ..., N*K iterations, I'd like to train a single model to N*K iterations and then make predictions on the validation set using only the first K, 2K, ..., N*K iterations, to see model performance as a function of the iteration count. This is very helpful for understanding convergence and overfitting.

Warm start would achieve something similar by letting me train the model "manually" to K iterations and save it, then continue training from K to 2K and save that, and so on. If millions of boosting iterations are typical, then the warm-start approach is definitely the more viable of the two.
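
To make that concrete, here's a sketch of the staging loop using scikit-learn's GBM as a stand-in for a hypothetical warm-startable EBM (the EBM side of this doesn't exist today):

```python
# Sketch of the manual staging loop described above; scikit-learn's GBM stands in
# for a hypothetical EBM with warm-start support.
from sklearn.datasets import make_classification
from sklearn.ensemble import GradientBoostingClassifier
from sklearn.metrics import log_loss
from sklearn.model_selection import train_test_split

X, y = make_classification(n_samples=2000, random_state=0)
X_train, X_val, y_train, y_val = train_test_split(X, y, random_state=0)

K, N = 50, 10
model = GradientBoostingClassifier(n_estimators=K, warm_start=True, random_state=0)
val_curve = []
for stage in range(1, N + 1):
    model.n_estimators = stage * K   # grow to K, 2K, ..., N*K iterations
    model.fit(X_train, y_train)      # warm start: reuses the trees already fit
    proba = model.predict_proba(X_val)
    val_curve.append((stage * K, log_loss(y_val, proba)))  # performance vs. iterations

for n_iter, loss in val_curve:
    print(f"{n_iter:5d} iterations: validation log-loss = {loss:.4f}")
```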

Let me know if anything is unclear.

@bverhoeff

bverhoeff commented Dec 15, 2021

If I understand it correctly, warm_start functionality, in combination with the merge function, would enable federated learning.

This would be a very interesting use case. Or is there another way to continue training an EBM?

A well-performing explainable algorithm such as EBM that enables federated learning would solve some major problems in AI development.

@paulbkoch
Collaborator

There's a longer discussion of an API for staged prediction in #403. This issue is slightly different in that it's about continuing the boosting of an existing model without changing any of the parameters, but you could imagine using the other API to handle warm starts as a special case by passing in the same parameters at each stage without modification.

One issue I see with warm starts in general is that we internally choose which pairs to boost on after fitting the mains, and the warm-start methodology doesn't fit neatly into that scenario. If you wanted to boost just 5 rounds, for instance, would we pick the pairs after those 5 rounds? If we did, the selected pairs would probably not be very good, and then I suppose we'd boost on those pairs for 5 rounds. If someone wanted to warm start this model later, presumably we'd keep the already-selected pairs. I could still see warm starts being a useful feature if the model consists only of mains or of explicitly specified pairs.

@bverhoeff -- The merge_ebms function is now fully supported and available in the latest v0.3.0 release. You should be able to do federated learning with it now without needing the warm start functionality.
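
A minimal sketch of that federated-style workflow (the data split below is just an illustrative stand-in for separate silos):

```python
# Minimal sketch of federated-style training with merge_ebms: train one EBM
# per data silo, then combine them into a single model (per the v0.3.0 release above).
from interpret.glassbox import ExplainableBoostingClassifier, merge_ebms
from sklearn.datasets import make_classification

# Illustrative stand-in for two data silos that cannot share raw data.
X, y = make_classification(n_samples=2000, random_state=0)
X_a, y_a = X[:1000], y[:1000]
X_b, y_b = X[1000:], y[1000:]

ebm_a = ExplainableBoostingClassifier().fit(X_a, y_a)  # trained at site A
ebm_b = ExplainableBoostingClassifier().fit(X_b, y_b)  # trained at site B

merged = merge_ebms([ebm_a, ebm_b])  # single EBM combining both sites
```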
