
[Feature request] Arbitrary base learner #5802

Open
zachmayer opened this issue Jun 16, 2020 · 7 comments

@zachmayer

It's pretty cool that I can define my own loss function and gradient for xgboost, and then use the linear, tree, or dart base learners to optimize it.

It'd be really cool if I could specify my own base learner, perhaps in the form of an sklearn class with a fit method, a predict method, and support for sample weights.

Being able to use the XGBoost algorithm to fit a wider range of base learners would really open up a whole new world of possibilities.
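
To make the request concrete, here's a rough sketch of the kind of interface I have in mind. The class below is just an illustration (not an existing xgboost or sklearn API): anything with `fit(X, y, sample_weight=...)` and `predict(X)` would qualify.

```python
import numpy as np
from sklearn.base import BaseEstimator, RegressorMixin


class WeightedMeanLearner(BaseEstimator, RegressorMixin):
    """Toy base learner: predicts the (weighted) mean of the targets.

    sample_weight support matters because boosting re-weights examples
    between rounds.
    """

    def fit(self, X, y, sample_weight=None):
        y = np.asarray(y, dtype=float)
        if sample_weight is None:
            sample_weight = np.ones_like(y)
        self.mean_ = float(np.average(y, weights=sample_weight))
        return self

    def predict(self, X):
        return np.full(len(X), self.mean_)
```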

@hcho3
Collaborator

hcho3 commented Jun 16, 2020

@zachmayer Is StackingClassifier / StackingRegressor an option for you? We recently added support for it: #5780
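
For reference, a rough sketch of what that looks like (the dataset and hyperparameters here are just placeholders):

```python
from sklearn.datasets import load_breast_cancer
from sklearn.ensemble import RandomForestClassifier, StackingClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split
from xgboost import XGBClassifier

X, y = load_breast_cancer(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

# XGBoost sits alongside other sklearn estimators, with a logistic
# regression meta-learner stacked on top of their predictions.
stack = StackingClassifier(
    estimators=[
        ("xgb", XGBClassifier(n_estimators=200, max_depth=3)),
        ("rf", RandomForestClassifier(n_estimators=200)),
    ],
    final_estimator=LogisticRegression(max_iter=1000),
)
stack.fit(X_train, y_train)
print(stack.score(X_test, y_test))
```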

@hcho3
Collaborator

hcho3 commented Jun 16, 2020

Oops, my bad. When you say "base learner," you mean that you want to fit a boosted ensemble consisting of your custom models?

@zachmayer
Author

zachmayer commented Jun 16, 2020 via email

@tunguz

tunguz commented Jun 17, 2020

Here is a related issue that has just been opened:

rapidsai/cuml#2435

Adding AdaBoost to cuml might be a good stop-gap measure.

@zachmayer
Author

sklearn's AdaBoost already supports arbitrary base learners: sklearn.ensemble.AdaBoostClassifier.
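
Rough sketch (any classifier that accepts sample weights works as the base learner; note the parameter is `estimator` in recent scikit-learn releases and `base_estimator` in older ones):

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import AdaBoostClassifier
from sklearn.tree import DecisionTreeClassifier

X, y = make_classification(n_samples=500, random_state=0)

# Any classifier that supports sample weights can be plugged in here.
ada = AdaBoostClassifier(
    estimator=DecisionTreeClassifier(max_depth=3),  # `base_estimator=` in older sklearn
    n_estimators=100,
)
ada.fit(X, y)
print(ada.score(X, y))
```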

XGBoost is way better than AdaBoost though, and supports a bunch of features AdaBoost doesn't have (rough sketch of all five below):

  1. You can specify an arbitrary loss function in xgboost.
  2. You can specify a gradient for your loss function and use the gradient in your base learner.
  3. You can specify an arbitrary evaluation function in xgboost.
  4. You can do early stopping with xgboost.
  5. You can run xgboost base learners in parallel, to mix "random forest" type learning with "boosting" type learning.
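
A minimal sketch of points 1–5 with the native xgboost API (the data and hyperparameters are placeholders, and newer xgboost releases spell `feval` as `custom_metric`):

```python
import numpy as np
import xgboost as xgb

rng = np.random.default_rng(0)
X = rng.normal(size=(1000, 10))
y = X[:, 0] + rng.normal(scale=0.1, size=1000)
dtrain = xgb.DMatrix(X[:800], label=y[:800])
dvalid = xgb.DMatrix(X[800:], label=y[800:])


# (1) + (2): custom loss, supplied as its gradient and hessian.
def squared_error_obj(predt, dtrain):
    grad = predt - dtrain.get_label()
    hess = np.ones_like(predt)
    return grad, hess


# (3): custom evaluation metric.
def rmse_eval(predt, dtrain):
    err = predt - dtrain.get_label()
    return "my-rmse", float(np.sqrt(np.mean(err ** 2)))


booster = xgb.train(
    {
        "eta": 0.1,
        "max_depth": 3,
        "num_parallel_tree": 5,        # (5): grow a small "forest" each boosting round
        "disable_default_eval_metric": 1,
    },
    dtrain,
    num_boost_round=500,
    obj=squared_error_obj,
    feval=rmse_eval,
    evals=[(dvalid, "valid")],
    early_stopping_rounds=20,          # (4): early stopping on the custom metric
)
```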

@zachmayer
Author

@tunguz "Run adaboost on a gpu" isn't really what I'm looking for.

"Run adaboost with an arbitrary base learner, arbitrary loss function, arbitrary gradient, arbitrary evaluation, early stopping, and a mix of parallel learners (aka bagging) and boosting" would suit my needs, but that's another way to say "run xgboost with an arbitrary base learner" 😁

@zachmayer
Author

Just a follow-up on this:

  • ngboost supports arbitrary base learners, which solves the problem for me for now (minimal sketch below).
  • There's an interesting new package called GrowNet, which shows some evidence that boosting different weak learners (specifically neural networks) is useful. (There's a paper too.)
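
Roughly what that looks like with ngboost (a sketch based on my understanding of its `Base`/`Dist`/`Score` parameters; the tree settings are just placeholders):

```python
from ngboost import NGBRegressor
from ngboost.distns import Normal
from ngboost.scores import LogScore
from sklearn.datasets import load_diabetes
from sklearn.tree import DecisionTreeRegressor

X, y = load_diabetes(return_X_y=True)

# Swap in any sklearn regressor with fit/predict as the base learner.
ngb = NGBRegressor(
    Base=DecisionTreeRegressor(criterion="friedman_mse", max_depth=4),
    Dist=Normal,
    Score=LogScore,
    n_estimators=300,
)
ngb.fit(X, y)
print(ngb.predict(X[:5]))
```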
