Enable weighted sampling of features #5308

lutzvdb · 2020-02-14T13:50:08Z

In our random forest models (built with the R package "ranger") we frequently use weighted sampling of features (the "split.select.weights" argument; see the official ranger documentation). I would like to apply the same logic to xgboost trees: in my use case (time series analysis) I often know which variables matter more for a given forecast period. I already use feature sampling via colsample_bytree and colsample_bynode, so I would greatly appreciate being able to set the per-column sampling probabilities.

hcho3 · 2020-02-15T01:48:02Z

Related: #3754, #4230

trivialfis · 2020-07-16T07:50:37Z

I will look into this.

trivialfis added the feature-request label Feb 14, 2020

trivialfis mentioned this issue Jul 30, 2020 in Feature weights #5962 (merged)

trivialfis closed this as completed in #5962 Aug 18, 2020
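
For readers landing on this thread, the request amounts to drawing the column subset (the one that colsample_bytree or colsample_bynode would pick uniformly) with probability proportional to a per-feature weight, without replacement. The sketch below illustrates that idea in plain Python. It is not the code from #5962, and the function name `sample_features` and its parameters are hypothetical; the technique used here is the Efraimidis-Spirakis key method, chosen only because it is short.

```python
import random


def sample_features(weights, colsample, rng=None):
    """Pick int(colsample * n) column indices without replacement.

    Hypothetical helper for illustration only. Columns with larger
    weights are more likely to be picked; zero-weight columns are
    never picked.
    """
    rng = rng or random.Random()
    n = len(weights)
    k = max(1, int(colsample * n))  # at least one column is always kept

    keyed = []
    for idx, w in enumerate(weights):
        if w < 0:
            raise ValueError("feature weights must be non-negative")
        if w == 0:
            continue  # a zero weight excludes the column entirely
        # Efraimidis-Spirakis: key = u ** (1 / w), keep the k largest keys.
        u = rng.random()
        keyed.append((u ** (1.0 / w), idx))

    keyed.sort(reverse=True)
    return sorted(idx for _, idx in keyed[:k])


# Example: four columns, half of them sampled per tree or node.
picked = sample_features([1.0, 0.0, 5.0, 2.0], colsample=0.5,
                         rng=random.Random(0))
print(picked)
```

With uniform weights this reduces to ordinary column subsampling. The name and placement of the actual option added by #5962 is not stated in this thread, so check the XGBoost documentation before relying on it.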
