Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Enable weighted sampling of features #5308

Closed
lutzvdb opened this issue Feb 14, 2020 · 2 comments · Fixed by #5962
Closed

Enable weighted sampling of features #5308

lutzvdb opened this issue Feb 14, 2020 · 2 comments · Fixed by #5962

Comments

@lutzvdb
Copy link

lutzvdb commented Feb 14, 2020

In our random forest models (using the R package "ranger") we frequently use weighted sampling of features (argument "split.select.weights", compare official documentation here ). I would like to apply the same logic to xgboost trees, as in my usecase I frequently know which variables play a more important role for a certain forecast period (we do time series analysis). I already utilize feature sampling using colsample_bytree and colsample_bynode, so I would greatly appreciate being able to alter the probabilities for column sampling.

@hcho3
Copy link
Collaborator

hcho3 commented Feb 15, 2020

Related: #3754, #4230

@trivialfis
Copy link
Member

I will look into this.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging a pull request may close this issue.

3 participants