-
-
Notifications
You must be signed in to change notification settings - Fork 8.7k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
xgboost very slow for classification with many classes #2926
Comments
XGBoost uses regression trees, so build 120 (=num_class) trees per iteration. That issue will be helpfull. |
Does this mean xgboost will always be slow for multi class classification? You accidentally linked back to this issue in your comment (2926). Did you mean to link to another one? |
@lesshaste Sorry, I fix the link. |
There is an inefficiently in prediction caching that results in nclass^2 complexity #1689 (comment) |
That would be awesome! Thank you. |
This recent paper might be another potential solution to increase multiclass speed. |
Environment info
Ubuntu
Compiler:
gcc
Package used (python/R/jvm/C++):
python
xgboost
version used:0.6
If installing from source, please provide
git rev-parse HEAD
)git rev-parse HEAD
3dcf966
If you are using python package, please provide
Python 3.5.2
xgboost
if you are not installing from sourceInstalled from source
Steps to reproduce
Run the following self contained code which also creates fake data for classification with 120 classes:
What have you tried?
The RandomForestClassifier takes about 15 seconds but xgboost never terminates at all for me even after 10 minutes.
The text was updated successfully, but these errors were encountered: