New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Speed up catboost fit: change catboost default and automl parameter ranges #998
Conversation
Codecov Report
@@ Coverage Diff @@
## main #998 +/- ##
=======================================
Coverage 99.86% 99.86%
=======================================
Files 179 179
Lines 9424 9424
=======================================
Hits 9411 9411
Misses 13 13
Continue to review full report at Codecov.
|
"eta": Real(0.000001, 1), | ||
"max_depth": Integer(1, 16), | ||
"max_depth": Integer(4, 10), |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Interesting to note we already had the regressor set with this range, but not the classifier.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM. Its going to be fun with more experimentation later on!
I was just testing #337 with the titanic dataset. I noticed that on |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Really like the analysis w the graphs and all of the data sets-excited to see the perf testing being used to change parameters to speed up evalml!
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@dsherry Very cool analysis!
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM!
Fix #979
Two changes to speed up catboost fit times:
n_estimators
n_estimators
andmax_depth
Performance results and discussion here.
Hopefully we can do more tuning after this and open these ranges up some more.
We have other features on the roadmap which will help with this too.