This repository has been archived by the owner on Nov 14, 2023. It is now read-only.

Warm start for ensembles #90

Merged: 13 commits into ray-project:master on Sep 8, 2020

Conversation

inventormc
Collaborator

Add functionality to warm-start ensemble estimators in sklearn. This relies on incrementing n_estimators one at a time to get the effect of fitting one estimator at a time in the ensemble. Unfortunately, this means the user can't cross-validate the n_estimators parameter if they choose to use early stopping.
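A minimal sketch of the underlying scikit-learn pattern, using RandomForestClassifier as an assumed example (the PR applies the same idea generically through the estimator config):

from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier

X, y = make_classification(n_samples=200, random_state=0)

# With warm_start=True, each fit() call keeps the trees trained so far and
# only grows the ones added by the larger n_estimators, so incrementing
# n_estimators by 1 per call fits one estimator per "iteration".
clf = RandomForestClassifier(n_estimators=0, warm_start=True, random_state=0)
for _ in range(10):
    clf.set_params(n_estimators=clf.get_params()["n_estimators"] + 1)
    clf.fit(X, y)
    # an early-stopping loop would score clf here and break once it stops improving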

@inventormc
Collaborator Author

Still need to add some tests and documentation

@inventormc added the wip (Work in progress) label on Sep 5, 2020
Comment on lines +20 to +23
def check_warm_start_ensemble(estimator):
    from sklearn.ensemble import BaseEnsemble
    is_ensemble_subclass = issubclass(type(estimator), BaseEnsemble)

Collaborator

Should you update the above? check_warm_start can probably return True if the estimator is an ensemble now.

Collaborator Author

I chose to separate it because warm start on non-ensemble estimators means we set max_iter=1, but warm start on ensemble estimators means we set n_estimators=1. I thought it'd be easier to handle these different cases with two separate checks.
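For illustration, a sketch of what two separate checks could look like; the names and exact criteria here are assumptions, not necessarily the code in this PR:

from sklearn.ensemble import BaseEnsemble

def can_warm_start_iter(estimator):
    # Iterative estimators (e.g. SGDClassifier) expose warm_start together
    # with max_iter, which can be pinned to 1 per training step.
    params = estimator.get_params()
    return "warm_start" in params and "max_iter" in params

def can_warm_start_ensemble(estimator):
    # Ensemble estimators (e.g. RandomForestClassifier) expose warm_start
    # together with n_estimators, which is incremented by 1 per step.
    params = estimator.get_params()
    return isinstance(estimator, BaseEnsemble) and "warm_start" in params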

Collaborator

can you then rename the above to can_warm_start_iter?

Comment on lines 158 to 162
updated_n_estimators = self.estimator_list[i].get_params()["n_estimators"] + 1
self.estimator_list[i].set_params(**{"n_estimators": updated_n_estimators})
self.estimator_list[i].fit(X_train, y_train)
Collaborator

nit: pull this out into a local variable, e.g. estimator = self.estimator_list[i]

@@ -92,6 +97,10 @@ def _setup(self, config):
            self.estimator_config["warm_start"] = True
            self.estimator_config["max_iter"] = 1

        if not self._can_partial_fit() and self._can_warm_start_ensemble():
            self.estimator_config["warm_start"] = True
            self.estimator_config["n_estimators"] = 0
Collaborator

Add a comment about implementation?

@@ -405,6 +405,10 @@ def _fit(self, X, y=None, groups=None, **fit_params):
            mode="max",
            scope="last")
        self.best_params = self._clean_config_dict(best_config)
        if not check_partial_fit(
Collaborator

BTW, I think we should actually log something here. This behavior is somewhat of a "surprise", so it would be nice to be upfront that this methodology is being used.
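For example, a hedged sketch of the kind of message meant here; best_estimator and the wording are illustrative, not what was actually added:

import logging

logger = logging.getLogger(__name__)

# Illustrative only: surface that early stopping is being emulated through a
# warm-started ensemble rather than partial_fit.
if not check_partial_fit(best_estimator) and check_warm_start_ensemble(best_estimator):
    logger.warning(
        "Estimator does not support partial_fit; early stopping is emulated "
        "by warm-starting the ensemble and incrementing n_estimators.")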

@richardliaw richardliaw merged commit f5e7366 into ray-project:master Sep 8, 2020
@Yard1
Member

Yard1 commented Sep 12, 2020

Wouldn't it be a good idea to explicitly raise an exception if the user tries to tune n_estimators with early stopping? I worry that warnings will go unnoticed.

@richardliaw
Collaborator

@Yard1 Yes, that's a good idea. Let me make a quick issue.
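A hedged sketch of such a guard; the function and parameter names are hypothetical, not the fix that was eventually filed:

def _validate_warm_start_ensemble_params(param_distributions, early_stopping):
    # Hypothetical check: tuning n_estimators conflicts with warm-start based
    # early stopping, which increments n_estimators internally.
    if early_stopping and "n_estimators" in param_distributions:
        raise ValueError(
            "Cannot tune n_estimators when early stopping is enabled, because "
            "warm-started ensembles control n_estimators internally.")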
