[MRG] accelerate stacking-classifier in test_common.py::test_ensemble_heterogeneous_estimators_behavior #21562

chritter · 2021-11-05T12:23:47Z

Reference Issues/PRs

Towards #21407

What does this implement/fix? Explain your changes.

These changes speed up the test case test_common.py::test_ensemble_heterogeneous_estimators_behavior.

Any other comments?

#DataUmbrella Sprint

ogrisel · 2021-11-05T14:29:58Z

sklearn/ensemble/tests/test_common.py

@@ -32,7 +32,8 @@
                    ("lr", LogisticRegression()),
                    ("svm", LinearSVC()),
                    ("rf", RandomForestClassifier()),


Thanks! I think it would be possible to make it run even faster with:

Suggested change

("rf", RandomForestClassifier()),

("rf", RandomForestClassifier(n_estimators=5, max_depth=3)),

Could you also update the related parametrized config for the same test function below?

Please also report the timings you get when running this test on your local machine with --durations=10.
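One way to collect the requested timings is sketched below. It only assembles and prints the pytest command line for this test with `--durations=10`. The helper names `durations_command` and `run_with_durations` are hypothetical, and the sketch assumes it is run from the root of a scikit-learn source checkout with pytest installed.

```python
import subprocess
import sys


def durations_command(test_id, n_slowest=10):
    """Build a pytest command line that reports the n slowest test durations."""
    return [sys.executable, "-m", "pytest", test_id, f"--durations={n_slowest}"]


def run_with_durations(test_id, n_slowest=10):
    """Run the test in a subprocess and return pytest's exit code."""
    result = subprocess.run(durations_command(test_id, n_slowest), check=False)
    return result.returncode


# Test file path taken from the review above; the working directory is assumed
# to be the repository root.
TEST_ID = (
    "sklearn/ensemble/tests/test_common.py"
    "::test_ensemble_heterogeneous_estimators_behavior"
)

command = durations_command(TEST_ID)
print(" ".join(command))
```

Calling `run_with_durations(TEST_ID)` would then execute the test and print pytest's duration report to the terminal.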

@ogrisel Thank you very much for your suggestions. Note that this PR is still WIP and I plan to finish the optimization with your advice and the appropriate measurements reported.

chritter · 2021-11-08T14:04:00Z

Speed improvements, reported as test duration averaged over 5 runs:

Setup	Test Duration
Original	2.52s
with cv=2	1.98s
with cv=2 + RandomForestClassifier(n_estimators=5, max_depth=3)	0.24s

Note: With the final improvement, the individual fits take: LR 27ms, RF 17ms, SVM 1ms.
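The three setups in the table can be compared with a rough sketch like the one below. It is not the test code from this PR: the dataset (`make_classification` with 100 samples), the helper `build_stacking`, and the fixed `random_state` are assumptions made for illustration, and absolute timings will differ from machine to machine.

```python
import time

from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier, StackingClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.svm import LinearSVC


def build_stacking(cv=None, rf_params=None):
    """Hypothetical helper: stacking classifier with the three base estimators."""
    rf = RandomForestClassifier(random_state=0, **(rf_params or {}))
    return StackingClassifier(
        estimators=[("lr", LogisticRegression()), ("svm", LinearSVC()), ("rf", rf)],
        cv=cv,  # None keeps the default cross-validation of StackingClassifier
    )


# Assumed toy dataset; the data used by the real test may differ.
X, y = make_classification(n_samples=100, random_state=0)

setups = {
    "original": build_stacking(),
    "cv=2": build_stacking(cv=2),
    "cv=2 + small RF": build_stacking(
        cv=2, rf_params={"n_estimators": 5, "max_depth": 3}
    ),
}

timings = {}
for name, clf in setups.items():
    start = time.perf_counter()
    clf.fit(X, y)
    timings[name] = time.perf_counter() - start
    print(f"{name}: {timings[name]:.3f}s")
```

Fewer cross-validation folds reduce how often each base estimator is refit, and a forest of 5 shallow trees is far cheaper to fit than the default of 100 trees, which is consistent with the random forest change giving the largest gain in the table.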

chritter · 2021-11-08T14:21:23Z

Based on @ogrisel's suggestion, I have committed the same random forest speed optimization (n_estimators=5, max_depth=3) for the voting-classifier, stacking-regressor and voting-regressor cases. The following speed-ups are achieved (each averaged over 5 measurements). To speed up the stacking-regressor I have also used cv=2.

Algo	Setup	Test Duration
VotingClassifier	Original	0.60s
VotingClassifier	Optimized	0.05s
StackingRegressor	Original	0.41s
StackingRegressor	Optimized	0.10s
VotingRegressor	Original	0.46s
VotingRegressor	Optimized	0.04s
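As a quick check of what these numbers mean in relative terms, the sketch below computes the speed-up factor per case from the durations reported in this thread. The first entry is labeled StackingClassifier on the assumption that the earlier three-row table refers to the stacking-classifier case named in the PR title.

```python
# (original, optimized) test durations in seconds, copied from the reported tables.
durations = {
    "StackingClassifier": (2.52, 0.24),
    "VotingClassifier": (0.60, 0.05),
    "StackingRegressor": (0.41, 0.10),
    "VotingRegressor": (0.46, 0.04),
}

# Speed-up factor = original duration / optimized duration.
speedups = {
    name: original / optimized
    for name, (original, optimized) in durations.items()
}

for name, factor in speedups.items():
    print(f"{name}: {factor:.1f}x faster")

# Total wall-clock time saved across the four parametrized cases.
total_saved = sum(original - optimized for original, optimized in durations.values())
print(f"total saved: {total_saved:.2f}s")
```

The stacking regressor gains the least because its optimized run still takes 0.10s, while the other three cases drop to a tenth or less of their original duration.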


thomasjpfan

Thank you for the PR @chritter !

LGTM!

ogrisel

Thanks very much! LGTM.


accelerated test ensemble stacking-classifier cv

a0efe21

github-actions bot added the module:ensemble label Nov 5, 2021

chritter changed the title ~~accelerated test ensemble stacking-classifier cv~~ [WIP] accelerate stacking-classifier in test_common.py::test_ensemble_heterogeneous_estimators_behavior Nov 5, 2021

ogrisel added the No Changelog Needed label Nov 5, 2021

ogrisel reviewed Nov 5, 2021


ogrisel mentioned this pull request Nov 5, 2021

Meta-issue: accelerate the slowest running tests #21407

Closed

24 tasks

accelerate test ensemble stacking-classifier rf

f33eb87

accelerate test ensemble voting-class, regressors

8c3592d

chritter added 2 commits November 8, 2021 21:31

test-speed-improvement StackingRegressor

ad63b77

Merge branch 'main' of https://github.com/scikit-learn/scikit-learn i…

f94478c

…nto test-speed-improvement-stacking-classifier

chritter changed the title ~~[WIP] accelerate stacking-classifier in test_common.py::test_ensemble_heterogeneous_estimators_behavior~~ [MRG] accelerate stacking-classifier in test_common.py::test_ensemble_heterogeneous_estimators_behavior Nov 9, 2021

chritter added 2 commits November 9, 2021 06:18

Merge branch 'main' of https://github.com/scikit-learn/scikit-learn i…

a1231a4

…nto test-speed-improvement-stacking-classifier

Merge branch 'main' of https://github.com/scikit-learn/scikit-learn i…

5856237

…nto test-speed-improvement-stacking-classifier

reshamas added the Sprint label Nov 16, 2021

thomasjpfan approved these changes Nov 24, 2021


ogrisel approved these changes Dec 6, 2021


ogrisel merged commit 5f3d1e5 into scikit-learn:main Dec 6, 2021

thomasjpfan pushed a commit to thomasjpfan/scikit-learn that referenced this pull request Dec 9, 2021

[MRG] accelerate stacking-classifier in test_common.py::test_ensembl…

3cd6d86

…e_heterogeneous_estimators_behavior (scikit-learn#21562)

glemaitre pushed a commit to glemaitre/scikit-learn that referenced this pull request Dec 24, 2021

[MRG] accelerate stacking-classifier in test_common.py::test_ensembl…

c7527ad

…e_heterogeneous_estimators_behavior (scikit-learn#21562)

glemaitre pushed a commit that referenced this pull request Dec 25, 2021

[MRG] accelerate stacking-classifier in test_common.py::test_ensembl…

fef1087

…e_heterogeneous_estimators_behavior (#21562)
