Add random_state parameter to stacking cv estimators #523

qiagu · 2019-04-29T02:16:01Z

Description

The idea just came to my mind today. Since the latest check_cv from scikit-learn supports random_state, stacking CV estimators can have the parameter now.
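To make the proposal concrete, here is a minimal sketch of how an estimator could turn its `cv`, `shuffle`, and `random_state` arguments into a splitter. It is not the code from this PR: `build_splitter` is a hypothetical helper, and it constructs `KFold` directly for integer `cv` rather than relying on `check_cv` to carry the seed.

```python
from sklearn.model_selection import KFold, check_cv


def build_splitter(cv, shuffle=True, random_state=None):
    """Hypothetical helper: build a CV splitter from estimator arguments."""
    if isinstance(cv, int):
        # The seed only matters when shuffling, so drop it otherwise;
        # recent scikit-learn versions reject a seed with shuffle=False.
        seed = random_state if shuffle else None
        return KFold(n_splits=cv, shuffle=shuffle, random_state=seed)
    # Splitter objects and iterables go through scikit-learn's own validation.
    return check_cv(cv)


splitter = build_splitter(5, shuffle=True, random_state=42)
custom = KFold(n_splits=3)
passthrough = build_splitter(custom)
```

With this shape, `random_state` is used only when `cv` is an integer and `shuffle=True`; a splitter passed in by the user keeps its own settings.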

Related issues or pull requests

Pull Request Checklist

Added a note about the modification or contribution to the ./docs/sources/CHANGELOG.md file (if applicable)
Added appropriate unit test functions in the ./mlxtend/*/tests directories (if applicable)
Modified documentation in the corresponding Jupyter Notebook under mlxtend/docs/sources/ (if applicable)
Ran nosetests ./mlxtend -sv and made sure that all unit tests pass (for small modifications, it might be sufficient to only run the specific test file, e.g., nosetests ./mlxtend/classifier/tests/test_stacking_cv_classifier.py -sv)
Checked for style issues by running flake8 ./mlxtend

coveralls · 2019-04-29T02:32:13Z

Coverage decreased (-0.001%) to 91.546% when pulling 34ddf02 on qiagu:stacking into c338a1f on rasbt:master.

rasbt · 2019-04-29T05:35:05Z

Great point. I think this is a relatively new feature, and I didn't know it would work. This is certainly great; one small request, though:

random_state : int, RandomState instance or None, optional (default: 0)

     Controls the randomness of the cv splitter. Used when `cv` is
     integer and `shuffle=True`. New in v0.16.0.

Can we change the default to None? While I personally prefer setting random seeds explicitly, I think None is the scikit-learn default.

rasbt · 2019-04-29T06:20:38Z

mlxtend/regressor/stacking_cv_regression.py

@@ -115,7 +115,7 @@ class StackingCVRegressor(_BaseXComposition, RegressorMixin, TransformerMixin):
    def __init__(self, regressors, meta_regressor, cv=5,
                 shuffle=True, random_state=0, verbose=0,
                 refit=True, use_features_in_secondary=False,
-                 store_train_meta_features=False, n_jobs=1,
+                 store_train_meta_features=False, n_jobs=None,


I actually meant for random_state to default to random_state=None, but it's good that you caught the n_jobs=None change as well (which is another sklearn convention).

qiagu · 2019-04-29T06:59:18Z

I set the random_state default to a non-None value because shuffle defaults to True, and because it may save typing in many cases. I'm fine with None and don't have a strong preference, so I'll follow your decision. @rasbt

rasbt · 2019-04-29T14:41:53Z

I see. I think shuffle=True would still work with random_state=None -- as far as I know, random_state=None will just use the default random seed. I don't have a strong preference, but for consistency with sklearn, maybe random_state=None is the slightly better choice.
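As a quick illustration of that point, the sketch below uses scikit-learn's `KFold` directly rather than the stacking estimators. The toy array `X` and the helper `fold_test_indices` are made up for this example.

```python
import numpy as np
from sklearn.model_selection import KFold

X = np.arange(20).reshape(10, 2)  # toy data: 10 samples, 2 features


def fold_test_indices(random_state):
    """Return the test indices of each fold as a list of tuples."""
    cv = KFold(n_splits=5, shuffle=True, random_state=random_state)
    return [tuple(test_idx) for _, test_idx in cv.split(X)]


# A fixed integer seed reproduces the same folds on every call.
seeded_a = fold_test_indices(random_state=0)
seeded_b = fold_test_indices(random_state=0)

# random_state=None is accepted together with shuffle=True; the folds are
# still a valid partition but are not guaranteed to repeat between calls.
unseeded = fold_test_indices(random_state=None)
```

So None works with shuffle=True, and anyone who needs reproducible folds can still pass an integer explicitly.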

qiagu · 2019-04-29T17:14:39Z

I agree. Updated.

rasbt · 2019-04-29T18:20:48Z

That's great, happy to merge this. Thanks a lot!

qiagu added 3 commits April 28, 2019 18:32

add random_state parameter to stacking cv estimators

042fddb

update changelog and jupyter docs

7fe58b3

update changelog again

703677c

minor change

20b1e8a

rasbt reviewed Apr 29, 2019

default stacking cv random_state to None

34ddf02

rasbt merged commit ec2658c into rasbt:master Apr 29, 2019

qiagu deleted the stacking branch November 24, 2019 23:17
