[MRG] add support for lists of dictionaries to RandomizedSearchCV #14549

amueller · 2019-08-01T20:37:14Z

Follow up on #12759 with a slightly simplified interface.
This makes the API of RandomizedSearchCV a superset of GridSearchCV which makes it more convenient to use.

jnothman · 2019-08-02T06:46:07Z

This makes the API of RandomizedSearchCV a superset of GridSearchCV which makes it more convenient to use.

Awesome!

thomasjpfan

Add a what's new entry?

thomasjpfan · 2019-08-02T15:17:26Z

sklearn/model_selection/_search.py

        Dictionary with parameters names (string) as keys and distributions
        or lists of parameters to try. Distributions must provide a ``rvs``
        method for sampling (such as those from scipy.stats.distributions).
        If a list is given, it is sampled uniformly.
+        If a list of dicts is given, for each parameter, one of the dicts


This is slightly unclear. It looks like one of the dicts is first sampled uniformly, and then the parameters are sampled from that dict.
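
The two-stage reading described in this comment can be sketched in plain Python. This is an illustration only, not the code from the PR: `UniformFloat` and `sample_params` are hypothetical names, and the standard-library `random.Random` stands in for the NumPy random state used in scikit-learn.

```python
import random


class UniformFloat:
    """Hypothetical stand-in for a scipy.stats distribution: it exposes ``rvs``."""

    def __init__(self, low, high):
        self.low, self.high = low, high

    def rvs(self, random_state):
        # Draw one float between low and high using the supplied generator.
        return random_state.uniform(self.low, self.high)


def sample_params(param_distributions, n_iter, seed=0):
    """Draw n_iter parameter settings from a dict or a list of dicts."""
    rng = random.Random(seed)
    if isinstance(param_distributions, dict):
        # A single dict is treated as a one-element list of dicts.
        param_distributions = [param_distributions]
    samples = []
    for _ in range(n_iter):
        # Stage 1: pick one of the dicts uniformly at random.
        dist = rng.choice(param_distributions)
        # Stage 2: sample every parameter of the chosen dict.
        params = {}
        for name in sorted(dist):
            value = dist[name]
            if hasattr(value, "rvs"):
                params[name] = value.rvs(random_state=rng)
            else:
                # Lists are sampled uniformly.
                params[name] = rng.choice(list(value))
        samples.append(params)
    return samples


spaces = [
    {"kernel": ["linear"], "C": UniformFloat(0.1, 10.0)},
    {"kernel": ["rbf"], "C": [1, 10], "gamma": UniformFloat(0.001, 1.0)},
]
samples = sample_params(spaces, n_iter=20)
```

Each sampled setting only contains keys from the dict that was picked in the first stage, so a `linear` sample never carries a `gamma` value.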

sklearn/model_selection/_search.py

jnothman

Otherwise lgtm

jnothman · 2019-08-03T21:48:53Z

sklearn/model_selection/_search.py

            for _ in range(self.n_iter):
+                dist = self.param_distributions[
+                    rnd.randint(len(self.param_distributions))]


This is an awkwardly numpy way of expressing random.choice
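
For comparison, here is a small sketch of the two spellings using the standard-library `random` module rather than the NumPy random state in the diff. The variable names and the example search space are made up for illustration.

```python
import random

# Hypothetical search space: a list of two parameter dicts.
param_distributions = [{"C": [1, 10]}, {"gamma": [0.1, 1.0]}]

rng = random.Random(0)

# Index-then-lookup, the shape of the code under review.
by_index = param_distributions[rng.randrange(len(param_distributions))]

# Direct choice, the shape the reviewer is asking for.
by_choice = rng.choice(param_distributions)
```

Both lines pick one dict uniformly at random; the second states that intent directly.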

amueller · 2019-08-05T15:26:34Z

sklearn/model_selection/_search.py

+        all_lists = all(
+            all(not hasattr(v, "rvs") for v in dist.values())
+            for dist in self.param_distributions)
+        rng = check_random_state(self.random_state)


renamed this to be more consistent with the rest of the library
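
As a rough illustration of what the `all_lists` expression in this excerpt evaluates to, the sketch below wraps it in a hypothetical helper, `all_values_are_lists`, and uses a made-up `FakeDistribution` class in place of a scipy.stats distribution.

```python
class FakeDistribution:
    """Hypothetical stand-in exposing ``rvs``, like scipy.stats distributions."""

    def rvs(self, random_state=None):
        return 0.5


def all_values_are_lists(param_distributions):
    """Return True only if no value in any dict exposes an ``rvs`` method."""
    return all(
        all(not hasattr(v, "rvs") for v in dist.values())
        for dist in param_distributions
    )


only_lists = [{"C": [1, 10]}, {"kernel": ["rbf"], "gamma": [0.1, 1.0]}]
mixed = [{"C": [1, 10]}, {"gamma": FakeDistribution()}]

print(all_values_are_lists(only_lists))
print(all_values_are_lists(mixed))
```

A single value with an `rvs` method anywhere in the list of dicts is enough to make the check false.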

NicolasHug

Super nitpick, feel free to merge without addressing.

NicolasHug · 2019-08-05T15:35:47Z

doc/whats_new/v0.22.rst

@@ -210,6 +210,9 @@ Changelog
  plot model scalability (see learning_curve example).
  :pr:`13938` by :user:`Hadrien Reboul <H4dr1en>`.

+- |Enhancement| :class:`model_selection.RandomizedSearchCV` now accepts lists
+  of parameter distributions. :pr:`14549` by `Andreas Müller`_.


Maybe "lists of dicts to sample from multiple parameter spaces"?

I'm unconvinced ;)

NicolasHug · 2019-08-05T15:40:53Z

sklearn/model_selection/_search.py

+            for key in dist:
+                if (not isinstance(dist[key], Iterable)
+                        and not hasattr(dist[key], 'rvs')):
+                    raise TypeError('Parameter value is not iterable '


... must be an iterable or a distribution?

This is copied and pasted from ParameterGrid. I'm not sure your version is any clearer, and I think being semi-consistent between the two is good.
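
The check under discussion can be sketched as a standalone function. This is a hedged illustration, not the merged code: `validate_param_distributions` is a hypothetical name, and the error message follows the reviewer's suggested wording rather than the text in the PR.

```python
from collections.abc import Iterable


def validate_param_distributions(param_distributions):
    """Accept a dict or a list of dicts; reject values that cannot be sampled."""
    if isinstance(param_distributions, dict):
        # Normalise a single dict to a one-element list.
        param_distributions = [param_distributions]
    for dist in param_distributions:
        if not isinstance(dist, dict):
            raise TypeError(
                "Parameter distribution is not a dict ({!r})".format(dist)
            )
        for key, value in dist.items():
            # A value is usable if it can be iterated over or has ``rvs``.
            if not isinstance(value, Iterable) and not hasattr(value, "rvs"):
                raise TypeError(
                    "Parameter value for {!r} must be an iterable or a "
                    "distribution, got {!r}".format(key, value)
                )
    return param_distributions
```

Note that a string passes the `Iterable` test, so this check alone does not catch a bare string given where a list was intended.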

amueller · 2019-08-07T15:56:43Z

OH YEAH!

add support for lists of dictionaries to RandomizedSearchCV

b2be08a

thomasjpfan reviewed Aug 2, 2019


amueller added 2 commits August 2, 2019 14:30

more input validation, more tests

dfb31c6

pep8

7a6c3df

jnothman approved these changes Aug 3, 2019


use random.choice

6a33900

amueller commented Aug 5, 2019


add whatsnew

d3d2d0b

NicolasHug approved these changes Aug 5, 2019


jnothman merged commit 98e1c0f into scikit-learn:master Aug 6, 2019

amueller mentioned this pull request Aug 7, 2019

[MRG] adds list of dictionaries to RandomizedSearchCV #12759

Closed

thomasjpfan mentioned this pull request Aug 14, 2019

RandomizedSearchCV does not work in a Pipeline when evaluating alternative classifiers #14661

Closed

thomasjpfan mentioned this pull request Oct 26, 2019

Use dict or list of dictionaries in RandomizedSearchCV #12728

Closed

mfeurer mentioned this pull request Nov 8, 2019

Simpler interface for Random Search over MLPClassifier number of layer and their sizes #15568

Closed

justmarkham mentioned this pull request Aug 1, 2020

Unexpected behavior when passing multiple parameter sets to RandomizedSearchCV #18057

Open
