[MRG+1] Issue #8173 - Passing n_neighbors to compute MI #8181
Conversation
@jnothman Do you see a need for additional unit test(s), or is this enough?
Just to be sure, can you please remove the default n_neighbors from _compute_mi if this harms nothing?
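For context, a minimal sketch of what that suggestion might look like, assuming the dispatch structure and helper names (`_compute_mi_cc`, `_compute_mi_cd`) used in sklearn/feature_selection/mutual_info_.py at the time; this is an illustration, not the merged diff:

```python
# Sketch of the suggested change: drop the default so every caller must pass
# n_neighbors explicitly, and a forgotten argument fails loudly instead of
# silently falling back to 3. Helper import paths assume the module layout
# at the time of this PR (they are private API).
from sklearn.metrics import mutual_info_score
from sklearn.feature_selection.mutual_info_ import _compute_mi_cc, _compute_mi_cd


def _compute_mi(x, y, x_discrete, y_discrete, n_neighbors):
    if x_discrete and y_discrete:
        return mutual_info_score(x, y)  # contingency-based; k is irrelevant
    elif x_discrete:
        return _compute_mi_cd(y, x, n_neighbors)
    elif y_discrete:
        return _compute_mi_cd(x, y, n_neighbors)
    else:
        return _compute_mi_cc(x, y, n_neighbors)
```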
```diff
@@ -5,7 +5,8 @@
 from scipy.sparse import csr_matrix

 from sklearn.utils.testing import (assert_array_equal, assert_almost_equal,
-                                   assert_false, assert_raises, assert_equal)
+                                   assert_false, assert_raises, assert_equal,
+                                   assert_array_almost_equal)
```
I think the preference is for assert_allclose
I did it, and then I had a second thought.
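For illustration, the practical difference between the two helpers (both come from numpy.testing): assert_array_almost_equal checks a fixed number of decimal places, while assert_allclose checks a relative tolerance that scales with the magnitude of the values. The array below reuses the expected values from the hunk underneath.

```python
import numpy as np
from numpy.testing import assert_allclose, assert_array_almost_equal

mi = np.array([0.06987399, 0.03197151, 0.21946924])

# Absolute criterion: |desired - actual| < 1.5 * 10**-decimal for each entry.
assert_array_almost_equal(mi, [0.0698740, 0.0319715, 0.2194692], decimal=6)

# Relative criterion: |desired - actual| <= rtol * |desired|, so small and
# large entries are held to proportionally comparable precision.
assert_allclose(mi, [0.06987399, 0.03197151, 0.21946924], rtol=1e-6)
```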
```python
assert_allclose(mi, [0.06987399, 0.03197151, 0.21946924], rtol=1e-6)
mi_7 = mutual_info_classif(X, y, discrete_features=[2], n_neighbors=7,
                           random_state=0)
assert_allclose(mi_7, [0.0735522, 0.0343685, 0.2194692], rtol=1e-5)
```
I don't really like tests that hardcode numerical values that hold only for randomly generated data with a fixed seed. Is there a better way to test the impact of n_neighbors without hardcoding the values?
I think checking that the MI changes with n_neighbors would be fine?
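A sketch of that idea, with synthetic data standing in for the test's fixture (the real test generates X and y differently); the point is that only qualitative properties are asserted, never exact values:

```python
import numpy as np
from numpy.testing import assert_allclose
from sklearn.feature_selection import mutual_info_classif

# Stand-in data: two continuous features and one discrete feature (column 2).
rng = np.random.RandomState(0)
X = rng.rand(100, 3)
X[:, 2] = (X[:, 2] > 0.5).astype(float)
y = (X[:, 0] + X[:, 2] > 1).astype(int)

mi_3 = mutual_info_classif(X, y, discrete_features=[2], n_neighbors=3,
                           random_state=0)
mi_7 = mutual_info_classif(X, y, discrete_features=[2], n_neighbors=7,
                           random_state=0)

# The k-NN based estimates for the continuous columns should respond to
# n_neighbors, while the discrete column uses a contingency-based estimate
# that does not depend on it.
assert not np.allclose(mi_3[:2], mi_7[:2])
assert_allclose(mi_3[2], mi_7[2])
```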
```python
for n_neighbors in [5, 7, 9]:
    mi_nn = mutual_info_classif(X, y, discrete_features=[2],
                                n_neighbors=n_neighbors, random_state=0)
    # Check that the continuous values have a higher MI
```
I think it would help to say "with greater n_neighbors"
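The hunk above is cut off before its assertions. A sketch of how the loop plausibly continues, with the comment reworded as the reviewer suggests; the assertions are illustrative rather than the merged code, and mi is assumed to hold the baseline result computed with the default n_neighbors. The direction of the inequalities matches the hardcoded values earlier in the thread, where mi_7 exceeds mi on the continuous features and matches it on the discrete one.

```python
# assert_greater would also need importing from sklearn.utils.testing.
for n_neighbors in [5, 7, 9]:
    mi_nn = mutual_info_classif(X, y, discrete_features=[2],
                                n_neighbors=n_neighbors, random_state=0)
    # Check that the continuous values have a higher MI with greater
    # n_neighbors, relative to the baseline mi computed with the default.
    assert_greater(mi_nn[0], mi[0])
    assert_greater(mi_nn[1], mi[1])
    # n_neighbors has no effect on the discrete feature's estimate.
    assert_equal(mi_nn[2], mi[2])
```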
LGTM
@ogrisel for a second review
LGTM, thanks @glemaitre
Reference Issue
Fixes #8173
What does this implement/fix? Explain your changes.
The parameter n_neighbors is now passed to the function _compute_mi in _estimate_mi. Previously this parameter was not given to _compute_mi, so the user could not set it as indicated in the documentation.
Any other comments?
A single test has been added for the classification case. The computation of the mutual information itself was in fact already correct, since it was already tested for different numbers of neighbours.
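With the fix in place, the documented behaviour can be observed directly: changing n_neighbors changes the estimate for continuous features. A quick check on synthetic data (illustrative only):

```python
import numpy as np
from sklearn.feature_selection import mutual_info_classif

rng = np.random.RandomState(0)
X = rng.rand(200, 2)             # two continuous features
y = (X[:, 0] > 0.5).astype(int)  # target depends on the first feature

for k in (3, 7):
    # Before this fix, both calls returned identical values because
    # n_neighbors never reached the estimator.
    print(k, mutual_info_classif(X, y, n_neighbors=k, random_state=0))
```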