[MRG] FIX allow non-finite target values in TransformedTargetRegressor #11349

vahidbas · 2018-06-24T20:07:08Z

Reference Issues/PRs

Fixes #11339
simply changed force_all_finite to 'False'

Update:
turn off all finiteness checks

What does this implement/fix? Explain your changes.

Allow target values to have missing values in TransformedTargetRegressor

jnothman · 2018-06-24T21:38:54Z

CI failing. Let me know if you need help with it

jnothman · 2018-06-24T21:38:13Z

sklearn/compose/_target.py

@@ -162,7 +162,7 @@ def fit(self, X, y, sample_weight=None):
        -------
        self : object
        """
-        y = check_array(y, accept_sparse=False, force_all_finite=True,
+        y = check_array(y, accept_sparse=False, force_all_finite='allow-nan',


I'd consider going further to switch off all finiteness validation. Are there cases where it would be risky to pass an inf through unchecked?

I guess it would be safe to leave both INF and NAN checks to the actual transformer. I'll give it a go, let's check if it passes the tests.

glemaitre · 2018-06-25T15:24:17Z

sklearn/compose/tests/test_target.py

+    )
+
+    estimator.fit(X, y)
+    estimator.predict(X)


You should check that your output contains NaN where it should.

estimator.predict(X) is actually unnecessary. The test only to assert that estimator.fit(X, y) doesn't raise. In this case, predict is expected to return no NaN. should I make it explicit or remove predict?

glemaitre · 2018-06-25T15:24:30Z

sklearn/compose/tests/test_target.py

+
+    X, y = datasets.load_linnerud(return_X_y=True)
+
+    # put some NaN in y


remove this comment

jnothman · 2018-06-25T21:46:14Z

It's worth checking predict also. Ensuring the output does not contain NaN doesn't hurt, but I don't think it is necessary

jnothman · 2018-06-25T23:33:01Z

Travis is reporting flake8 errors.

jnothman

This LGTM, thanks!

I'd be interested if you have other comments/critiques of TransformedTargetRegressor design before we release it.

vahidbas · 2018-06-26T09:15:32Z

@jnothman Thanks for help! This will resolve my issue for now. My next step is to evaluate it with model_selection.* tools for joint hyperparameter tuning, I'll post any new finding.

jnothman · 2019-02-11T07:30:17Z

@glemaitre please review changes?

jnothman · 2019-02-11T07:31:02Z

@vahidbas
Please add an entry to the change log at doc/whats_new/v0.21.rst. Like the other entries there, please reference this pull request with :issue: and credit yourself (and other contributors if applicable) with :user:

glemaitre · 2019-02-11T10:18:06Z

sklearn/compose/_target.py

@@ -162,7 +162,7 @@ def fit(self, X, y, sample_weight=None):
        -------
        self : object
        """
-        y = check_array(y, accept_sparse=False, force_all_finite=True,
+        y = check_array(y, accept_sparse=False, force_all_finite=False,


It should not be 'allow-nan' instead?

@glemaitre all checks are turned off as suggested by @jnothman here

OK fine with me then.

glemaitre · 2019-02-11T10:29:50Z

I am also wondering if we should avoid the validate=True there:

scikit-learn/sklearn/compose/_target.py

Lines 132 to 134 in af842d3

    
           self.transformer_ = FunctionTransformer( 
        
               func=self.func, inverse_func=self.inverse_func, validate=True, 
        
               check_inverse=self.check_inverse)

As mentioned by @shreyasramachandran, if you pass a func and inverse_func, you will create such a transformer which will not accept the NaN by default.

@jnothman Which behaviour do you think is the best by default?

jnothman

I'm now a bit confused about how this is working with that validate=True...?

glemaitre · 2019-02-11T12:15:28Z

I'm now a bit confused about how this is working with that validate=True...?

validate=True performs

check_array(X, accept_sparse=self.accept_sparse)

So it does not let pass the Nan

jnothman · 2019-02-11T22:17:12Z

So if validate=True doesn't pass the nan, why does it help for us to change force_all_finite here?

jnothman · 2019-02-11T22:17:58Z

It's because we're only handling the case here where a transformer, rather than a function, is provided. Yes, I think we should handle both cases

jnothman

So this currently works for the case of passing in a transformer, but not when passing in func and inverse_func?

vahidbas added 3 commits June 24, 2018 21:36

[ADD] test to check allow-nan

fea63ff

[FIX] allow missing values in the target

2aede40

[FIX] fit also works multi output, fix doc

76bb7da

jnothman reviewed Jun 24, 2018

View reviewed changes

glemaitre requested changes Jun 25, 2018

View reviewed changes

vahidbas added 3 commits June 25, 2018 22:42

[FIX] skip check_supervised_y_no_nan for TransfomTargetRegressor

9a7a199

remove comment

491df30

[FIX] turn off all finiteness validation

3e99ac3

[ENH] truncate line, resolving flake8 issue

0b3b6e0

jnothman approved these changes Jun 26, 2018

View reviewed changes

glemaitre changed the title ~~[MRG] FIX allow NaNs for the target values in TransformedTargetRegressor #11339~~ [MRG] FIX allow NaNs for the target values in TransformedTargetRegressor Jun 26, 2018

glemaitre changed the title ~~[MRG] FIX allow NaNs for the target values in TransformedTargetRegressor~~ [MRG] FIX allow non-finite target values in TransformedTargetRegressor Jun 26, 2018

jeremiedbb mentioned this pull request Aug 10, 2018

Allow NaN in y of TransformedTargetRegressor #11794

Closed

jnothman mentioned this pull request Feb 11, 2019

Allow NaNs for the target values in TransformedTargetRegressor #11339

Open

glemaitre reviewed Feb 11, 2019

View reviewed changes

jnothman reviewed Feb 11, 2019

View reviewed changes

amueller added the Needs work label Aug 6, 2019

github-actions bot added module:compose module:utils labels Mar 2, 2020

jnothman reviewed Aug 25, 2020

View reviewed changes

cmarmo added help wanted Stalled labels Aug 26, 2020

Base automatically changed from master to main January 22, 2021 10:50

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[MRG] FIX allow non-finite target values in TransformedTargetRegressor #11349

[MRG] FIX allow non-finite target values in TransformedTargetRegressor #11349

vahidbas commented Jun 24, 2018 •

edited by glemaitre

jnothman commented Jun 24, 2018

jnothman Jun 24, 2018

vahidbas Jun 25, 2018

glemaitre Jun 25, 2018

vahidbas Jun 25, 2018

glemaitre Jun 25, 2018

vahidbas Jun 25, 2018

jnothman commented Jun 25, 2018

jnothman commented Jun 25, 2018 via email

jnothman left a comment

vahidbas commented Jun 26, 2018

jnothman commented Feb 11, 2019

jnothman commented Feb 11, 2019

glemaitre Feb 11, 2019

vahidbas Feb 11, 2019

glemaitre Feb 11, 2019

glemaitre commented Feb 11, 2019

jnothman left a comment

glemaitre commented Feb 11, 2019

jnothman commented Feb 11, 2019 via email

jnothman commented Feb 11, 2019

jnothman left a comment


		X, y = datasets.load_linnerud(return_X_y=True)

		# put some NaN in y

[MRG] FIX allow non-finite target values in TransformedTargetRegressor #11349

Are you sure you want to change the base?

[MRG] FIX allow non-finite target values in TransformedTargetRegressor #11349

Conversation

vahidbas commented Jun 24, 2018 • edited by glemaitre

Reference Issues/PRs

What does this implement/fix? Explain your changes.

jnothman commented Jun 24, 2018

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

jnothman commented Jun 25, 2018

jnothman commented Jun 25, 2018 via email

jnothman left a comment

Choose a reason for hiding this comment

vahidbas commented Jun 26, 2018

jnothman commented Feb 11, 2019

jnothman commented Feb 11, 2019

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

glemaitre commented Feb 11, 2019

jnothman left a comment

Choose a reason for hiding this comment

glemaitre commented Feb 11, 2019

jnothman commented Feb 11, 2019 via email

jnothman commented Feb 11, 2019

jnothman left a comment

Choose a reason for hiding this comment

vahidbas commented Jun 24, 2018 •

edited by glemaitre