
WIP: extend Pipeline transform and inverse_transform behavior #2561

Closed

Conversation

@schwarty

Pipeline transform and inverse_transform now skip the last step if it lacks the corresponding method.

The extension is useful to:

  • extract the features produced just before the last step of the pipeline, for debugging purposes
  • inverse_transform the coef_ attribute of a linear classifier, for example, to map it back to the input feature space (a sketch follows below)

Design discussion here: Issue #2562
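
A minimal sketch of both use cases, assuming the patched behavior (the estimator names and data below are illustrative, not taken from the PR):

# Illustrative sketch only; assumes the patched Pipeline behavior from this
# PR, with a final step that has neither transform nor inverse_transform
# (true of LogisticRegression in modern scikit-learn).
from sklearn.datasets import make_classification
from sklearn.feature_selection import SelectKBest
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import Pipeline

X, y = make_classification(n_features=20, random_state=0)
pipe = Pipeline([('sel', SelectKBest(k=5)),
                 ('clf', LogisticRegression())]).fit(X, y)

# Use case 1: the final step lacks transform, so it is skipped and we get
# the features as seen by the classifier, shape (n_samples, 5).
X_debug = pipe.transform(X)

# Use case 2: the final step lacks inverse_transform, so it is skipped and
# the coefficients are mapped back onto the 20 input features.
coef = pipe.named_steps['clf'].coef_        # shape (1, 5)
coef_full = pipe.inverse_transform(coef)    # shape (1, 20)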

@coveralls

Coverage Status

Coverage remained the same when pulling d9260f0 on schwarty:pipeline_inverse_transform into 03926cc on scikit-learn:master.

# Check whether the last step implements an inverse_transform method;
# if not, leave it out of the (reversed) list of steps to invert.
inverse_steps = self.steps[::-1]
if not hasattr(self.steps[-1][-1], 'inverse_transform'):
    inverse_steps = self.steps[:-1][::-1]
Member

Could you please edit the docstring of the inverse_transform method to explain that it works even if the last step does not support inverse_transform, and give the motivation for this feature, as was done for the forward case?
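
For readers of the diff excerpt above, here is a hedged reconstruction of how inverse_steps might be consumed by the rest of the patched method (a guess for illustration, not the PR's actual code):

def inverse_transform(self, X):
    # Skip the final step if it cannot invert: the behavior this PR adds.
    inverse_steps = self.steps[::-1]
    if not hasattr(self.steps[-1][-1], 'inverse_transform'):
        inverse_steps = self.steps[:-1][::-1]
    # Apply the remaining steps' inverse_transform in reverse order.
    for name, step in inverse_steps:
        X = step.inverse_transform(X)
    return X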

@ogrisel
Member

ogrisel commented Oct 30, 2013

Looks good to me. @schwarty please add a doc/whats_new.rst entry for this.

@schwarty
Author

Done. @ogrisel and @GaelVaroquaux what do you think?

@coveralls

Coverage Status

Coverage remained the same when pulling 1906989 on schwarty:pipeline_inverse_transform into 03926cc on scikit-learn:master.

@@ -60,6 +60,10 @@ Changelog
- Added :class:`linear_model.RANSACRegressor` meta-estimator for the robust
  fitting of regression models. By Johannes Schönberger.

- Extended transform and inverse_transform methods from
Member

Please add double back-ticks around code words like transform and inverse_transform.

@ogrisel
Member

ogrisel commented Oct 31, 2013

Apart from the nitpick comment, LGTM. +1 for merge.

@GaelVaroquaux
Member

As I just told @schwarty orally, I am not in favor of merging this PR, although I had backed the idea during our first discussions. I don't like the automatic change of behavior depending on subtle properties of the final learner. For instance, changing the code to use an LDA instead of a LogisticRegression would have really strange consequences with this PR.

@ogrisel
Member

ogrisel commented Oct 31, 2013

Do you have an alternative API to propose? Maybe a new constructor param?

I don't see your point with LDA. How would this PR change the behavior in this case? It is my understanding that only cases that would have previously raised an exception are now supported.

@GaelVaroquaux
Member

> I don't see your point with LDA. How would this PR change the behavior
> in this case? It is my understanding that only cases that would have
> previously raised an exception are now supported.

Yes: with a LogisticRegression it would have raised an error, but not
with LDA. Now both work, but do something different. The difference is
subtle and can easily fool someone.
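
A sketch of that subtlety, assuming the patched behavior (LDA is spelled LinearDiscriminantAnalysis in modern scikit-learn; the data and step names are illustrative):

from sklearn.datasets import make_classification
from sklearn.discriminant_analysis import LinearDiscriminantAnalysis
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import StandardScaler

X, y = make_classification(n_features=20, n_classes=3, n_informative=6,
                           random_state=0)

for clf in (LogisticRegression(), LinearDiscriminantAnalysis()):
    pipe = Pipeline([('scale', StandardScaler()), ('clf', clf)]).fit(X, y)
    Xt = pipe.transform(X)
    # LogisticRegression defines no transform, so the patched Pipeline
    # skips it: Xt keeps all 20 scaled columns. LDA does define transform,
    # so it is applied: Xt has n_classes - 1 = 2 columns. Swapping the
    # classifier silently changes what transform returns.
    print(type(clf).__name__, Xt.shape)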

@ogrisel
Member

ogrisel commented Oct 31, 2013

Hmm, OK, I see your point. Still, I think @schwarty's use cases are very valid and are not addressed by the current implementation. Maybe @schwarty, you should put this PR back to WIP and raise the point on the mailing list to brainstorm alternative design ideas.

@GaelVaroquaux
Member

> Still, I think @schwarty's use cases are very valid and are not
> addressed by the current implementation.

I agree.

> Maybe @schwarty, you should put this PR back to WIP and raise the point
> on the mailing list to brainstorm alternative design ideas.

He has opened an issue.

@jnothman
Member

jnothman commented Nov 1, 2013

> I am not in favor of merging this PR, although I had backed the idea
> during our first discussions. I don't like the automatic change of
> behavior depending on subtle properties of the final learner.

I agree with @GaelVaroquaux. It's a case of implicit behaviour that could bite you. What I wouldn't mind (though perhaps it's overkill and will create API problems) is for Pipeline to support slicing. I also think it deserves a getter for the final estimator (.steps[-1][1]) just to increase readability:

# Proposed API (sketch): slicing a Pipeline returns a sub-pipeline and
# final_estimator is a getter for steps[-1][1]; neither existed then.
p = Pipeline([('sel', SelectKBest()), ('clf', LogisticRegression())])
p[:-1].inverse_transform(p.final_estimator.coef_)

@jnothman
Member

jnothman commented Nov 1, 2013

Or, just as indexing a list with an integer returns an element while a slice returns a list, you could have integer indexing return just the estimator:

# Under the same proposed API: p[-1] is the final estimator itself.
p[:-1].inverse_transform(p[-1].coef_)
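
A hedged sketch of the indexing @jnothman proposes (this was not the Pipeline API at the time, though scikit-learn later grew similar slicing support):

from sklearn.pipeline import Pipeline

class SliceablePipeline(Pipeline):
    """Pipeline with the list-like indexing proposed above (sketch only)."""

    def __getitem__(self, ind):
        if isinstance(ind, slice):
            # A slice returns a sub-pipeline, like slicing a list.
            return Pipeline(self.steps[ind])
        # An integer returns the estimator itself, like list indexing.
        return self.steps[ind][1]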

@GaelVaroquaux
Member

I think that there is a consensus against merging this PR. @schwarty: is it OK with you if I close it?

@amueller
Member

Closing, as @schwarty didn't object within the last 3 years.

@GaelVaroquaux
Member

GaelVaroquaux commented Oct 25, 2016 via email

@jnothman
Member

I suppose it's good when people making more money still drink beers with you. :)
