instantiating a ColumnTransformer with already fitted transformers #13614

sebastiangonsal · 2019-04-10T19:29:05Z

Can I instantiate a ColumnTransformer with already fitted transformers? I get an error right now saying the fit wasn't called on ColumnTransformer. I don't want to call fit on the transformer since I am using already fitted transformers and just want to use ColumnTransformer to aggregate the columns into a matrix.
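For reference, a minimal sketch of how the reported error can be reproduced. The data and the choice of StandardScaler are assumptions for illustration, not taken from the original report; the point is only that transform on a ColumnTransformer checks the fitted state of the ColumnTransformer itself, not of the transformers passed to it.

```python
import numpy as np
from sklearn.compose import ColumnTransformer
from sklearn.exceptions import NotFittedError
from sklearn.preprocessing import StandardScaler

# Illustrative data (assumption): two numeric columns.
X = np.array([[1.0, 10.0], [3.0, 30.0]])

# The inner transformer is fitted ahead of time...
scaler = StandardScaler().fit(X[:, [0]])

# ...but the ColumnTransformer wrapping it has never been fitted.
ct = ColumnTransformer([("scale", scaler, [0])])

try:
    ct.transform(X)
    raised = False
except NotFittedError as exc:
    raised = True
    print(type(exc).__name__, exc)
```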


jnothman · 2019-04-11T00:29:52Z

No, you can't. What use case do you have, seeing as the applicable columns of the input are only identified when fitting the column transformer?

thomasjpfan · 2019-10-26T03:56:13Z

Recently spoke to someone who is running into this issue. He fitted a transformer on unlabeled data and then created a metaestimator to wrap this transformer in order to freeze it (by overriding fit to do nothing). Next, he placed it into a ColumnTransformer as one of the transformers. Since ColumnTransformer clones its transformers, the transformer that he wanted to freeze lost its trained state.
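The cloning behaviour can be seen with a small sketch. The data and the use of a plain StandardScaler (rather than the wrapper described above) are assumptions for illustration: clone returns an unfitted copy, and ColumnTransformer.fit refits such a copy on whatever data it receives, leaving the original object untouched.

```python
import numpy as np
from sklearn.base import clone
from sklearn.compose import ColumnTransformer
from sklearn.preprocessing import StandardScaler

# Illustrative data (assumption): the "unlabeled" set and a later training set.
X_unlabeled = np.array([[0.0, 1.0], [2.0, 1.0], [4.0, 1.0]])
X_train = np.array([[100.0, 5.0], [200.0, 6.0]])

# Fit ahead of time on the first column of the unlabeled data (mean is 2.0).
scaler = StandardScaler().fit(X_unlabeled[:, [0]])

# clone() copies constructor parameters only, so learned attributes are gone.
copy_is_fitted = hasattr(clone(scaler), "mean_")

ct = ColumnTransformer([("scale", scaler, [0])], remainder="passthrough")
ct.fit(X_train)

# The transformer actually used is a fresh clone refitted on X_train.
inner = ct.named_transformers_["scale"]
print(inner is scaler)              # a different object
print(scaler.mean_, inner.mean_)    # original statistics vs. refitted ones
```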

He also mentioned that this worked in Pipeline and that it's weird it didn't work in ColumnTransformer. (No cloning in Pipeline.)
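A rough sketch of the contrast described here. FrozenTransformer is a hypothetical wrapper written for this example, not a scikit-learn class and not the exact code from the conversation above; the data is likewise made up. With the default memory=None, Pipeline fits the step object it was given, so the no-op fit keeps the wrapped state, whereas a clone of the wrapper carries an unfitted inner transformer.

```python
import numpy as np
from sklearn.base import BaseEstimator, TransformerMixin, clone
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import MinMaxScaler, StandardScaler


class FrozenTransformer(TransformerMixin, BaseEstimator):
    """Hypothetical wrapper: fit does nothing, transform delegates."""

    def __init__(self, transformer):
        self.transformer = transformer

    def fit(self, X, y=None):
        # Deliberately leave the wrapped, already fitted transformer alone.
        return self

    def transform(self, X):
        return self.transformer.transform(X)


# Illustrative data (assumption).
X_unlabeled = np.array([[0.0], [2.0], [4.0]])
X_train = np.array([[10.0], [20.0], [30.0]])

frozen = FrozenTransformer(StandardScaler().fit(X_unlabeled))

# Pipeline (memory=None) fits the very object it was handed.
pipe = Pipeline([("frozen", frozen), ("minmax", MinMaxScaler())])
pipe.fit(X_train)

same_object = pipe.named_steps["frozen"] is frozen
kept_mean = float(frozen.transformer.mean_[0])  # still from X_unlabeled

# Cloning the wrapper also clones the inner transformer, dropping its state.
clone_has_state = hasattr(clone(frozen).transformer, "mean_")
print(same_object, kept_mean, clone_has_state)
```

Placing the same wrapper in a ColumnTransformer would go through clone first, which is where the trained state is lost.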

jnothman · 2019-10-26T11:20:50Z

Yes, cloning is precisely why freezing needs a specialised solution. It's very hard for a user unfamiliar with scikit-learn internals to fix.

ribonucleic · 2020-02-14T08:51:18Z

I have another use case where I want to fit two target encoders not on the target label but on other labels (e.g. email domains we get many clicks from, whereas the final target is a user buying). So basically I have 3 y vectors. Fitting has to happen outside of column transformer, then the fitted transformers would be passed to column transformer.
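One possible workaround, sketched under assumptions: it is not proposed anywhere in this thread, and StandardScaler stands in for the target encoders mentioned above. The idea is to pass the bound transform method of an already fitted transformer to FunctionTransformer. clone deep-copies parameters that are not estimators, so the copy made by ColumnTransformer should still carry the fitted state.

```python
import numpy as np
from sklearn.compose import ColumnTransformer
from sklearn.preprocessing import FunctionTransformer, StandardScaler

# Illustrative data (assumption): fit on one dataset, apply within another.
X_other = np.array([[0.0], [2.0], [4.0]])
X = np.array([[2.0, 7.0], [4.0, 9.0]])

# Fitted outside the ColumnTransformer (mean is 2.0).
scaler = StandardScaler().fit(X_other)

# FunctionTransformer has nothing to learn; it just calls the given function.
frozen = FunctionTransformer(scaler.transform)

ct = ColumnTransformer(
    [("frozen_scale", frozen, [0])],
    remainder="passthrough",
)
out = ct.fit_transform(X)

# The deep-copied bound method still points at a fitted scaler.
inner_mean = float(ct.named_transformers_["frozen_scale"].func.__self__.mean_[0])
print(out)
print(inner_mean)
```

The first column is scaled with the statistics learned from X_other, not from X, because fitting the ColumnTransformer never calls fit on the scaler.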

Great hint about "clone"! Helps to find a workaround.

jnothman · 2020-05-05T23:19:19Z

Why not pass categories to OneHotEncoder?
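A minimal sketch of this suggestion, with made-up domain names as the assumed category list. When categories is given explicitly, the encoding is determined by that list rather than by the data seen in fit, so refitting inside ColumnTransformer does not change the output columns.

```python
import numpy as np
from sklearn.compose import ColumnTransformer
from sklearn.preprocessing import OneHotEncoder

# Hypothetical, predetermined category list (assumption for illustration).
known_domains = ["a.com", "b.org", "c.net"]

encoder = OneHotEncoder(categories=[known_domains], handle_unknown="ignore")

# sparse_threshold=0 asks ColumnTransformer to always return a dense array.
ct = ColumnTransformer([("domain", encoder, [0])], sparse_threshold=0)

# The training data only contains two of the three known domains.
X_train = np.array([["b.org"], ["a.com"]], dtype=object)
ct.fit(X_train)

# "c.net" was never seen in fit but is in the list; "zzz.io" is unknown.
X_new = np.array([["c.net"], ["zzz.io"]], dtype=object)
dense = ct.transform(X_new)
print(dense)
```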

sophieatalt · 2022-09-29T23:59:57Z

Was anyone able to find a solution to this?

thomasjpfan added the API label Oct 26, 2019
