Multiple Transforms for Multiple Columns #253

adiv5 · 2022-02-23T10:51:55Z

Hello,

I wanted to know is there any way to do multiple transforms on multiple columns , treating each one seperately.

I was able to implement it using Sklearn's ColumnTransformer as follows:


ct = ColumnTransformer(
    [(
        'numeric',
        Pipeline([
            ('handle_na',NAHandler(is_train=True,nan_cols=[])),
            ('standardize',StandardScaler()),
            ('PCA',PCA(n_components=4))
        ]),
            ['col1','col2','col3','col4']
    
    ),
  )],
  remainder='passthrough'
)

However SKlearn pandas' documentation doesnt point me to something like this.
I can see there are 2 sections --- one for single column , multiple transforms and other for multiple colums, single transform.

I couldnt see multiple cols multiple transforms

for now i am able to do what i intend by writing transforms for each and every column seperately . i.e


mapper = DataFrameMapper(
        [
                (
                        ['col1'],
                        [NAHandler(is_train=True,nan_cols=[]),StandardScaler(),PCA(n_components=4)]
                ),
                (
                        ['col2'],
                        [NAHandler(is_train=True,nan_cols=[]),StandardScaler(),PCA(n_components=4)]
                )

               ........
        ],
        input_df=True,
        df_out=True,
        default=None
        )

But what i was actually looking for is ColumnTransformer- like usage .

something like this :

mapper = DataFrameMapper(
        [
                (
                        ['col1','col2','col3','col4'],
                        [NAHandler(is_train=True,nan_cols=[]),StandardScaler(),PCA(n_components=4)]
                )
        ],
        input_df=True,
        df_out=True,
        default=None
        )

will such functionality be supported in upcoming builds? Can be very helpful!

The text was updated successfully, but these errors were encountered:

hu-minghao · 2022-02-23T10:52:23Z

你好，已收到，谢谢。

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Multiple Transforms for Multiple Columns #253

Multiple Transforms for Multiple Columns #253

adiv5 commented Feb 23, 2022

hu-minghao commented Feb 23, 2022 via email

Multiple Transforms for Multiple Columns #253

Multiple Transforms for Multiple Columns #253

Comments

adiv5 commented Feb 23, 2022

hu-minghao commented Feb 23, 2022 via email