fix cat encode #50

eromoe · 2022-09-07T05:36:34Z

No description provided.

eromoe · 2022-09-07T05:36:53Z

featurewiz/my_encoders.py

@@ -56,7 +56,6 @@ def __init__(self):
    def fit(self,testx, y=None):
        ### Do not change this since Rare class combiner requires this test ##
        if isinstance(testx, tuple):
-            y = testx[1]


This line is not needed in fit.

AutoViML · 2022-09-07

You have made a mistake in the code below:

test_result = MLB.fit_transform(train[everycol])

Instead, it should be:

test_result = MLB.transform(test[everycol])

Do you see the difference?

eromoe · 2022-09-07

Right, my mistake. It is now fixed.
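
The distinction discussed above matters because refitting an encoder on a different dataset can assign different codes to the same category. A minimal sketch, assuming a hypothetical pure-Python `TinyLabelEncoder` as a stand-in for `MLB` (it is not the encoder defined in featurewiz/my_encoders.py):

```python
class TinyLabelEncoder:
    """Hypothetical stand-in for an encoder with fit/transform/fit_transform."""

    def fit(self, values):
        # Learn one integer code per distinct category, in sorted order.
        self.mapping_ = {cat: code for code, cat in enumerate(sorted(set(values)))}
        return self

    def transform(self, values):
        # Reuse the learned mapping; unseen categories become -1.
        return [self.mapping_.get(v, -1) for v in values]

    def fit_transform(self, values):
        return self.fit(values).transform(values)


train_col = ["a", "b", "c", "a"]
test_col = ["c", "a"]

enc = TinyLabelEncoder()
train_codes = enc.fit_transform(train_col)  # fit once, on train only
test_codes = enc.transform(test_col)        # reuse the train mapping

# Refitting on test relearns the mapping, so "c" gets a different code.
refit_codes = TinyLabelEncoder().fit_transform(test_col)

print(train_codes)  # [0, 1, 2, 0]
print(test_codes)   # [2, 0]
print(refit_codes)  # [1, 0]
```

With `transform`, "c" keeps the code 2 it received during training; after a refit on the test column it becomes 1, which a model trained on the original codes would misread.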

eromoe · 2022-09-07T10:13:54Z

@AutoViML
I haven't looked into why the Rare class combiner needs the label encoder to return a tuple. Could you explain?

If the tuple return is only used for something related to the Rare class combiner, don't you think two separate label encoders would be more correct?
For example:

One that fits a single string array and returns one transformed array
Another that fits (string array, y) and returns (transformed array, y)
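
The split suggested above could look roughly like this. It is only a sketch: the class names `ArrayLabelEncoder` and `TupleLabelEncoder` are hypothetical and do not exist in featurewiz, and the encoding itself is a plain sorted-category mapping chosen for illustration.

```python
class ArrayLabelEncoder:
    """Hypothetical: fits one string array, returns one transformed array."""

    def fit(self, x, y=None):
        # Map each distinct category to an integer code, in sorted order.
        self.mapping_ = {cat: code for code, cat in enumerate(sorted(set(x)))}
        return self

    def transform(self, x):
        # Unseen categories are encoded as -1.
        return [self.mapping_.get(v, -1) for v in x]


class TupleLabelEncoder:
    """Hypothetical: fits (string array, y), returns (transformed array, y)."""

    def __init__(self):
        self.inner_ = ArrayLabelEncoder()

    def fit(self, xy):
        x, _y = xy  # y is carried along but not used for fitting
        self.inner_.fit(x)
        return self

    def transform(self, xy):
        x, y = xy
        return self.inner_.transform(x), y


x = ["red", "blue", "red"]
y = [1, 0, 1]

plain = ArrayLabelEncoder().fit(x).transform(x)
paired = TupleLabelEncoder().fit((x, y)).transform((x, y))

print(plain)   # [1, 0, 1]
print(paired)  # ([1, 0, 1], [1, 0, 1])
```

Keeping the tuple handling in a wrapper means the plain encoder never needs an `isinstance(testx, tuple)` check.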

eromoe · 2022-09-07T10:27:20Z

@AutoViML
I think you are mainly working on Kaggle competitions?

Let's think about OOT (out-of-time) testing. We usually have three datasets:

train (data from 2022.09.01 to 2022.09.05)
test (data from 2022.09.06)
production (data from today, 2022.09.07)

The transformer is fitted (fit_transform) on train only, then applied to test for validation.

Actually, there is no need to accept both train and test as inputs to one function.
A single input is enough:

def FE_convert_all_object_columns_to_numeric(train, features=[])

Just call the same function again when new data comes in. Or do you have another consideration?
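
One possible reading of the OOT workflow above, as a sketch only: the helpers `fit_object_columns` and `apply_object_columns` are hypothetical names, not featurewiz functions, and the sample rows are made up. The fitted mappings are kept separate from the data so that later datasets are encoded with the train mapping rather than refitted.

```python
from datetime import date


def fit_object_columns(rows, features):
    """Learn a category -> code mapping per feature from one dataset (train)."""
    mappings = {}
    for feature in features:
        categories = sorted({row[feature] for row in rows})
        mappings[feature] = {cat: code for code, cat in enumerate(categories)}
    return mappings


def apply_object_columns(rows, mappings):
    """Encode any dataset with stored mappings; unseen categories become -1."""
    encoded = []
    for row in rows:
        new_row = dict(row)
        for feature, mapping in mappings.items():
            new_row[feature] = mapping.get(row[feature], -1)
        encoded.append(new_row)
    return encoded


# Made-up rows covering the three periods from the comment.
rows = [
    {"day": date(2022, 9, 1), "city": "paris"},
    {"day": date(2022, 9, 5), "city": "tokyo"},
    {"day": date(2022, 9, 6), "city": "tokyo"},
    {"day": date(2022, 9, 7), "city": "lima"},
]

train = [r for r in rows if r["day"] <= date(2022, 9, 5)]
test = [r for r in rows if r["day"] == date(2022, 9, 6)]
production = [r for r in rows if r["day"] == date(2022, 9, 7)]

mappings = fit_object_columns(train, ["city"])       # fit on train only
train_enc = apply_object_columns(train, mappings)
test_enc = apply_object_columns(test, mappings)      # validation
prod_enc = apply_object_columns(production, mappings)  # new data, same mapping
```

Here "lima" never appears in train, so it is encoded as -1 in production instead of silently shifting the codes of known cities.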

AutoViML · 2022-09-07T10:29:50Z

At the moment let's close this thread with this change. If you feel another change is needed elsewhere, you can open another pull request. Okay?

eromoe · 2022-09-07T10:54:09Z

OK, I don't think I will open another PR in the near future. I just wanted to start some discussion :)

fix cat encode

1e0a57b

update

c0d1a0b

update

e3a1716

AutoViML closed this Sep 7, 2022

AutoViML reopened this Sep 7, 2022

AutoViML merged commit b30b73d into AutoViML:main Sep 7, 2022
