Closed
Hi Max, and thanks for the great package.
I watched your interesting presentation, "I don't want to be a dummy". In my experience, label encoding of categorical variables is generally better than one-hot (dummy variable) encoding, both in training speed and in robustness to overfitting.
So my question is: is there a function that transforms a categorical variable into a numeric representation (like the LabelEncoder class in scikit-learn) and also handles novel levels in the test data?
That would be very nice to have!
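To make the request concrete, here is a rough Python sketch of the behavior I have in mind. The class name `SafeLabelEncoder` and the choice of reserving code 0 for unseen levels are my own assumptions for illustration, not an existing API in this package or in scikit-learn.

```python
class SafeLabelEncoder:
    """Hypothetical sketch: map category levels to integers,
    reserving one code for levels that were not seen during fitting."""

    UNKNOWN_CODE = 0  # assumption: all novel levels share this reserved code

    def fit(self, values):
        # Sort the distinct training levels so the codes are deterministic.
        levels = sorted(set(values))
        # Known levels get codes 1..n; 0 stays free for unseen levels.
        self.mapping_ = {level: code for code, level in enumerate(levels, start=1)}
        return self

    def transform(self, values):
        # Any level missing from the training mapping falls back to UNKNOWN_CODE.
        return [self.mapping_.get(v, self.UNKNOWN_CODE) for v in values]


encoder = SafeLabelEncoder().fit(["red", "green", "blue", "green"])
train_codes = encoder.transform(["red", "green", "blue"])
test_codes = encoder.transform(["blue", "purple"])  # "purple" is a novel level
```

With this sketch, "purple" maps to the reserved code instead of raising an error, which is the part that I believe LabelEncoder does not do by default.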
Thank you in advance