-
-
Notifications
You must be signed in to change notification settings - Fork 25.1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
LabelEncoder with mixed typed labels feature or bug? #17294
Comments
For the center we have no choice, the bottom one makes sense to me, you don't want to necessarily identify Btw, you run into this when concatenating dataframes from multiple csv files and a column is sometimes parsed as string and sometimes as integer. Great Fun! |
No this stems from numpy casting issues when an array is constructed with
string elements.
|
Or are you saying that we should have a check_array that is more clever
about mixed types in non-array input?
|
If we want to make the stuff consistent I am really scared that we will stumble in the dtype nightmare and we might come with tricks-and-tips implementations on something which should already be solved otherwise I think. If we could maybe have better error messages with potential avenues for resolution could already be great. |
Is the following a bug or a feature?
This stems from how we only check for multiple types in
_encode
whendtype=object
. Note this behavior is the same in0.22
,0.23
, and on master.The text was updated successfully, but these errors were encountered: