Support for pandas NAType #2931
Comments
@xinyuejohn - thanks for reporting the issue! Looking at the Pandas NA type doc linked here, I think,
In our meeting, we talked about this. I was worried about having too many different ways of expressing "missing value," since Awkward has option-types and Pandas has NAType. (I wonder the same thing about Pandas having both NAType and floating-point
Option 1 would involve a lot of work, and the Pandas API might not allow it. (They might not have a hook for us to tell Pandas, "these values are missing.") It would be a lot of work because Pandas has a lot of functions that do special things with missing values. Option 2 would probably be confusing for users. That's why I would vote for option 3.

The option-type removal and replacement with NAType could perhaps happen in the

layout.is_option and layout.is_union and any(x.is_option for x in layout.contents)

(Actually, I think the policy for option-type and union-type is that all of the union's

The code that strips off the option-types would have to preserve indexes, and the

If the option-types are inside of a union, then this becomes more complicated for the one non-

Finally, I think this discussion should move to the awkward-pandas library. I tried to transfer the issue, but I don't have permissions. @douglasdavis, do you have permissions? (I think one person would need to have Admin permissions on both repos.) If not, could you give me the permissions on awkward-pandas so that I can move this sort of issue in the future?
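To make the option 3 idea above concrete (strip the option-types and put an NA marker in their place, while keeping every value at its original position), here is a minimal pure-Python sketch. It is only an illustration on nested lists, not Awkward's layout API and not code from awkward-pandas: the `_NA` class is a hypothetical stand-in for `pandas.NA`, and `strip_options` is a made-up helper name.

```python
class _NA:
    """Hypothetical stand-in for pandas.NA, so this sketch needs no third-party imports."""

    def __repr__(self):
        return "<NA>"


NA = _NA()


def strip_options(data, missing=NA):
    """Return a copy of `data` with every None replaced by `missing`.

    Nested lists are walked recursively, so list lengths and the position
    of each element (its index) are preserved.
    """
    if data is None:
        return missing
    if isinstance(data, list):
        return [strip_options(item, missing) for item in data]
    return data


# A ragged list with missing values at two different nesting depths.
nested = [[1, None, 3], [], None, [4.5]]
converted = strip_options(nested)
```

With real data, `missing` would presumably be `pandas.NA`, and the walk would go over an Awkward layout rather than Python lists. The union case mentioned above is not covered by this sketch.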
You should now have permission!
@jpivarski thanks for your reply! I think option 3 is indeed more user-friendly!
The issue has been moved to intake/akimbo#44, so I'm closing it here.
Description of new feature
Hi, I was creating an awkward array from a pandas DataFrame and found that awkward doesn't support pandas._libs.missing.NAType().
It would be great if this NAType could be supported.
To replicate:
Traceback: