Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

BUG: groupby.count maintains masked and arrow dtypes #54129

Merged
merged 4 commits into from Jul 24, 2023

Conversation

mroeschke
Copy link
Member

@mroeschke mroeschke commented Jul 14, 2023

@mroeschke mroeschke added Groupby NA - MaskedArrays Related to pd.NA and nullable extension arrays Arrow pyarrow functionality labels Jul 14, 2023
@mroeschke mroeschke added this to the 2.1 milestone Jul 14, 2023
@mroeschke
Copy link
Member Author

Looks like tests are passing here so merging. Happy to follow up if needed

@mroeschke mroeschke merged commit 04180b0 into pandas-dev:main Jul 24, 2023
36 checks passed
@mroeschke mroeschke deleted the bug/gb/count_ea branch July 24, 2023 17:45
Copy link
Member

@rhshadrach rhshadrach left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Sorry - this fell off my radar. Looks good.

@@ -343,9 +343,27 @@ def test_cython_agg_nullable_int(op_name):
# the result is not yet consistently using Int64/Float64 dtype,
# so for now just checking the values by casting to float
result = result.astype("float64")
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Out of scope here, but I think this might pass now without the cast? Should probably be xfailed if not. I plan to look into this.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Arrow pyarrow functionality Groupby NA - MaskedArrays Related to pd.NA and nullable extension arrays
Projects
None yet
Development

Successfully merging this pull request may close these issues.

BUG: DataFrame.groupby.count with arrow dtypes do not return arrow dtypes
2 participants