mroeschke
Member

@mroeschke mroeschke commented Jul 14, 2023

@mroeschke mroeschke added the Groupby, NA - MaskedArrays (related to pd.NA and nullable extension arrays), and Arrow (pyarrow functionality) labels Jul 14, 2023
@mroeschke mroeschke requested a review from rhshadrach as a code owner July 14, 2023 20:14
@mroeschke mroeschke added this to the 2.1 milestone Jul 14, 2023
@mroeschke
Member Author

Looks like tests are passing here, so merging. Happy to follow up if needed.

@mroeschke mroeschke merged commit 04180b0 into pandas-dev:main Jul 24, 2023
@mroeschke mroeschke deleted the bug/gb/count_ea branch July 24, 2023 17:45
Member

@rhshadrach rhshadrach left a comment

Sorry - this fell off my radar. Looks good.

if op_name != "count":
    # the result is not yet consistently using Int64/Float64 dtype,
    # so for now just checking the values by casting to float
    result = result.astype("float64")
Member

Out of scope here, but I think this might pass now without the cast? Should probably be xfailed if not. I plan to look into this.
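
For context, a minimal sketch of the behavior under discussion, using a masked `Int64` column so that pyarrow is not required. The frame, the column names `key` and `val`, and the variable names are made up for illustration and are not taken from the PR's test suite. The resulting dtype depends on the installed pandas version, so the sketch prints it rather than asserting it and only compares values after a cast, much like the test above does for the other ops.

```python
import pandas as pd

# Hypothetical example data: a nullable Int64 column with missing values.
df = pd.DataFrame(
    {
        "key": ["a", "a", "b", "b", "b"],
        "val": pd.array([1, None, 3, 4, None], dtype="Int64"),
    }
)

# count() tallies the non-missing values in each group.
counts = df.groupby("key")["val"].count()

# The dtype of the result is what this PR is about; it may differ
# between pandas versions, so inspect it instead of assuming it.
print(counts.dtype)

# Cast to a plain NumPy integer dtype to compare values independently
# of whether the result is masked or not.
counts_as_int = counts.astype("int64")
print(counts_as_int.to_dict())
```

Casting before comparing keeps the value check stable across versions, which is the same trade-off the quoted test makes with `astype("float64")`.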

Labels
Arrow (pyarrow functionality), Groupby, NA - MaskedArrays (related to pd.NA and nullable extension arrays)
Projects
None yet
Development

Successfully merging this pull request may close these issues.

BUG: DataFrame.groupby.count with arrow dtypes do not return arrow dtypes
2 participants