Support multi-column groupby with sort=True and split_out>1 #10425

Merged
rjzamora merged 4 commits into dask:main from rjzamora:multi-column-groupby-sorted on Aug 16, 2023

Conversation

@rjzamora
Member

Uses the multi-column sort_values support added in #8263 to support multi-column groupby aggregations with sort=True and split_out > 1.
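To make the requirement concrete: with `sort=True` and `split_out > 1`, the aggregated groups have to be spread over several output partitions while the overall result stays ordered by all of the groupby columns. The sketch below illustrates that idea in plain Python. It is not Dask's implementation; the function name `sorted_multi_key_groupby_sum`, the row layout, and the way boundaries are picked are all assumptions made for illustration only.

```python
from bisect import bisect_right
from collections import defaultdict


def sorted_multi_key_groupby_sum(rows, keys, value, split_out):
    """Hypothetical helper: sum `value` per unique combination of `keys` and
    return `split_out` partitions whose concatenation is sorted by `keys`."""
    # Step 1: aggregate per group. Key tuples compare lexicographically,
    # which is what a multi-column sort needs.
    totals = defaultdict(int)
    for row in rows:
        totals[tuple(row[k] for k in keys)] += row[value]

    partitions = [[] for _ in range(split_out)]
    if not totals:
        return partitions

    # Step 2: pick split_out - 1 boundary keys from the sorted group keys
    # (an illustrative choice; a real system would estimate these).
    ordered = sorted(totals)
    boundaries = [ordered[(i * len(ordered)) // split_out] for i in range(1, split_out)]

    # Step 3: route each group to an output partition by comparing its
    # full key tuple against the boundaries.
    for key in ordered:
        partitions[bisect_right(boundaries, key)].append((key, totals[key]))
    return partitions


rows = [
    {"a": 2, "b": "x", "v": 1},
    {"a": 1, "b": "y", "v": 2},
    {"a": 1, "b": "x", "v": 3},
    {"a": 2, "b": "x", "v": 4},
    {"a": 1, "b": "y", "v": 5},
    {"a": 3, "b": "x", "v": 6},
]
parts = sorted_multi_key_groupby_sum(rows, ["a", "b"], "v", split_out=2)
# parts[0] holds the (1, "x") and (1, "y") groups; parts[1] holds (2, "x") and (3, "x")
```

Each partition is sorted internally and every key in one partition precedes every key in the next, which is the property a sorted groupby result with multiple output partitions needs.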

@rjzamora added the dataframe and enhancement labels on Jul 21, 2023
@rjzamora self-assigned this on Jul 21, 2023
@hendrikmakait self-requested a review on August 2, 2023 07:13
Member

@hendrikmakait left a comment

The code generally looks good, with one question regarding the tests. I'd like someone with more dataframe experience to review before signing off (cc @phofl).

Collaborator

@phofl left a comment

lgtm

@rjzamora
Member Author

Thanks for reviewing, @hendrikmakait and @phofl!

@rjzamora merged commit 214acfa into dask:main on Aug 16, 2023
@rjzamora deleted the multi-column-groupby-sorted branch on August 16, 2023 17:30
Labels

dataframe, enhancement (Improve existing functionality or make things work better)

3 participants