Register cudf.core.groupby.Grouper objects to dask grouper_dispatch #10982

Merged

brandon-b-miller
Contributor

Backport of #10838

…h` (rapidsai#10838)

This PR uses the (presumably soon-to-be-merged) dask `Grouper` dispatch to register `cudf.core.groupby.Grouper` as the grouper for `cudf.DataFrame` objects. This should allow our own Grouper objects to be used in critical places in dask instead of pandas objects.

IMO this solution is preferable to changing cuDF to handle pandas grouper objects directly.
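The registration pattern described above can be sketched roughly as follows. This is a self-contained toy, not the code in this PR: the `Dispatch` class is a simplified stand-in for dask's dispatch machinery, and `PandasLikeFrame`, `GpuFrame`, `GpuSeries`, `PandasLikeGrouper`, `GpuGrouper`, and `make_grouper` are hypothetical names used only for illustration. The idea is that library code asks the dispatch which grouper class goes with a given object instead of hard-coding the pandas one.

```python
class Dispatch:
    """Toy type-based registry (assumption: a simplified stand-in, not dask's class)."""

    def __init__(self, name):
        self.name = name
        self._lookup = {}

    def register(self, types):
        # Accept a single type or a tuple of types, like a decorator factory.
        if not isinstance(types, tuple):
            types = (types,)

        def decorator(func):
            for t in types:
                self._lookup[t] = func
            return func

        return decorator

    def __call__(self, obj, *args, **kwargs):
        # Walk the MRO so subclasses of a registered type also resolve.
        for base in type(obj).__mro__:
            if base in self._lookup:
                return self._lookup[base](obj, *args, **kwargs)
        raise TypeError(f"No {self.name} registered for {type(obj).__name__}")


# Hypothetical stand-ins for pandas-like and GPU-backed objects.
class PandasLikeFrame:
    pass


class GpuFrame:
    pass


class GpuSeries:
    pass


class PandasLikeGrouper:
    def __init__(self, key):
        self.key = key


class GpuGrouper:
    def __init__(self, key):
        self.key = key


grouper_dispatch = Dispatch("grouper_dispatch")


@grouper_dispatch.register(PandasLikeFrame)
def get_grouper_pandas_like(obj):
    # Return the grouper *class* that matches this backend.
    return PandasLikeGrouper


@grouper_dispatch.register((GpuFrame, GpuSeries))
def get_grouper_gpu(obj):
    return GpuGrouper


def make_grouper(obj, key):
    # Backend-agnostic caller: look up the grouper class from the object type.
    grouper_cls = grouper_dispatch(obj)
    return grouper_cls(key=key)
```

With this in place, `make_grouper(GpuFrame(), "a")` builds a `GpuGrouper` while `make_grouper(PandasLikeFrame(), "a")` builds a `PandasLikeGrouper`, so the caller never needs to know which backend it is working with.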

Xref dask/dask#9074

Authors:
  - https://github.com/brandon-b-miller

Approvers:
  - GALI PREM SAGAR (https://github.com/galipremsagar)

URL: rapidsai#10838
@github-actions github-actions bot added the Python Affects Python cuDF API. label May 26, 2022
@brandon-b-miller brandon-b-miller marked this pull request as ready for review May 26, 2022 19:54
@brandon-b-miller brandon-b-miller requested a review from a team as a code owner May 26, 2022 19:54
@brandon-b-miller brandon-b-miller added non-breaking Non-breaking change improvement Improvement / enhancement to an existing function labels May 26, 2022
@galipremsagar galipremsagar added the 5 - Ready to Merge Testing and reviews complete, ready to merge label May 26, 2022
@galipremsagar galipremsagar added this to PR-WIP in v22.06 Release via automation May 26, 2022
@galipremsagar galipremsagar moved this from PR-WIP to PR-Reviewer approved in v22.06 Release May 26, 2022
codecov bot commented May 26, 2022

Codecov Report

Merging #10982 (7f20268) into branch-22.06 (b4674a1) will decrease coverage by 0.01%.
The diff coverage is 57.14%.

❗ Current head 7f20268 differs from pull request most recent head aee42fe. Consider uploading reports for the commit aee42fe to get more accurate results

@@               Coverage Diff                @@
##           branch-22.06   #10982      +/-   ##
================================================
- Coverage         86.33%   86.31%   -0.02%     
================================================
  Files               144      144              
  Lines             22681    22688       +7     
================================================
+ Hits              19581    19584       +3     
- Misses             3100     3104       +4     
Impacted Files                                Coverage Δ
python/dask_cudf/dask_cudf/backends.py        84.61% <57.14%> (-0.90%) ⬇️
python/cudf/cudf/core/column/numerical.py     95.88% <0.00%> (-0.30%) ⬇️

Legend: Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Last update b4674a1...aee42fe.

@ajschmidt8 ajschmidt8 merged commit 0a34f67 into rapidsai:branch-22.06 May 27, 2022
v22.06 Release automation moved this from PR-Reviewer approved to Done May 27, 2022
Labels
5 - Ready to Merge: Testing and reviews complete, ready to merge
improvement: Improvement / enhancement to an existing function
non-breaking: Non-breaking change
Python: Affects Python cuDF API.