Improve multi dataframe join performance by holdenk · Pull Request #8740 · dask/dask

holdenk · 2022-02-19T05:12:37Z

by allowing recursive merge function to take also operate on the initial frame rather than a separate non-parallel merge.

[ X ] Closes join should take maximal advantage of _recursive_pairwise_outer_join when present #8739
[ X ] Tests added / passed (py.test dask --verbose -k test_multi.py)
[ X ] Passes pre-commit run --all-files

…ive merge function to take also operate on the initial frame rather than a seperate non-parallel merge. Fix

GPUtester · 2022-02-19T05:12:38Z

Can one of the admins verify this patch?

holdenk · 2022-02-19T06:24:26Z

cc @KrishanBhasin who was the last to work on _recursive_pairwise_outer_join.

KrishanBhasin

Oh nice! LGTM 🚀

jsignell · 2022-02-21T22:04:09Z

add to allowlist

jsignell · 2022-02-22T14:21:05Z

Thanks for this change @holdenk and thanks @KrishanBhasin for the review!

holdenk added 3 commits February 18, 2022 21:01

Try and improve multi dataframe merges performance by allowing recurs…

1146ec8

…ive merge function to take also operate on the initial frame rather than a seperate non-parallel merge. Fix

We can only do full recursive pairwise join for outer joins.

cee9b1a

Style fix.

2837e4f

github-actions bot added the dataframe label Feb 19, 2022

holdenk changed the title ~~[WIP] Try and improve multi dataframe merges performance~~ Improve multi dataframe join performance Feb 19, 2022

KrishanBhasin approved these changes Feb 19, 2022

View reviewed changes

jsignell merged commit 1f8d2c1 into dask:main Feb 22, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Improve multi dataframe join performance#8740

Improve multi dataframe join performance#8740
jsignell merged 3 commits intodask:mainfrom
holdenk:improve-multi-dataframe-merges

holdenk commented Feb 19, 2022

Uh oh!

GPUtester commented Feb 19, 2022

Uh oh!

holdenk commented Feb 19, 2022

Uh oh!

KrishanBhasin left a comment

Uh oh!

jsignell commented Feb 21, 2022

Uh oh!

jsignell commented Feb 22, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Uh oh!

Conversation

holdenk commented Feb 19, 2022

Uh oh!

GPUtester commented Feb 19, 2022

Uh oh!

holdenk commented Feb 19, 2022

Uh oh!

KrishanBhasin left a comment

Choose a reason for hiding this comment

Uh oh!

jsignell commented Feb 21, 2022

Uh oh!

jsignell commented Feb 22, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants