Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[SPARK-45882][SQL][3.5] BroadcastHashJoinExec propagate partitioning should respect CoalescedHashPartitioning #43792

Closed
wants to merge 2 commits into from

Conversation

ulysses-you
Copy link
Contributor

This pr backport #43753 to branch-3.5

What changes were proposed in this pull request?

Add HashPartitioningLike trait and make HashPartitioning and CoalescedHashPartitioning extend it. When we propagate output partiitoning, we should handle HashPartitioningLike instead of HashPartitioning. This pr also changes the BroadcastHashJoinExec to use HashPartitioningLike to avoid regression.

Why are the changes needed?

Avoid unnecessary shuffle exchange.

Does this PR introduce any user-facing change?

yes, avoid regression

How was this patch tested?

add test

Was this patch authored or co-authored using generative AI tooling?

no

…d respect CoalescedHashPartitioning

Add HashPartitioningLike trait and make HashPartitioning and CoalescedHashPartitioning extend it. When we propagate output partiitoning, we should handle HashPartitioningLike instead of HashPartitioning. This pr also changes the BroadcastHashJoinExec to use HashPartitioningLike to avoid regression.

Avoid unnecessary shuffle exchange.

yes, avoid regression

add test

no

Closes apache#43753 from ulysses-you/partitioning.

Authored-by: ulysses-you <ulyssesyou18@gmail.com>
Signed-off-by: Wenchen Fan <wenchen@databricks.com>
@github-actions github-actions bot added the BUILD label Nov 14, 2023
@ulysses-you
Copy link
Contributor Author

thanks for review, merging to branch-3.5

ulysses-you added a commit that referenced this pull request Nov 14, 2023
…should respect CoalescedHashPartitioning

This pr backport #43753 to branch-3.5

### What changes were proposed in this pull request?

Add HashPartitioningLike trait and make HashPartitioning and CoalescedHashPartitioning extend it. When we propagate output partiitoning, we should handle HashPartitioningLike instead of HashPartitioning. This pr also changes the BroadcastHashJoinExec to use HashPartitioningLike to avoid regression.

### Why are the changes needed?

Avoid unnecessary shuffle exchange.

### Does this PR introduce _any_ user-facing change?

yes, avoid regression

### How was this patch tested?

add test

### Was this patch authored or co-authored using generative AI tooling?

no

Closes #43792 from ulysses-you/partitioning-3.5.

Authored-by: ulysses-you <ulyssesyou18@gmail.com>
Signed-off-by: youxiduo <youxiduo@corp.netease.com>
@ulysses-you ulysses-you deleted the partitioning-3.5 branch November 14, 2023 11:55
@dongjoon-hyun
Copy link
Member

Hi, @ulysses-you and @cloud-fan .

If you don't mind, please adjust the JIRA type properly before backporting.

Screenshot 2023-11-14 at 10 12 18 AM

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
3 participants