Skip to content

Conversation

@ulysses-you
Copy link
Contributor

What changes were proposed in this pull request?

This pr makes KeyGroupedPartitioning inherit HashPartitioningLike, so that the BroadcastHashJoin#expandOutputPartitioning and PartitioningPreservingUnaryExecNode can work with it.

Why are the changes needed?

To make KeyGroupedPartitioning support alias aware framework.

Does this PR introduce any user-facing change?

no

How was this patch tested?

add test

Was this patch authored or co-authored using generative AI tooling?

no

@github-actions github-actions bot added the SQL label Aug 13, 2024
@ulysses-you
Copy link
Contributor Author

cc @yaooqinn @cloud-fan @viirya @sunchao thank you

Copy link
Member

@sunchao sunchao left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM, thanks @ulysses-you !

@yaooqinn yaooqinn closed this in 3f3d024 Aug 13, 2024
@yaooqinn
Copy link
Member

Merged to master.

Thank you @ulysses-you @sunchao

Copy link
Member

@dongjoon-hyun dongjoon-hyun left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

+1, late LGTM.

BTW, KeyGroupedPartitioning was added at Apache Spark 3.3.0 via SPARK-37377.

Do we need to backport this to branch-3.5 and branch-3.4, @ulysses-you , @sunchao , @yaooqinn ?

Copy link
Member

@viirya viirya left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Looks good to me.

@ulysses-you ulysses-you deleted the SPARK-49205-partitioning branch August 13, 2024 23:12
@ulysses-you
Copy link
Contributor Author

@dongjoon-hyun it is not a bug fix so the JIRA only targets 4.0.0. I'm fine to backport it to previous branch if you are are interested in.

@dongjoon-hyun
Copy link
Member

Thank you for a quick reply. If this is only an improvement, never mind, @ulysses-you . I was think about the original intention at Spark 3.3.0 PR~

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Projects

None yet

Development

Successfully merging this pull request may close these issues.

5 participants