Skip to content

Conversation

@mcdull-zhang
Copy link
Contributor

What changes were proposed in this pull request?

NULL-aware ANTI join (https://issues.apache.org/jira/browse/SPARK-32290) will build right side into a HashMap.
code in SparkStrategy:
case j @ ExtractSingleColumnNullAwareAntiJoin(leftKeys, rightKeys) => Seq(joins.BroadcastHashJoinExec(leftKeys, rightKeys, LeftAnti, BuildRight, None, planLater(j.left), planLater(j.right), isNullAwareAntiJoin = true))

we should add the conditions and use this optimization when the size of the right side is small enough.

Why are the changes needed?

better stability

Does this PR introduce any user-facing change?

No

How was this patch tested?

added Test

@github-actions github-actions bot added the SQL label Jul 11, 2021
@AmplabJenkins
Copy link

Can one of the admins verify this patch?

dongdongzhang added 2 commits July 11, 2021 15:19
@HyukjinKwon HyukjinKwon changed the title [SPARK-36082][SQL]when the right side is small enough to use SingleColumn Null Aware Anti Join [SPARK-36082][SQL] When the right side is small enough to use SingleColumn Null Aware Anti Join Jul 12, 2021
@github-actions
Copy link

We're closing this PR because it hasn't been updated in a while. This isn't a judgement on the merit of the PR in any way. It's just a way of keeping the PR queue manageable.
If you'd like to revive this PR, please reopen it and ask a committer to remove the Stale tag!

@github-actions github-actions bot added the Stale label Oct 21, 2021
@github-actions github-actions bot closed this Oct 22, 2021
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants