Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[SPARK-38206][SS] Ignore nullability on comparing the data type of join keys on stream-stream join #35599

Closed
wants to merge 2 commits into from

Conversation

HeartSaVioR
Copy link
Contributor

What changes were proposed in this pull request?

This PR proposes to change the assertion of data type against joining keys on stream-stream join to ignore nullability.

Why are the changes needed?

The existing requirement on checking data types of joining keys is too restricted, as it also requires the same nullability. In batch query (I checked with HashJoinExec), nullability is ignored when checking data types of joining keys.

Does this PR introduce any user-facing change?

Yes, end users will no longer encounter the assertion error on join keys with different nullability in both keys.

How was this patch tested?

New test added.

@HeartSaVioR
Copy link
Contributor Author

I've added test cases for inner & left-outer join. I guess it's redundant to have separate test cases for right-outer/full-outer/left-semi, but please let me know if we would like to have exhausive set of tests.

@HeartSaVioR
Copy link
Contributor Author

cc. @cloud-fan @viirya @xuanyuanking @c21

@HeartSaVioR
Copy link
Contributor Author

Thanks! Merging to master.

Copy link
Member

@dongjoon-hyun dongjoon-hyun left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

+1, LGTM.

@c21
Copy link
Contributor

c21 commented Feb 22, 2022

Sorry for late comment, just late +1 LGTM.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
5 participants