Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[SPARK-45564][SQL] Simplify 'DataFrameStatFunctions.bloomFilter' with 'BloomFilterAggregate' expression #43391

Closed

Conversation

zhengruifeng
Copy link
Contributor

What changes were proposed in this pull request?

Simplify 'DataFrameStatFunctions.bloomFilter' function with 'BloomFilterAggregate' expression

Why are the changes needed?

existing implementation was based on RDD, and it can be simplified by dataframe operations

Does this PR introduce any user-facing change?

when the input parameters or datatypes are invalid, throw AnalysisException instead of IllegalArgumentException

How was this patch tested?

ci

Was this patch authored or co-authored using generative AI tooling?

no

@github-actions github-actions bot added the SQL label Oct 17, 2023
@zhengruifeng zhengruifeng changed the title [SPARK-45564][SQL] Simplify 'DataFrameStatFunctions.bloomFilter' function with 'BloomFilterAggregate' expression [SPARK-45564][SQL] Simplify 'DataFrameStatFunctions.bloomFilter' with 'BloomFilterAggregate' expression Oct 17, 2023
@LuciferYang
Copy link
Contributor

cc @hvanhovell

@beliefer
Copy link
Contributor

cc @srowen @cloud-fan

Copy link
Member

@srowen srowen left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

If there's no behavior change other than the exception type, and its' not slower, seems fine.

Copy link
Contributor

@hvanhovell hvanhovell left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@srowen srowen closed this in 922844f Oct 17, 2023
@srowen
Copy link
Member

srowen commented Oct 17, 2023

Merged to master

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
5 participants