[SPARK-28962][SQL][FOLLOW-UP] Add the parameter description for the Scala function API filter #27336

gatorsmile · 2020-01-23T08:30:39Z

What changes were proposed in this pull request?

This PR is a follow-up to PR #25666. It adds the parameter description and an example for the Scala function API filter.

Why are the changes needed?

It is hard to tell which parameter is the index column.

Does this PR introduce any user-facing change?

No

How was this patch tested?

N/A

gatorsmile · 2020-01-23T08:31:33Z

cc @henrydavidge @cloud-fan @maropu @ueshin

sql/core/src/main/scala/org/apache/spark/sql/functions.scala

SparkQA · 2020-01-23T12:40:42Z

Test build #117290 has finished for PR 27336 at commit 59860e7.

This patch fails SparkR unit tests.
This patch merges cleanly.
This patch adds no public classes.

dongjoon-hyun · 2020-01-23T16:54:21Z

All tests passed; the R failure is the known flaky CRAN incoming feasibility check. cc @viirya

* checking CRAN incoming feasibility ...Error in .check_package_CRAN_incoming(pkgdir) : 
  dims [product 26] do not match the length of object [0]

viirya · 2020-01-23T16:59:11Z

@dongjoon-hyun Thanks for pinging me. Requested help from CRAN.

SparkQA · 2020-01-23T22:57:36Z

Test build #117318 has finished for PR 27336 at commit ac1c3c0.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

dongjoon-hyun

+1, LGTM. Merged to master.

cloud-fan · 2020-02-04T06:33:58Z

sql/core/src/main/scala/org/apache/spark/sql/functions.scala

+   *
+   * @param column: the input array column
+   * @param f: (col, index) => predicate, the boolean predicate to filter the input column
+   *           given the index. Indices start at 0.
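
For illustration, here is a minimal sketch of how the two-argument form described above might be called. It assumes Spark 3.0 or later on the classpath and a local session; the object name, method name, column name `s`, and sample data are hypothetical and not taken from the patch.

```scala
import org.apache.spark.sql.SparkSession
import org.apache.spark.sql.functions.{col, filter}

// Hypothetical example object; not part of the patch.
object FilterWithIndexExample {
  // Keeps the elements whose zero-based position in the array is even.
  def evenPositions(spark: SparkSession): Seq[Int] = {
    import spark.implicits._
    // One row holding a single array column named "s" (assumed name).
    val df = Seq(Seq(10, 11, 12, 13)).toDF("s")
    // x is the array element, i is its index; indices start at 0.
    val kept = df.select(filter(col("s"), (x, i) => i % 2 === 0).as("kept"))
    kept.head().getSeq[Int](0)
  }

  def main(args: Array[String]): Unit = {
    val spark = SparkSession.builder()
      .master("local[1]")
      .appName("filter-with-index")
      .getOrCreate()
    try {
      // Positions 0 and 2 are kept, so the elements 10 and 12 remain.
      assert(evenPositions(spark) == Seq(10, 12))
    } finally {
      spark.stop()
    }
  }
}
```

The first lambda argument is the element and the second is the index, which is the ordering the new `@param f` text spells out.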


Is it consistent within Spark that the index parameter starts at 0 in higher-order functions? @ueshin @HyukjinKwon

ArrayTransform with an index argument starts at 0.
We might need to change it to start from 1 (with a legacy config and a migration guide)?
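
As a rough sketch of the zero-based behavior mentioned here, the Scala `transform` function with an index argument can be exercised the same way. This assumes Spark 3.0 or later and a local session; the object name, method name, column name `s`, and sample data are hypothetical.

```scala
import org.apache.spark.sql.SparkSession
import org.apache.spark.sql.functions.{col, transform}

// Hypothetical example object; not part of the patch.
object TransformWithIndexExample {
  // Adds each element's zero-based index to the element itself.
  def addIndex(spark: SparkSession): Seq[Int] = {
    import spark.implicits._
    val df = Seq(Seq(10, 20, 30)).toDF("s")
    // x is the element, i is its index; the first element sees i = 0.
    val out = df.select(transform(col("s"), (x, i) => x + i).as("out"))
    out.head().getSeq[Int](0)
  }

  def main(args: Array[String]): Unit = {
    val spark = SparkSession.builder()
      .master("local[1]")
      .appName("transform-with-index")
      .getOrCreate()
    try {
      // With zero-based indices: 10 + 0, 20 + 1, 30 + 2.
      assert(addIndex(spark) == Seq(10, 21, 32))
    } finally {
      spark.stop()
    }
  }
}
```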

What's the behavior of Presto?

Actually, Presto's transform and filter don't take an index argument.

I remember we had a discussion about the index argument in the PR for zip_with_index (#21121 (comment)).
For filter, the index argument was added later in a separate change, but it seems to be a similar context: https://issues.apache.org/jira/browse/SPARK-28962.

It's too late to change now, let's keep using 0.

add description of the parameter f and an example.

59860e7

maropu approved these changes Jan 23, 2020

HyukjinKwon reviewed Jan 23, 2020

sql/core/src/main/scala/org/apache/spark/sql/functions.scala

HyukjinKwon reviewed Jan 23, 2020

sql/core/src/main/scala/org/apache/spark/sql/functions.scala

dongjoon-hyun added the SQL label Jan 23, 2020

gatorsmile added 2 commits January 23, 2020 10:51

fixed.

6372720

fixed.

ac1c3c0

dongjoon-hyun approved these changes Jan 24, 2020

dongjoon-hyun closed this in ddf8315 Jan 24, 2020

cloud-fan reviewed Feb 4, 2020

cloud-fan Feb 4, 2020

ueshin Feb 5, 2020 (edited)

cloud-fan Feb 5, 2020

ueshin Feb 5, 2020

cloud-fan Feb 5, 2020
