[SPARK-23624][SQL]Revise doc of method pushFilters in Datasource V2 #20769

gengliangwang · 2018-03-08T05:01:59Z

What changes were proposed in this pull request?

Revise doc of method pushFilters in SupportsPushDownFilters/SupportsPushDownCatalystFilters

In FileSourceStrategy, all filters except partitionKeyFilters (whose references are a subset of the partition keys) need to be evaluated after scanning. Otherwise, Spark will get wrong results from data sources like ORC/Parquet.

This PR is to improve the doc.
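To illustrate why a pushed filter may still have to be reported back for evaluation after scanning, here is a minimal, self-contained sketch. It does not use the Spark API: `GreaterThan`, `RowGroup`, `scan` and `postScan` are hypothetical names invented for this example, and the row-group skipping is only loosely modelled on how ORC/Parquet use min/max statistics. The assumption is that the source can use the filter to skip whole row groups but cannot filter individual rows.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

public class PushFiltersSketch {
    /** Hypothetical predicate "value > threshold"; stands in for a pushed-down filter. */
    static final class GreaterThan {
        final int threshold;
        GreaterThan(int threshold) { this.threshold = threshold; }
        boolean matches(int value) { return value > threshold; }
    }

    /** Hypothetical row group carrying a max statistic (an assumption of this sketch). */
    static final class RowGroup {
        final int[] values;
        final int max;
        RowGroup(int... values) {
            this.values = values;
            this.max = Arrays.stream(values).max().orElse(Integer.MIN_VALUE);
        }
    }

    /** Scan that uses the pushed filter only to skip whole row groups. */
    static List<Integer> scan(List<RowGroup> groups, GreaterThan pushed) {
        List<Integer> out = new ArrayList<>();
        for (RowGroup g : groups) {
            if (g.max <= pushed.threshold) continue; // no row in this group can match
            for (int v : g.values) out.add(v);       // surviving groups are returned whole
        }
        return out;
    }

    /** Post-scan evaluation of a filter that the source reported back to the engine. */
    static List<Integer> postScan(List<Integer> rows, GreaterThan filter) {
        List<Integer> out = new ArrayList<>();
        for (int v : rows) if (filter.matches(v)) out.add(v);
        return out;
    }

    public static void main(String[] args) {
        List<RowGroup> groups = Arrays.asList(new RowGroup(1, 2, 3), new RowGroup(4, 9, 12));
        GreaterThan filter = new GreaterThan(5);

        List<Integer> scanned = scan(groups, filter);
        System.out.println(scanned);                   // prints [4, 9, 12]: 4 does not satisfy the filter
        System.out.println(postScan(scanned, filter)); // prints [9, 12]
    }
}
```

In this sketch the filter is both pushed (it prunes the first row group) and still needed after the scan (it removes the row 4). A source that behaves this way would have to report the filter back rather than treat it as fully handled.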


gengliangwang · 2018-03-08T05:02:17Z

@cloud-fan

SparkQA · 2018-03-08T06:30:36Z

Test build #88075 has finished for PR 20769 at commit bc98b20.

This patch fails Spark unit tests.
This patch merges cleanly.
This patch adds no public classes.

gengliangwang · 2018-03-08T07:34:30Z

retest this please.

SparkQA · 2018-03-08T08:05:02Z

Test build #88077 has finished for PR 20769 at commit bc98b20.

This patch fails due to an unknown error code, -9.
This patch merges cleanly.
This patch adds no public classes.

gengliangwang · 2018-03-08T09:04:41Z

retest this please.

SparkQA · 2018-03-08T12:11:28Z

Test build #88083 has finished for PR 20769 at commit bc98b20.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

gatorsmile

LGTM

## What changes were proposed in this pull request?

Revise doc of method pushFilters in SupportsPushDownFilters/SupportsPushDownCatalystFilters

In `FileSourceStrategy`, all filters except `partitionKeyFilters` (whose references are a subset of the partition keys) need to be evaluated after scanning. Otherwise, Spark will get wrong results from data sources like ORC/Parquet.

This PR is to improve the doc.

Author: Wang Gengliang <gengliang.wang@databricks.com>

Closes #20769 from gengliangwang/revise_pushdown_doc.

(cherry picked from commit 10b0657)
Signed-off-by: gatorsmile <gatorsmile@gmail.com>


Update doc for SupportsPushDownCatalystFilters and SupportsPushDownFi…

bc98b20

…lters

gatorsmile reviewed Mar 9, 2018


asfgit closed this in 10b0657 Mar 9, 2018
