[SPARK-15453] [SQL] [Follow-up] FileSourceScanExec to extract `outputOrdering` information #16994

gatorsmile · 2017-02-20T00:47:10Z

What changes were proposed in this pull request?

outputOrdering is also dependent on whether the bucket has more than one files. The test cases fail when we try to move them to sql/core. This PR is to fix the test cases introduced in #14864 and add a test case to verify the related logics.

How was this patch tested?

N/A

gatorsmile · 2017-02-20T00:48:00Z

cc @tejasapatil @cloud-fan

tejasapatil · 2017-02-20T01:09:02Z

sql/hive/src/test/scala/org/apache/spark/sql/sources/BucketedReadSuite.scala

@@ -240,6 +240,7 @@ class BucketedReadSuite extends QueryTest with SQLTestUtils with TestHiveSinglet
      joinCondition: (DataFrame, DataFrame) => Column,
      shuffleLeft: Boolean,
      shuffleRight: Boolean,
+      numPartitions: Int = 10,


Since other vars are for left and right side, wondering if same could to be done for numPartitions. ie. numPartitionsLeft and numPartitionsRight.

That could make the new test you added more interesting :

numPartitionsLeft = 1, numPartitionsRight = 10, sortLeft = false, sortRight = true

Sure, let me do it.

SparkQA · 2017-02-20T02:16:14Z

Test build #73135 has finished for PR 16994 at commit d158ce1.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

SparkQA · 2017-02-20T05:50:55Z

Test build #73140 has finished for PR 16994 at commit f1569bf.

This patch passes all tests.
This patch merges cleanly.
This patch adds the following public classes (experimental):
case class BucketTableTestSpec(

SparkQA · 2017-02-20T06:01:53Z

Test build #73142 has finished for PR 16994 at commit 4b73130.

This patch passes all tests.
This patch merges cleanly.
This patch adds the following public classes (experimental):
case class BucketedTableTestSpec(

tejasapatil · 2017-02-20T11:59:29Z

LGTM

gatorsmile · 2017-02-20T17:02:03Z

Thanks! Merging to master.

…dering` information ### What changes were proposed in this pull request? `outputOrdering` is also dependent on whether the bucket has more than one files. The test cases fail when we try to move them to sql/core. This PR is to fix the test cases introduced in apache#14864 and add a test case to verify [the related logics](https://github.com/tejasapatil/spark/blob/070c24994747c0479fb2520774ede27ff1cf8cac/sql/core/src/main/scala/org/apache/spark/sql/execution/DataSourceScanExec.scala#L197-L206). ### How was this patch tested? N/A Author: Xiao Li <gatorsmile@gmail.com> Closes apache#16994 from gatorsmile/bucketingTS.

fix.

d158ce1

tejasapatil reviewed Feb 20, 2017

View reviewed changes

gatorsmile added 2 commits February 19, 2017 20:22

fix.

f1569bf

style fix

4b73130

asfgit closed this in ead4ba0 Feb 20, 2017

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[SPARK-15453] [SQL] [Follow-up] FileSourceScanExec to extract `outputOrdering` information #16994

[SPARK-15453] [SQL] [Follow-up] FileSourceScanExec to extract `outputOrdering` information #16994

gatorsmile commented Feb 20, 2017 •

edited

gatorsmile commented Feb 20, 2017

tejasapatil Feb 20, 2017

gatorsmile Feb 20, 2017

SparkQA commented Feb 20, 2017

SparkQA commented Feb 20, 2017

SparkQA commented Feb 20, 2017

tejasapatil commented Feb 20, 2017

gatorsmile commented Feb 20, 2017

[SPARK-15453] [SQL] [Follow-up] FileSourceScanExec to extract outputOrdering information #16994

[SPARK-15453] [SQL] [Follow-up] FileSourceScanExec to extract outputOrdering information #16994

Conversation

gatorsmile commented Feb 20, 2017 • edited

What changes were proposed in this pull request?

How was this patch tested?

gatorsmile commented Feb 20, 2017

tejasapatil Feb 20, 2017

Choose a reason for hiding this comment

gatorsmile Feb 20, 2017

Choose a reason for hiding this comment

SparkQA commented Feb 20, 2017

SparkQA commented Feb 20, 2017

SparkQA commented Feb 20, 2017

tejasapatil commented Feb 20, 2017

gatorsmile commented Feb 20, 2017

[SPARK-15453] [SQL] [Follow-up] FileSourceScanExec to extract `outputOrdering` information #16994

[SPARK-15453] [SQL] [Follow-up] FileSourceScanExec to extract `outputOrdering` information #16994

gatorsmile commented Feb 20, 2017 •

edited