[SPARK-24716][TESTS][FOLLOW-UP] Test Hive metastore schema and parquet schema are in different letter cases #22267

wangyum · 2018-08-29T10:24:10Z

What changes were proposed in this pull request?

Since #21696. Spark uses Parquet schema instead of Hive metastore schema to do pushdown.
That change can avoid wrong records returned when Hive metastore schema and parquet schema are in different letter cases. This pr add a test case for it.

More details:
https://issues.apache.org/jira/browse/SPARK-25206

How was this patch tested?

unit tests

SparkQA · 2018-08-29T13:58:03Z

Test build #95415 has finished for PR 22267 at commit f5559f4.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

wangyum · 2018-08-29T14:00:16Z

cc @cloud-fan

cloud-fan · 2018-08-30T02:03:42Z

...e/src/test/scala/org/apache/spark/sql/execution/datasources/parquet/ParquetFilterSuite.scala

@@ -1021,6 +1021,18 @@ class ParquetFilterSuite extends QueryTest with ParquetTest with SharedSQLContex
      }
    }
  }
+
+  test("SPARK-25206: wrong records are returned when Hive metastore schema and parquet schema " +


this is a end-to-end test and should not be put here. How about HiveParquetSuite

SparkQA · 2018-08-30T05:10:39Z

Test build #95453 has finished for PR 22267 at commit cdbf81a.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

cloud-fan · 2018-08-30T08:25:17Z

thanks, merging to master!

…t schema are in different letter cases ## What changes were proposed in this pull request? Since apache#21696. Spark uses Parquet schema instead of Hive metastore schema to do pushdown. That change can avoid wrong records returned when Hive metastore schema and parquet schema are in different letter cases. This pr add a test case for it. More details: https://issues.apache.org/jira/browse/SPARK-25206 ## How was this patch tested? unit tests Closes apache#22267 from wangyum/SPARK-24716-TESTS. Authored-by: Yuming Wang <yumwang@ebay.com> Signed-off-by: Wenchen Fan <wenchen@databricks.com>

Improvement test.

f5559f4

cloud-fan reviewed Aug 30, 2018

View reviewed changes

Move to HiveParquetSuite

cdbf81a

asfgit closed this in e9fce2a Aug 30, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[SPARK-24716][TESTS][FOLLOW-UP] Test Hive metastore schema and parquet schema are in different letter cases #22267

[SPARK-24716][TESTS][FOLLOW-UP] Test Hive metastore schema and parquet schema are in different letter cases #22267

wangyum commented Aug 29, 2018 •

edited

SparkQA commented Aug 29, 2018

wangyum commented Aug 29, 2018

cloud-fan Aug 30, 2018

SparkQA commented Aug 30, 2018

cloud-fan commented Aug 30, 2018

[SPARK-24716][TESTS][FOLLOW-UP] Test Hive metastore schema and parquet schema are in different letter cases #22267

[SPARK-24716][TESTS][FOLLOW-UP] Test Hive metastore schema and parquet schema are in different letter cases #22267

Conversation

wangyum commented Aug 29, 2018 • edited

What changes were proposed in this pull request?

How was this patch tested?

SparkQA commented Aug 29, 2018

wangyum commented Aug 29, 2018

cloud-fan Aug 30, 2018

Choose a reason for hiding this comment

SparkQA commented Aug 30, 2018

cloud-fan commented Aug 30, 2018

wangyum commented Aug 29, 2018 •

edited