Conversation

@CodingCat
Contributor

What changes were proposed in this pull request?

The current code path ignores the value of spark.sql.hive.convertMetastoreParquet when building a data source table:

case UnresolvedCatalogRelation(tableMeta) if DDLUtils.isDatasourceTable(tableMeta) =>

As a result, even when I turn off spark.sql.hive.convertMetastoreParquet, Spark SQL still uses its own Parquet reader to access the table instead of delegating to the Hive SerDe.

This PR checks the value of the configuration when building a data source table.
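For context, a minimal sketch of the scenario described above (my own illustration, not code from this PR; the table name t is made up):

    // Disable the metastore Parquet conversion.
    spark.conf.set("spark.sql.hive.convertMetastoreParquet", "false")

    // A table written through the DataFrame API and registered in the metastore.
    spark.range(10).write.format("parquet").saveAsTable("t")

    // Expectation behind this PR: with the flag off, the scan should go through the Hive SerDe.
    // Observed behavior: the plan still contains Spark's native Parquet scan, because the
    // table is recorded as a data source table.
    spark.sql("SELECT * FROM t").explain()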

How was this patch tested?

Existing tests.

@CodingCat CodingCat changed the title [SQL][SPARK-24797] respect spark.sql.hive.convertMetastoreOrc/Parquet when build… [SPARK-24797] [SQL] respect spark.sql.hive.convertMetastoreOrc/Parquet when build… Jul 13, 2018
@CodingCat
Contributor Author

@felixcheung

@SparkQA

SparkQA commented Jul 13, 2018

Test build #92960 has finished for PR 21757 at commit a5d72cc.

  • This patch fails Spark unit tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

case i @ InsertIntoTable(UnresolvedCatalogRelation(tableMeta), _, _, _, _)
-     if DDLUtils.isDatasourceTable(tableMeta) =>
+     if DDLUtils.isDatasourceTable(tableMeta) &&
+       DDLUtils.convertSchema(tableMeta, sparkSession) =>
Member

I do not think this is the right fix. If the original table is a native data source table, we will always use our Parquet/ORC reader instead of the Hive SerDe.

Contributor Author

Do you mean that any table built through df.write.format("..") should be treated as a data source table, no matter whether we register it with the HMS or not?

Member

If you are using format("parquet") to create a new table, it will be a data source table. We always use the native reader/writer to read/write such a table.
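A small sketch of that distinction (again my own illustration; table names are made up, and it assumes DESCRIBE FORMATTED reports the table's provider):

    // Data source table: always read and written with Spark's native Parquet code path,
    // regardless of spark.sql.hive.convertMetastoreParquet.
    spark.range(10).write.format("parquet").saveAsTable("ds_table")

    // Hive SerDe table: spark.sql.hive.convertMetastoreParquet controls whether Spark converts
    // the scan to its native Parquet reader (true, the default) or delegates to the SerDe (false).
    spark.sql("CREATE TABLE hive_table (id BIGINT) STORED AS PARQUET")

    // The provider recorded in the metastore is what distinguishes the two.
    spark.sql("DESCRIBE FORMATTED ds_table").show(100, truncate = false)   // provider: parquet
    spark.sql("DESCRIBE FORMATTED hive_table").show(100, truncate = false) // provider: hive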

Contributor Author

@CodingCat CodingCat Jul 13, 2018

ok, thanks

@CodingCat CodingCat closed this Jul 13, 2018