[SPARK-11778] [SQL]:parse table name before it is passed to lookupRelation #9773

huaxingao · 2015-11-17T19:57:48Z

Fix a bug in DataFrameReader.table: a table name qualified with a schema name, such as "db_name.table", doesn't work.
Use SqlParser.parseTableIdentifier to parse the table name before lookupRelation.

huaxingao · 2015-11-17T20:04:48Z

hiveContext.table("db_name.table") works but
hiveContext.read.table("db_name.table")
throws an org.apache.spark.sql.catalyst.analysis.NoSuchTableException

In hiveContext.table("db_name.table"), the name goes through SqlParser.parseTableIdentifier(tableName),
and the table name "db_name.table" is resolved to 'db_name'.'table'. Later, when the qualified table name is computed, the database name resolves to db_name and the table name to table, so the lookup succeeds.

In hiveContext.read.table("db_name.table"), the name does not go through SqlParser, so the table name "db_name.table" remains as is. Later, when the qualified table name is computed, the database name resolves to default and the table name is "db_name.table", so the lookup fails.
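
The difference between the two code paths can be illustrated with a minimal, self-contained sketch. This is not Spark's actual code: `TableIdent`, `parseTableIdent`, `qualify`, and the toy catalog are hypothetical stand-ins for `TableIdentifier`, `SqlParser.parseTableIdentifier`, and the catalog lookup, and the parser here only handles a plain `db.table` string (no backtick quoting).

```scala
// Simplified model of the two lookup paths; all names here are hypothetical.
object TableNameSketch {
  // Stand-in for Spark's TableIdentifier: a table name plus an optional database.
  final case class TableIdent(table: String, database: Option[String] = None)

  // Stand-in for SqlParser.parseTableIdentifier: split "db.table" into its parts.
  def parseTableIdent(name: String): TableIdent =
    name.split('.') match {
      case Array(db, table) => TableIdent(table, Some(db))
      case Array(table)     => TableIdent(table)
      case _                => throw new IllegalArgumentException(s"Cannot parse: $name")
    }

  // Fill in the current database when the identifier does not carry one.
  def qualify(ident: TableIdent, currentDb: String = "default"): (String, String) =
    (ident.database.getOrElse(currentDb), ident.table)

  // Toy catalog holding a single table, db_name.table.
  val catalog: Set[(String, String)] = Set(("db_name", "table"))

  // Path modeled on hiveContext.table: parse first, then qualify.
  def lookupParsed(name: String): Boolean =
    catalog.contains(qualify(parseTableIdent(name)))

  // Path modeled on hiveContext.read.table before the fix:
  // the raw string is used as the table name.
  def lookupUnparsed(name: String): Boolean =
    catalog.contains(qualify(TableIdent(name)))
}
```

In this model, `lookupParsed("db_name.table")` finds the table, while `lookupUnparsed("db_name.table")` searches the `default` database for a table literally named `db_name.table` and misses, which corresponds to the NoSuchTableException described above.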

marmbrus · 2015-11-17T20:20:36Z

Test cases please for any bug fix. Look at HiveDataFrameAnalyticsSuite as an example (since I think we have to put this in the hive package for it to work).

marmbrus · 2015-11-17T20:20:47Z

ok to test

SparkQA · 2015-11-17T20:26:10Z

Test build #46106 has finished for PR 9773 at commit 02300e6.

This patch fails Scala style tests.
This patch merges cleanly.
This patch adds no public classes.

huaxingao · 2015-11-18T21:13:20Z

I will add a test case.

SparkQA · 2015-11-18T23:53:08Z

Test build #46242 has finished for PR 9773 at commit 1342374.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

marmbrus · 2015-11-19T00:12:17Z

If we can get a test case soon we can still include this in Spark 1.6.

Also please make the title: [SPARK-11778] [SQL] Parse table name before it is passed to lookupRelation and add something to the PR description. Together, these will become the commit message.

huaxingao · 2015-11-19T01:19:19Z

Test case added. Could you please take a look? Thanks a lot!!

SparkQA · 2015-11-19T01:24:17Z

Test build #46279 has finished for PR 9773 at commit 158d1a7.

This patch fails Scala style tests.
This patch merges cleanly.
This patch adds no public classes.

SparkQA · 2015-11-19T04:08:12Z

Test build #46285 has finished for PR 9773 at commit b12a475.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

marmbrus · 2015-11-19T21:07:07Z

sql/hive/src/test/scala/org/apache/spark/sql/hive/HiveDataFrameAnalyticsSuite.scala

+  // There was a bug in DataFrameFrameReader.table and it has problem for table with schema name,
+  // Before fix, it throw Exceptionorg.apache.spark.sql.catalyst.analysis.NoSuchTableException
+  test("table name with schema") {
+    hiveContext.read.table("usrdb.test")


Sorry, I think I was unclear. I don't think we should put this in HiveDataFrameAnalyticsSuite since it has nothing to do with analytics. I was just suggesting to use this as a model. Let's just make a separate generic HiveDataFrameSuite since this is pretty core functionality.

While we are at it, I'm also not a huge fan of using beforeAll and afterAll for test setup for a single test, since it means the state is spread out across the file (sorry for the bad example). I would just do it all in the test block.
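
One way to keep setup and cleanup next to the assertion is a small loan-style helper called from inside the test block. The sketch below is plain Scala with no Spark or ScalaTest dependency; `withFakeTable`, `fakeCatalog`, and `tableNameWithSchemaCheck` are hypothetical names that only illustrate the shape, not what the follow-up PR should contain.

```scala
import scala.collection.mutable

object InlineSetupSketch {
  // Hypothetical in-memory stand-in for a catalog of table names.
  val fakeCatalog: mutable.Set[String] = mutable.Set.empty

  // Register the table, run the body, and always remove the table afterwards.
  def withFakeTable[A](name: String)(body: => A): A = {
    fakeCatalog += name
    try body
    finally fakeCatalog -= name
  }

  // Setup, check, and cleanup are all visible in one place.
  def tableNameWithSchemaCheck(): Boolean =
    withFakeTable("usrdb.test") {
      fakeCatalog.contains("usrdb.test")
    }
}
```

Because the cleanup runs in a `finally` clause, the fake catalog is empty again after the check even if the body throws, so no state leaks into other tests in the file.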

For the description in the comments, it's customary to link to the JIRA.

marmbrus · 2015-11-19T21:07:37Z

I'm going to go ahead and merge this since I want it in 1.6. It would be great if you could address comments in a follow up PR. Thanks!

…tion

Fix a bug in DataFrameReader.table (table with schema name such as "db_name.table" doesn't work)
Use SqlParser.parseTableIdentifier to parse the table name before lookupRelation.

Author: Huaxin Gao <huaxing@oc0558782468.ibm.com>

Closes #9773 from huaxingao/spark-11778.

(cherry picked from commit 4700074)
Signed-off-by: Michael Armbrust <michael@databricks.com>

SPARK-11778:parse table name before it is passed to lookupRelation

02300e6

Huaxin Gao added 3 commits November 18, 2015 04:11

SPARK-11778:parse table name before it is passed to lookupRelation

1342374

SPARK-11778:add test case

158d1a7

[SPARK-11778][SQL]:fix scala style problem in newly added test case

b12a475

huaxingao changed the title ~~SPARK-11778:parse table name before it is passed to lookupRelation~~ [SPARK-11778] [SQL]:parse table name before it is passed to lookupRelation Nov 19, 2015

marmbrus reviewed Nov 19, 2015

asfgit closed this in 4700074 Nov 19, 2015
