[SPARK-22016][SQL] Add HiveDialect for JDBC connection to Hive #19238

danielfx90 · 2017-09-14T21:28:06Z

What changes were proposed in this pull request?

Added a HiveDialect for JDBC connection to Hive.
It overrides two methods:

canHandle
quoteIdentifier

How was this patch tested?

It passes the added tests and it was used with a real Hive instance with real data.

AmplabJenkins · 2017-09-14T21:32:04Z

Can one of the admins verify this patch?

gatorsmile · 2017-09-14T22:51:57Z

Why not directly connecting to Hive metastore?

danielfx90 · 2017-09-15T15:54:45Z

@gatorsmile if Hive lies on the same infrastructure as the application, then the metastore should definitely solve the issue, but a connection over JDBC is needed when data comes from an external source which only exposes such a connection through its Hive server. We encountered this and ended up adding the HiveDialect to solve it.

dongjoon-hyun · 2017-09-15T16:05:48Z

sql/core/src/test/scala/org/apache/spark/sql/jdbc/JDBCSuite.scala

-      assert(df3.collect() === Array(Row(21519, 1234)))
-    }
+      assert(df3.collect() === Array(Row(21519, 1234))
+    )


This ')' is wrong. Line 1105~1107 from the original have indentation issue.

It must have changed when formatting the code using the IDE. Scalastyle checks passed though, but let me rollback that anyway.

@dongjoon-hyun done! Thank you!

Ur, actually, I meant the original Spark code is also wrong in terms of indentation. You can fix the indentation of original line 1105~1107 here. :)

@dongjoon-hyun You are right! I misread the parenthesis. I think now is correct. Thank you for the observation :)

gatorsmile · 2017-09-18T16:26:17Z

I can see the value, but it does not perform well in most cases if we using JDBC connection. Instead of adding the extra dialect to upstream, could you please add Hive as a separate data source? Thanks!

https://spark.apache.org/third-party-projects.html

danielfx90 · 2017-09-18T20:00:12Z

Seems logical. Then, unless someone disagrees, feel free to close this PR and we will create a new spark package with this feature in a new repository.

Thanks!

paulstaab · 2018-06-19T12:00:18Z

This merge request would partly solve https://issues.apache.org/jira/browse/SPARK-21063

danielfx90 added 3 commits September 14, 2017 18:11

HiveDialect implementation done

3f486be

HiveDialect registration added

c0d2624

Tests for the HiveDialect added

f704950

dongjoon-hyun reviewed Sep 15, 2017

View reviewed changes

danielfx90 added 2 commits September 15, 2017 14:11

Code indentation fixed in JDBCSuite

7d3a6d6

JDBCSuite indentation issues fixed

12bc9ca

HyukjinKwon mentioned this pull request Sep 26, 2017

[BUILD] Close stale PRs #19348

Closed

asfgit closed this in ceaec93 Sep 27, 2017

HyukjinKwon mentioned this pull request Apr 16, 2020

[SPARK-31457][SQL]spark jdbc read hive created the wrong PreparedStatementadd #28230

Closed

HyukjinKwon mentioned this pull request Mar 21, 2024

[SPARK-47482] Add HiveDialect to sql module #45609

Closed

dongjoon-hyun mentioned this pull request Mar 21, 2024

[SPARK-47482] Add HiveDialect to sql module #45644

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[SPARK-22016][SQL] Add HiveDialect for JDBC connection to Hive #19238

[SPARK-22016][SQL] Add HiveDialect for JDBC connection to Hive #19238

danielfx90 commented Sep 14, 2017

AmplabJenkins commented Sep 14, 2017

gatorsmile commented Sep 14, 2017

danielfx90 commented Sep 15, 2017

dongjoon-hyun Sep 15, 2017

danielfx90 Sep 15, 2017

danielfx90 Sep 15, 2017

dongjoon-hyun Sep 15, 2017

danielfx90 Sep 18, 2017

gatorsmile commented Sep 18, 2017

danielfx90 commented Sep 18, 2017

paulstaab commented Jun 19, 2018

[SPARK-22016][SQL] Add HiveDialect for JDBC connection to Hive #19238

[SPARK-22016][SQL] Add HiveDialect for JDBC connection to Hive #19238

Conversation

danielfx90 commented Sep 14, 2017

What changes were proposed in this pull request?

How was this patch tested?

AmplabJenkins commented Sep 14, 2017

gatorsmile commented Sep 14, 2017

danielfx90 commented Sep 15, 2017

dongjoon-hyun Sep 15, 2017

Choose a reason for hiding this comment

danielfx90 Sep 15, 2017

Choose a reason for hiding this comment

danielfx90 Sep 15, 2017

Choose a reason for hiding this comment

dongjoon-hyun Sep 15, 2017

Choose a reason for hiding this comment

danielfx90 Sep 18, 2017

Choose a reason for hiding this comment

gatorsmile commented Sep 18, 2017

danielfx90 commented Sep 18, 2017

paulstaab commented Jun 19, 2018