[SPARK-22016][SQL] Add HiveDialect for JDBC connection to Hive #19238

danielfx90 · 2017-09-14T21:28:06Z

What changes were proposed in this pull request?

Added a HiveDialect for JDBC connection to Hive.
It overrides two methods:

canHandle
quoteIdentifier

How was this patch tested?

It passes the added tests and it was used with a real Hive instance with real data.

AmplabJenkins · 2017-09-14T21:32:04Z

Can one of the admins verify this patch?

gatorsmile · 2017-09-14T22:51:57Z

Why not directly connecting to Hive metastore?

danielfx90 · 2017-09-15T15:54:45Z

@gatorsmile if Hive lies on the same infrastructure as the application, then the metastore should definitely solve the issue, but a connection over JDBC is needed when data comes from an external source which only exposes such a connection through its Hive server. We encountered this and ended up adding the HiveDialect to solve it.

dongjoon-hyun · 2017-09-15T16:05:48Z

sql/core/src/test/scala/org/apache/spark/sql/jdbc/JDBCSuite.scala

-      assert(df3.collect() === Array(Row(21519, 1234)))
-    }
+      assert(df3.collect() === Array(Row(21519, 1234))
+    )


This ')' is wrong. Line 1105~1107 from the original have indentation issue.

It must have changed when formatting the code using the IDE. Scalastyle checks passed though, but let me rollback that anyway.

@dongjoon-hyun done! Thank you!

Ur, actually, I meant the original Spark code is also wrong in terms of indentation. You can fix the indentation of original line 1105~1107 here. :)

@dongjoon-hyun You are right! I misread the parenthesis. I think now is correct. Thank you for the observation :)

gatorsmile · 2017-09-18T16:26:17Z

I can see the value, but it does not perform well in most cases if we using JDBC connection. Instead of adding the extra dialect to upstream, could you please add Hive as a separate data source? Thanks!

https://spark.apache.org/third-party-projects.html

danielfx90 · 2017-09-18T20:00:12Z

Seems logical. Then, unless someone disagrees, feel free to close this PR and we will create a new spark package with this feature in a new repository.

Thanks!

paulstaab · 2018-06-19T12:00:18Z

This merge request would partly solve https://issues.apache.org/jira/browse/SPARK-21063

Closes apache#13794 Closes apache#18474 Closes apache#18897 Closes apache#18978 Closes apache#19152 Closes apache#19238 Closes apache#19295 Closes apache#19334 Closes apache#19335 Closes apache#19347 Closes apache#19236 Closes apache#19244 Closes apache#19300 Closes apache#19315 Closes apache#19356 Closes apache#15009 Closes apache#18253 Author: hyukjinkwon <[email protected]> Closes apache#19348 from HyukjinKwon/stale-prs.

danielfx90 added 3 commits September 14, 2017 18:11

HiveDialect implementation done

3f486be

HiveDialect registration added

c0d2624

Tests for the HiveDialect added

f704950

dongjoon-hyun reviewed Sep 15, 2017

View reviewed changes

danielfx90 added 2 commits September 15, 2017 14:11

Code indentation fixed in JDBCSuite

7d3a6d6

JDBCSuite indentation issues fixed

12bc9ca

HyukjinKwon mentioned this pull request Sep 26, 2017

[BUILD] Close stale PRs #19348

Closed

asfgit closed this in ceaec93 Sep 27, 2017

HyukjinKwon mentioned this pull request Apr 16, 2020

[SPARK-31457][SQL]spark jdbc read hive created the wrong PreparedStatementadd #28230

Closed

HyukjinKwon mentioned this pull request Mar 21, 2024

[SPARK-47482] Add HiveDialect to sql module #45609

Closed

dongjoon-hyun mentioned this pull request Mar 21, 2024

[SPARK-47482] Add HiveDialect to sql module #45644

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[SPARK-22016][SQL] Add HiveDialect for JDBC connection to Hive #19238

[SPARK-22016][SQL] Add HiveDialect for JDBC connection to Hive #19238

Uh oh!

danielfx90 commented Sep 14, 2017

Uh oh!

AmplabJenkins commented Sep 14, 2017

Uh oh!

gatorsmile commented Sep 14, 2017

Uh oh!

danielfx90 commented Sep 15, 2017

Uh oh!

dongjoon-hyun Sep 15, 2017

Uh oh!

danielfx90 Sep 15, 2017

Uh oh!

danielfx90 Sep 15, 2017

Uh oh!

dongjoon-hyun Sep 15, 2017

Uh oh!

danielfx90 Sep 18, 2017

Uh oh!

gatorsmile commented Sep 18, 2017

Uh oh!

danielfx90 commented Sep 18, 2017

Uh oh!

paulstaab commented Jun 19, 2018

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

[SPARK-22016][SQL] Add HiveDialect for JDBC connection to Hive #19238

[SPARK-22016][SQL] Add HiveDialect for JDBC connection to Hive #19238

Uh oh!

Conversation

danielfx90 commented Sep 14, 2017

What changes were proposed in this pull request?

How was this patch tested?

Uh oh!

AmplabJenkins commented Sep 14, 2017

Uh oh!

gatorsmile commented Sep 14, 2017

Uh oh!

danielfx90 commented Sep 15, 2017

Uh oh!

dongjoon-hyun Sep 15, 2017

Choose a reason for hiding this comment

Uh oh!

danielfx90 Sep 15, 2017

Choose a reason for hiding this comment

Uh oh!

danielfx90 Sep 15, 2017

Choose a reason for hiding this comment

Uh oh!

dongjoon-hyun Sep 15, 2017

Choose a reason for hiding this comment

Uh oh!

danielfx90 Sep 18, 2017

Choose a reason for hiding this comment

Uh oh!

gatorsmile commented Sep 18, 2017

Uh oh!

danielfx90 commented Sep 18, 2017

Uh oh!

paulstaab commented Jun 19, 2018

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants