Fixing Spark & Hive jar mismatch for hive registration via hoodie-hive #123

This is a hairy issue. Spark ships with its own version of Hive, which can differ from the version of Hive actually running on the cluster. Since Hoodie is ultimately a Spark app, any incompatibility here prevents Hive sync from being done inline from a tool like HoodieDeltaStreamer, and forces it to run as a separate task on a workflow engine.

We need a long-term fix for this, so that Hive tables can be registered seamlessly from a Hoodie datasource (as you can with HiveContext).

    * Spark 2.1 still uses Hive 1.2, which is different from the CDH version. We are running into this problem, where newer versions of the JDBC driver do not work with an older HiveServer2: https://www.bountysource.com/issues/1366324-newer-versions-of-jdbc-driver-does-not-work-with-older-hiveserver2
    * What does Spark itself do? Does it use a thin client to talk to Hive?
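
On the question of what Spark itself does: Spark SQL exposes the settings `spark.sql.hive.metastore.version` and `spark.sql.hive.metastore.jars`, which let its metastore client be loaded from Hive jars other than the bundled ones. The sketch below is a minimal illustration of picking those settings for a given cluster Hive version. The helpers `major_minor` and `metastore_conf`, the example version string `1.1.0-cdh5.7.2`, and the rule that matching major.minor versions are compatible are all assumptions made for illustration, not part of Hoodie or Spark. These settings cover Spark's own metastore access; whether they also help the hoodie-hive registration path is not established here.

```python
# Hive version bundled with Spark 2.1, as stated in the issue text above.
SPARK_BUILTIN_HIVE = "1.2"


def major_minor(version):
    """Return (major, minor) from a string such as '1.2.1' or '1.1.0-cdh5.7.2'."""
    # Drop any vendor suffix (assumed to follow the first '-') before splitting.
    parts = version.split("-")[0].split(".")
    return int(parts[0]), int(parts[1])


def metastore_conf(cluster_hive_version):
    """Build Spark SQL settings for a metastore running the given Hive version.

    Assumption: equal major.minor versions are treated as compatible.
    """
    if major_minor(cluster_hive_version) == major_minor(SPARK_BUILTIN_HIVE):
        # The bundled Hive client already matches; no override is needed.
        return {}
    return {
        # Upstream Hive version, with the vendor suffix stripped (an assumption).
        "spark.sql.hive.metastore.version": cluster_hive_version.split("-")[0],
        # "maven" asks Spark to download matching Hive jars;
        # a JVM classpath of local jars can be given instead.
        "spark.sql.hive.metastore.jars": "maven",
    }


if __name__ == "__main__":
    # Hypothetical CDH-style version string, used only as an example input.
    for key, value in metastore_conf("1.1.0-cdh5.7.2").items():
        print(f"--conf {key}={value}")
```

The returned dictionary could be passed as `--conf` flags to `spark-submit` or applied to a session builder. An empty result means the bundled client is assumed to be usable as-is.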


