[SPARK-47197] Failed to connect HiveMetastore when using iceberg with HiveCatalog on spark-sql or spark-shell#45309
[SPARK-47197] Failed to connect HiveMetastore when using iceberg with HiveCatalog on spark-sql or spark-shell#45309eubnara wants to merge 1 commit intoapache:masterfrom
Conversation
… HiveCatalog on spark-sql or spark-shell
|
As an example, Apache Kyuubi implements a Hive Connector based on Spark DSv2 API, which allows connecting to multi HMSs, and implements |
|
Thanks for reply. |
|
IMO it's an Iceberg side issue, and in addition to the case you listed above, accessing multiple Kerberized HMS cases should be considered, e.g. the Spark built-in HMS and Iceberg HMS are different, configure more than one Iceberg Hive catalogs +cc @pvary @szehon-ho @sunchao |
|
Thanks for explanation. I think I need to review spark, iceberg codes more... |
|
Even with this patch, |
|
Oh! I finally figure out why it fails. |
What changes were proposed in this pull request?
Make
spark-sql,spark-shellbe able to access iceberg with HiveCatalog.If a user want to access iceberg table with HiveCatalog through
spark-sql,spark-shell, the user should specify additional configuration:Why are the changes needed?
spark-sqlandspark-shellcannot access iceberg table with HiveCatalog because there is no HIVE_DELEGATION_TOKEN.Does this PR introduce any user-facing change?
If there is a user who specify
--conf spark.security.credentials.hive.enabled=true, spark will get HIVE_DELEGATION_TOKEN even though deploy mode is not "cluster".How was this patch tested?
Manually tested on on-premise internal cluster with Hadoop 3.3.4, Iceberg 1.4.3, and Spark 3.2.3.
Was this patch authored or co-authored using generative AI tooling?
No.