
[SPARK-13403] [SQL] Pass hadoopConfiguration to HiveConf constructors. #11273

Conversation

@rdblue (Contributor) commented Feb 19, 2016

This commit updates the HiveContext so that sc.hadoopConfiguration is used to instantiate its internal instances of HiveConf.

I tested this by overriding the S3 FileSystem implementation from spark-defaults.conf as "spark.hadoop.fs.s3.impl" (to avoid HADOOP-12810).
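For context, Spark copies any property prefixed with `spark.hadoop.` into `sc.hadoopConfiguration` with the prefix stripped, so the override described above might look like this in `spark-defaults.conf` (the implementation class shown is a hypothetical placeholder, not the one used in the actual test):

```
# spark-defaults.conf
# Properties prefixed with spark.hadoop. are copied into
# sc.hadoopConfiguration with the "spark.hadoop." prefix removed.
spark.hadoop.fs.s3.impl    com.example.CustomS3FileSystem
```

Before this patch, such an override reached `sc.hadoopConfiguration` but never the `HiveConf` instances HiveContext created internally.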

@rxin (Contributor) commented Feb 20, 2016

cc @marmbrus

@marmbrus (Contributor) commented

Seems reasonable.

@marmbrus (Contributor) commented

ok to test

@SparkQA commented Feb 22, 2016

Test build #51669 has finished for PR 11273 at commit 32a3dcf.

  • This patch fails Scala style tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@srowen (Member) commented Feb 23, 2016

FWIW also seems reasonable to me; making a new conf is rarely correct. @rdblue can you rebase?

@vanzin (Contributor) commented Mar 14, 2016

Also, could you fix the title to follow the Spark convention?
https://cwiki.apache.org/confluence/display/SPARK/Contributing+to+Spark#ContributingtoSpark-PullRequest

@rdblue rdblue changed the title SPARK-13403: Pass hadoopConfiguration to HiveConf constructors. [SPARK-13403] [SQL] Pass hadoopConfiguration to HiveConf constructors. Mar 16, 2016
sc.hadoopConfiguration may contain Hadoop-specific configuration
properties that are not used by Spark SQL's HiveContext because the
configuration is not passed when constructing instances of HiveConf.
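The fix is conceptually small: seed each new HiveConf from `sc.hadoopConfiguration` instead of a fresh, empty configuration. A minimal sketch of the before/after behavior, using a stand-in `Config` class (the `Config` and `HiveConfDemo` names and the `fs.s3.impl` value are illustrative; the real types are `org.apache.hadoop.conf.Configuration` and `org.apache.hadoop.hive.conf.HiveConf`, whose `HiveConf(Configuration, Class)` constructor copies the properties of the configuration it is given):

```java
import java.util.HashMap;
import java.util.Map;

// Stand-in for Hadoop's Configuration with copy-constructor semantics;
// no Hadoop dependency is needed for this sketch.
class Config {
    private final Map<String, String> props = new HashMap<>();

    Config() {}

    // Mirrors new HiveConf(Configuration other, Class<?> cls), which
    // copies every property of `other` into the new instance.
    Config(Config other) {
        props.putAll(other.props);
    }

    void set(String key, String value) { props.put(key, value); }

    String get(String key, String dflt) { return props.getOrDefault(key, dflt); }
}

public class HiveConfDemo {
    // Before SPARK-13403: the Hive config was built from scratch,
    // so user overrides in sc.hadoopConfiguration were silently dropped.
    static Config hiveConfBefore() { return new Config(); }

    // After SPARK-13403: sc.hadoopConfiguration seeds the Hive config,
    // so Hadoop-specific overrides (e.g. fs.s3.impl) survive.
    static Config hiveConfAfter(Config scHadoopConf) { return new Config(scHadoopConf); }

    public static void main(String[] args) {
        Config hadoopConf = new Config(); // plays the role of sc.hadoopConfiguration
        hadoopConf.set("fs.s3.impl", "com.example.CustomS3FileSystem"); // hypothetical override
        System.out.println(hiveConfBefore().get("fs.s3.impl", "<default>"));
        System.out.println(hiveConfAfter(hadoopConf).get("fs.s3.impl", "<default>"));
    }
}
```

The first lookup falls back to the default because the override never reaches the fresh config; the second sees the override because the base configuration was copied in.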
@rdblue rdblue force-pushed the SPARK-13403-new-hive-conf-from-hadoop-conf branch from 32a3dcf to f0f6cee on March 16, 2016 23:42
@rdblue (Contributor, Author) commented Mar 17, 2016

Sorry for the delay, @vanzin and @srowen! I didn't get notified that you had commented. I've rebased this on master and fixed the PR title.

@rdblue rdblue closed this Mar 17, 2016
@rdblue rdblue reopened this Mar 17, 2016
@SparkQA commented Mar 17, 2016

Test build #53377 has finished for PR 11273 at commit f0f6cee.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@rxin (Contributor) commented Mar 17, 2016

Thanks — merging this.

@asfgit asfgit closed this in 5faba9f Mar 17, 2016
@rdblue (Contributor, Author) commented Mar 17, 2016

Thanks for the reviews!

roygao94 pushed a commit to roygao94/spark that referenced this pull request Mar 22, 2016
This commit updates the HiveContext so that sc.hadoopConfiguration is used to instantiate its internal instances of HiveConf.

I tested this by overriding the S3 FileSystem implementation from spark-defaults.conf as "spark.hadoop.fs.s3.impl" (to avoid [HADOOP-12810](https://issues.apache.org/jira/browse/HADOOP-12810)).

Author: Ryan Blue <blue@apache.org>

Closes apache#11273 from rdblue/SPARK-13403-new-hive-conf-from-hadoop-conf.