[SPARK-27175][BUILD] Upgrade hadoop-3 to 3.2.0 by wangyum · Pull Request #24106 · apache/spark

wangyum · 2019-03-15T10:32:54Z

What changes were proposed in this pull request?

This PR upgrade hadoop-3 to 3.2.0 to workaround HADOOP-16086. Otherwise some test case will throw IllegalArgumentException:

02:44:34.707 ERROR org.apache.hadoop.hive.ql.exec.Task: Job Submission failed with exception 'java.io.IOException(Cannot initialize Cluster. Please check your configuration for mapreduce.framework.name and the correspond server addresses.)'
java.io.IOException: Cannot initialize Cluster. Please check your configuration for mapreduce.framework.name and the correspond server addresses.
	at org.apache.hadoop.mapreduce.Cluster.initialize(Cluster.java:116)
	at org.apache.hadoop.mapreduce.Cluster.<init>(Cluster.java:109)
	at org.apache.hadoop.mapreduce.Cluster.<init>(Cluster.java:102)
	at org.apache.hadoop.mapred.JobClient.init(JobClient.java:475)
	at org.apache.hadoop.mapred.JobClient.<init>(JobClient.java:454)
	at org.apache.hadoop.hive.ql.exec.mr.ExecDriver.execute(ExecDriver.java:369)
	at org.apache.hadoop.hive.ql.exec.mr.MapRedTask.execute(MapRedTask.java:151)
	at org.apache.hadoop.hive.ql.exec.Task.executeTask(Task.java:199)
	at org.apache.hadoop.hive.ql.exec.TaskRunner.runSequential(TaskRunner.java:100)
	at org.apache.hadoop.hive.ql.Driver.launchTask(Driver.java:2183)
	at org.apache.hadoop.hive.ql.Driver.execute(Driver.java:1839)
	at org.apache.hadoop.hive.ql.Driver.runInternal(Driver.java:1526)
	at org.apache.hadoop.hive.ql.Driver.run(Driver.java:1237)
	at org.apache.hadoop.hive.ql.Driver.run(Driver.java:1227)
	at org.apache.spark.sql.hive.client.HiveClientImpl.$anonfun$runHive$1(HiveClientImpl.scala:730)
	at org.apache.spark.sql.hive.client.HiveClientImpl.$anonfun$withHiveState$1(HiveClientImpl.scala:283)
	at org.apache.spark.sql.hive.client.HiveClientImpl.liftedTree1$1(HiveClientImpl.scala:221)
	at org.apache.spark.sql.hive.client.HiveClientImpl.retryLocked(HiveClientImpl.scala:220)
	at org.apache.spark.sql.hive.client.HiveClientImpl.withHiveState(HiveClientImpl.scala:266)
	at org.apache.spark.sql.hive.client.HiveClientImpl.runHive(HiveClientImpl.scala:719)
	at org.apache.spark.sql.hive.client.HiveClientImpl.runSqlHive(HiveClientImpl.scala:709)
	at org.apache.spark.sql.hive.StatisticsSuite.createNonPartitionedTable(StatisticsSuite.scala:719)
	at org.apache.spark.sql.hive.StatisticsSuite.$anonfun$testAlterTableProperties$2(StatisticsSuite.scala:822)

How was this patch tested?

manual tests

HyukjinKwon · 2019-03-15T10:51:45Z

Yup, I'm okie with it too. Adding @srowen too

dev/test-dependencies.sh

SparkQA · 2019-03-15T21:31:30Z

Test build #103529 has finished for PR 24106 at commit 9e2e5dd.

This patch fails Spark unit tests.
This patch merges cleanly.
This patch adds no public classes.

wangyum · 2019-03-15T22:02:16Z

retest this please

dongjoon-hyun

Since we drop hadoop-3.1 in this PR, shall we use hadoop-3.2 explicitly as we did with hadoop-2.6 and hadoop-2.7?

SparkQA · 2019-03-16T02:36:45Z

Test build #103553 has finished for PR 24106 at commit 9e2e5dd.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

SparkQA · 2019-03-16T22:35:31Z

Test build #103565 has finished for PR 24106 at commit 5473dc1.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

srowen · 2019-03-17T00:42:13Z

Merged to master

dongjoon-hyun

Thank you, @wangyum and @srowen .
LGTM, too.

This PR upgrade `hadoop-3` to `3.2.0` to workaround [HADOOP-16086](https://issues.apache.org/jira/browse/HADOOP-16086). Otherwise some test case will throw IllegalArgumentException: ```java 02:44:34.707 ERROR org.apache.hadoop.hive.ql.exec.Task: Job Submission failed with exception 'java.io.IOException(Cannot initialize Cluster. Please check your configuration for mapreduce.framework.name and the correspond server addresses.)' java.io.IOException: Cannot initialize Cluster. Please check your configuration for mapreduce.framework.name and the correspond server addresses. at org.apache.hadoop.mapreduce.Cluster.initialize(Cluster.java:116) at org.apache.hadoop.mapreduce.Cluster.<init>(Cluster.java:109) at org.apache.hadoop.mapreduce.Cluster.<init>(Cluster.java:102) at org.apache.hadoop.mapred.JobClient.init(JobClient.java:475) at org.apache.hadoop.mapred.JobClient.<init>(JobClient.java:454) at org.apache.hadoop.hive.ql.exec.mr.ExecDriver.execute(ExecDriver.java:369) at org.apache.hadoop.hive.ql.exec.mr.MapRedTask.execute(MapRedTask.java:151) at org.apache.hadoop.hive.ql.exec.Task.executeTask(Task.java:199) at org.apache.hadoop.hive.ql.exec.TaskRunner.runSequential(TaskRunner.java:100) at org.apache.hadoop.hive.ql.Driver.launchTask(Driver.java:2183) at org.apache.hadoop.hive.ql.Driver.execute(Driver.java:1839) at org.apache.hadoop.hive.ql.Driver.runInternal(Driver.java:1526) at org.apache.hadoop.hive.ql.Driver.run(Driver.java:1237) at org.apache.hadoop.hive.ql.Driver.run(Driver.java:1227) at org.apache.spark.sql.hive.client.HiveClientImpl.$anonfun$runHive$1(HiveClientImpl.scala:730) at org.apache.spark.sql.hive.client.HiveClientImpl.$anonfun$withHiveState$1(HiveClientImpl.scala:283) at org.apache.spark.sql.hive.client.HiveClientImpl.liftedTree1$1(HiveClientImpl.scala:221) at org.apache.spark.sql.hive.client.HiveClientImpl.retryLocked(HiveClientImpl.scala:220) at org.apache.spark.sql.hive.client.HiveClientImpl.withHiveState(HiveClientImpl.scala:266) at org.apache.spark.sql.hive.client.HiveClientImpl.runHive(HiveClientImpl.scala:719) at org.apache.spark.sql.hive.client.HiveClientImpl.runSqlHive(HiveClientImpl.scala:709) at org.apache.spark.sql.hive.StatisticsSuite.createNonPartitionedTable(StatisticsSuite.scala:719) at org.apache.spark.sql.hive.StatisticsSuite.$anonfun$testAlterTableProperties$2(StatisticsSuite.scala:822) ``` manual tests Closes apache#24106 from wangyum/SPARK-27175. Authored-by: Yuming Wang <yumwang@ebay.com> Signed-off-by: Sean Owen <sean.owen@databricks.com>

Upgrade hadoop-3 to 3.2.0

9e2e5dd

srowen approved these changes Mar 15, 2019

View reviewed changes

dongjoon-hyun reviewed Mar 15, 2019

View reviewed changes

dev/test-dependencies.sh Outdated Show resolved Hide resolved

dongjoon-hyun requested changes Mar 15, 2019

View reviewed changes

Remane hadoop-3 to hadoop-3.2

5473dc1

srowen approved these changes Mar 16, 2019

View reviewed changes

srowen closed this in 9c0af74 Mar 17, 2019

wangyum deleted the SPARK-27175 branch March 17, 2019 00:55

dongjoon-hyun reviewed Mar 17, 2019

View reviewed changes

wangyum mentioned this pull request May 12, 2019

[SPARK-27175] Add test-hadoop3.2 signal to developer-tools.md apache/spark-website#203

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[SPARK-27175][BUILD] Upgrade hadoop-3 to 3.2.0#24106

[SPARK-27175][BUILD] Upgrade hadoop-3 to 3.2.0#24106
wangyum wants to merge 2 commits intoapache:masterfrom
wangyum:SPARK-27175

wangyum commented Mar 15, 2019

Uh oh!

HyukjinKwon commented Mar 15, 2019

Uh oh!

Uh oh!

SparkQA commented Mar 15, 2019

Uh oh!

wangyum commented Mar 15, 2019

Uh oh!

dongjoon-hyun left a comment

Uh oh!

SparkQA commented Mar 16, 2019

Uh oh!

SparkQA commented Mar 16, 2019

Uh oh!

srowen commented Mar 17, 2019

Uh oh!

dongjoon-hyun left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

Conversation

wangyum commented Mar 15, 2019

What changes were proposed in this pull request?

How was this patch tested?

Uh oh!

HyukjinKwon commented Mar 15, 2019

Uh oh!

Uh oh!

SparkQA commented Mar 15, 2019

Uh oh!

wangyum commented Mar 15, 2019

Uh oh!

dongjoon-hyun left a comment

Choose a reason for hiding this comment

Uh oh!

SparkQA commented Mar 16, 2019

Uh oh!

SparkQA commented Mar 16, 2019

Uh oh!

srowen commented Mar 17, 2019

Uh oh!

dongjoon-hyun left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants