[SPARK-14705][YARN]support Multiple FileSystem for YARN STAGING DIR by lianhuiwang · Pull Request #12473 · apache/spark

lianhuiwang · 2016-04-18T16:39:46Z

What changes were proposed in this pull request?

In SPARK-13063, It makes the SPARK YARN STAGING DIR as configurable. But it only support default FileSystem. If there are many clusters, It can be different FileSystem for different cluster in our spark.

How was this patch tested?

I have tested it successfully with following commands:
MASTER=yarn-client ./bin/spark-shell --conf spark.yarn.stagingDir=hdfs:namenode2/temp
$SPARK_HOME/bin/spark-submit --conf spark.yarn.stagingDir=hdfs:namenode2/temp

vanzin · 2016-04-18T17:23:53Z

yarn/src/main/scala/org/apache/spark/deploy/yarn/Client.scala

    // and add them as local resources to the application master.
-    val fs = FileSystem.get(hadoopConf)
-    val dst = getAppStagingDirPath(sparkConf, fs, appStagingDir)
+    val dst = new Path(appStagingBaseDir, appStagingDir)


You could pass appStagingDir as a Path to this method and save some duplication; same for setupLaunchEnv below.

This whole class could use some cleanup in that regard, but these two are pretty low-hanging fruit.

SparkQA · 2016-04-18T17:52:08Z

Test build #56064 has finished for PR 12473 at commit 0ba4de8.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

tgravescs · 2016-04-18T17:55:44Z

test says NA, what testing have you done with this?

In the very least I would like to see manual regression test hdfs and I assume you are making this to talk to some other filesystem, so what other filesystem was it tested with?

lianhuiwang · 2016-04-19T02:43:54Z

@vanzin yes, I update code with your comments.Thanks.
@tgravescs I have tested it on my spark using spark-shell and spark-submit, I have updated it.Thanks.

SparkQA · 2016-04-19T02:52:13Z

Test build #56182 has finished for PR 12473 at commit c0374af.

This patch fails Spark unit tests.
This patch merges cleanly.
This patch adds no public classes.

SparkQA · 2016-04-19T03:48:16Z

Test build #56186 has finished for PR 12473 at commit 59eb03c.

This patch fails Spark unit tests.
This patch merges cleanly.
This patch adds no public classes.

jerryshao · 2016-04-19T03:53:52Z

So my understanding is that actually supporting different HDFS other than default one, not multiple HDFS, is that right?

lianhuiwang · 2016-04-19T04:02:51Z

@jerryshao Yes, what you said is right.

SparkQA · 2016-04-19T04:15:50Z

Test build #56188 has finished for PR 12473 at commit 61b51f2.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

HyukjinKwon · 2016-04-20T00:44:15Z

(This is super minor but I remember I was told it might be better if those cc are added in comments not in the description because PR description is the place where to describe the PR.)

vanzin · 2016-04-20T02:43:12Z

@HyukjinKwon the merge scripts clean up "@" references from the PR summary.

vanzin · 2016-04-20T02:47:25Z

LGTM, merging to master.

HyukjinKwon · 2016-04-20T02:58:24Z

@vanzin Thank you! but it might still look a bit weird that there is cc in PR description above maybe.

lianhuiwang · 2016-04-20T03:06:20Z

@HyukjinKwon @vanzin Thanks. I have updated PR description. But @vanzin have merged to master before. So I think it does not matter for this PR.

init commit

0ba4de8

vanzin reviewed Apr 18, 2016
View reviewed changes

address comments

c0374af

fix ut

59eb03c

lianhuiwang added 2 commits April 19, 2016 12:00

Merge branch 'apache-master' into SPARK-14705

c125523

merge master and fix ut

61b51f2

asfgit closed this in 4514aeb Apr 20, 2016

Conversation

lianhuiwang commented Apr 18, 2016 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What changes were proposed in this pull request?

How was this patch tested?

Uh oh!

vanzin Apr 18, 2016

Choose a reason for hiding this comment

Uh oh!

SparkQA commented Apr 18, 2016

Uh oh!

tgravescs commented Apr 18, 2016 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

lianhuiwang commented Apr 19, 2016 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

SparkQA commented Apr 19, 2016

Uh oh!

SparkQA commented Apr 19, 2016

Uh oh!

jerryshao commented Apr 19, 2016

Uh oh!

lianhuiwang commented Apr 19, 2016

Uh oh!

SparkQA commented Apr 19, 2016

Uh oh!

HyukjinKwon commented Apr 20, 2016

Uh oh!

vanzin commented Apr 20, 2016

Uh oh!

vanzin commented Apr 20, 2016

Uh oh!

HyukjinKwon commented Apr 20, 2016 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

lianhuiwang commented Apr 20, 2016

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

lianhuiwang commented Apr 18, 2016 •

edited

Loading

tgravescs commented Apr 18, 2016 •

edited

Loading

lianhuiwang commented Apr 19, 2016 •

edited

Loading

HyukjinKwon commented Apr 20, 2016 •

edited

Loading