
[SPARK-2290] Worker should directly use its own sparkHome instead of appDesc.sparkHome when LaunchExecutor #1392

Closed
wants to merge 7 commits

Conversation

YanTangZhai
Contributor

The Worker should use its own sparkHome instead of appDesc.sparkHome when handling LaunchExecutor.
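
To make the intent concrete, here is a minimal, self-contained Scala sketch of the before/after behavior; `AppDescription`, `Worker`, and `resolveExecutorSparkHome` are illustrative stand-ins, not the actual Spark source.

```scala
// Hypothetical sketch (not Spark's real classes): the Worker resolves the
// executor's Spark home from its own installation path rather than from the
// application description sent by the driver.
case class AppDescription(name: String, sparkHome: String)

class Worker(val sparkHome: String) {
  // Before the fix: this would return appDesc.sparkHome, i.e. the driver's layout.
  // After the fix: the Worker always uses its own sparkHome.
  def resolveExecutorSparkHome(appDesc: AppDescription): String = sparkHome
}

object Demo extends App {
  val worker  = new Worker("/opt/spark-on-worker")
  val appDesc = AppDescription("my-app", "/home/driver/spark") // driver-side path
  // Prints /opt/spark-on-worker even though the driver supplied another path.
  println(worker.resolveExecutorSparkHome(appDesc))
}
```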

@AmplabJenkins

Can one of the admins verify this patch?

@YanTangZhai
Contributor Author

#1244

@andrewor14
Contributor

Jenkins, test this please

@SparkQA

SparkQA commented Jul 13, 2014

QA tests have started for PR 1392. This patch merges cleanly.
View progress: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/16609/consoleFull

@SparkQA

SparkQA commented Jul 14, 2014

QA results for PR 1392:
- This patch FAILED unit tests.
- This patch merges cleanly
- This patch adds no public classes

For more information see test output:
https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/16609/consoleFull

@andrewor14
Contributor

Hey @YanTangZhai, on second thought I think we should keep the config, but not set it by default like we do currently. The user may have multiple installations of Spark on the same Worker machine, and "spark.home" previously provided them a way to pick among these installations. We should keep this functionality, but make it optional as opposed to forcing it on them.
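
A minimal sketch of that optional override (a made-up helper over a plain Map, not Spark's actual configuration API): the explicit "spark.home" setting wins when present, and the Worker's own installation is the fallback.

```scala
// Illustrative only: `effectiveSparkHome` and the Map-based conf are hypothetical.
object SparkHomeResolution extends App {
  def effectiveSparkHome(conf: Map[String, String], workerSparkHome: String): String =
    conf.getOrElse("spark.home", workerSparkHome) // explicit setting wins

  // A user picking among multiple installations on the same Worker machine:
  println(effectiveSparkHome(Map("spark.home" -> "/opt/sparkB"), "/opt/sparkA")) // /opt/sparkB
  // No explicit setting: fall back to the Worker's own installation:
  println(effectiveSparkHome(Map.empty, "/opt/sparkA")) // /opt/sparkA
}
```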

Also, since we no longer need "spark.home", it would be good to remove all occurrences of it to avoid confusion (except for backwards compatibility). However, this is slightly tricky because Mesos handles this differently from other modes.

So I suggest this: I will take over from here, because this change seems a little more involved than we originally imagined it to be. How does that sound?

@YanTangZhai
Contributor Author

Hi @andrewor14 , that's ok. Thanks.

@pwendell
Contributor

@YanTangZhai can you merge this up to master and make sure it is passing tests? Thanks. I spoke with @andrewor14 about #1472 and we agreed that having this patch would be good. We are right up against a deadline, so if you can't up-merge it in the next day I can just fix it up and then merge it.

asfgit pushed a commit that referenced this pull request Aug 2, 2014
When standalone Workers launch executors, they inherit the Spark home set by the driver. This means if the worker machines do not share the same directory structure as the driver node, the Workers will attempt to run scripts (e.g. bin/compute-classpath.sh) that do not exist locally and fail. This is a common scenario if the driver is launched from outside of the cluster.

The solution is to simply not pass the driver's Spark home to the Workers. This PR also attempts to avoid overloading `spark.home`, which is now used only for setting the executor Spark home on Mesos and in Python.

This is based on top of #1392 and originally reported by YanTangZhai. Tested on standalone cluster.

Author: Andrew Or <andrewor14@gmail.com>

Closes #1734 from andrewor14/spark-home-reprise and squashes the following commits:

f71f391 [Andrew Or] Revert changes in python
1c2532c [Andrew Or] Merge branch 'master' of github.com:apache/spark into spark-home-reprise
188fc5d [Andrew Or] Avoid using spark.home where possible
09272b7 [Andrew Or] Always use Worker's working directory as spark home
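
As a rough illustration of the driver-side half of that fix (the field names here are hypothetical, not the real `ApplicationDescription`): the description the driver sends simply stops carrying a Spark home, so nothing driver-specific can leak into the Worker's executor launch path.

```scala
// Hypothetical sketch: the application description sent to the cluster no
// longer includes a sparkHome field; each Worker substitutes its own install.
case class ApplicationDescription(
    name: String,
    maxCores: Option[Int],
    memoryPerExecutorMb: Int
    // note: no sparkHome field; the Worker supplies its own path at launch time
)

object DescDemo extends App {
  val desc = ApplicationDescription("my-app", maxCores = Some(4), memoryPerExecutorMb = 1024)
  println(desc) // the Worker never sees a driver-side Spark home
}
```
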
@YanTangZhai
Contributor Author

@pwendell Sorry, I'm late. Please disregard this PR since #1734 has been closed.

YanTangZhai closed this Aug 5, 2014
xiliu82 pushed a commit to xiliu82/spark that referenced this pull request Sep 4, 2014