[SPARK-16019][yarn] Use separate RM poll interval when starting client AM. #18380

vanzin · 2017-06-21T20:09:55Z

Currently the code monitoring the launch of the client AM uses the value of
spark.yarn.report.interval as the interval for polling the RM; if someone
has set that value to a very large interval, it takes that long to detect
that the client AM has started, which is unexpected.

Instead, have a separate config for the interval to use when the client AM is
starting. The other config is still used in cluster mode, and to detect the
status of the client AM after it is already running.

Tested by running client and cluster mode apps with a modified value of
spark.yarn.report.interval, verifying client AM launch is detected before
that interval elapses.
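
For illustration only, here is a minimal, self-contained sketch of the idea: a polling loop whose interval is a parameter that defaults to the report interval, so that launch-time callers can pass a shorter one. It is not the code in Client.scala; `LaunchMonitorSketch`, `waitForRunning`, `AppState`, and the two interval values are hypothetical names and numbers chosen for the example, and the sleep function is injectable only so the sketch runs without actually waiting.

```scala
object LaunchMonitorSketch {
  // Hypothetical, simplified application states (not YARN's YarnApplicationState).
  sealed trait AppState
  case object Accepted extends AppState
  case object Running extends AppState

  // Assumed values in milliseconds, for illustration only.
  val reportIntervalMs: Long = 60000L        // stands in for spark.yarn.report.interval
  val launchMonitorIntervalMs: Long = 1000L  // stands in for the new launch-time interval

  /**
   * Polls `fetchState` until it returns Running and returns the number of polls made.
   *
   * @param fetchState  asks the (simulated) RM for the current application state
   * @param intervalMs  time to wait between polls; defaults to the report interval
   * @param sleep       how to wait; injectable so tests do not block
   */
  def waitForRunning(
      fetchState: () => AppState,
      intervalMs: Long = reportIntervalMs,
      sleep: Long => Unit = ms => Thread.sleep(ms)): Int = {
    var polls = 1
    while (fetchState() != Running) {
      sleep(intervalMs)
      polls += 1
    }
    polls
  }
}
```

With this shape, a caller waiting for the client AM to start would pass `launchMonitorIntervalMs` explicitly, while callers that omit the argument keep using the report interval.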

SparkQA · 2017-06-21T20:31:23Z

Test build #78406 has finished for PR 18380 at commit a117dd3.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

vanzin · 2017-07-10T17:09:43Z

@tgravescs

tgravescs

minor doc update, otherwise +1

tgravescs · 2017-07-10T18:00:11Z

resource-managers/yarn/src/main/scala/org/apache/spark/deploy/yarn/Client.scala

-      logApplicationReport: Boolean = true): (YarnApplicationState, FinalApplicationStatus) = {
-    val interval = sparkConf.get(REPORT_INTERVAL)
+      logApplicationReport: Boolean = true,
+      interval: Long = sparkConf.get(REPORT_INTERVAL)):


add new param to method description

SparkQA · 2017-07-10T19:16:02Z

Test build #79475 has finished for PR 18380 at commit 47ec7ea.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

tgravescs · 2017-07-10T20:00:13Z

resource-managers/yarn/src/main/scala/org/apache/spark/deploy/yarn/config.scala

    .timeConf(TimeUnit.MILLISECONDS)
    .createWithDefaultString("1s")

+  private[spark] val CLIENT_LAUNCH_MONITOR_INTERVAL =
+    ConfigBuilder("spark.yarn.am.launchMonitorInterval")


Sorry, I missed this in my first pass. One thing here is that normally the spark.yarn.am. configs are configs that apply to the AM; this one is slightly different in that it's how often the client polls the AM for status (so it applies more to the client). We aren't documenting it anyway, but perhaps we should name it something like spark.yarn.clientAMLaunchMonitorInterval?

SparkQA · 2017-07-10T20:45:50Z

Test build #79478 has finished for PR 18380 at commit 4075b44.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

tgravescs · 2017-07-11T14:31:32Z

+1, feel free to commit

vanzin · 2017-07-11T18:25:20Z

Merging to master.

tgravescs reviewed Jul 10, 2017

Add param doc.

47ec7ea

tgravescs reviewed Jul 10, 2017

Change config name.

4075b44

asfgit closed this in 1cad31f Jul 11, 2017

vanzin deleted the SPARK-16019 branch July 11, 2017 18:30
