[YARN] SPARK-2668: Add variable of yarn log directory for reference from the log4j configuration #1573

renozhang · 2014-07-24T13:49:08Z

Assign the YARN container log directory to the Java option "spark.yarn.app.container.log.dir", so that a user-defined log4j.properties can reference this value and write logs to the YARN container's log directory.
Otherwise, a user-defined file appender writes only to the container's CWD, and log files in the CWD are neither displayed in the YARN UI nor aggregated to the HDFS log directory after the job finishes.

User defined log4j.properties reference example:
log4j.appender.rolling_file.File = ${spark.yarn.app.container.log.dir}/spark.log
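
For context, a fuller log4j.properties fragment built around that line might look like the sketch below. It assumes log4j 1.x and its RollingFileAppender; the appender name "rolling_file" follows the example above, while the log level, size limit, backup count, and pattern are illustrative values that are not part of this PR.

```properties
# Illustrative sketch only: values below are assumptions, not taken from the patch.
log4j.rootLogger=INFO, rolling_file

# Roll the log file by size so long-running jobs do not grow a single file without bound.
log4j.appender.rolling_file=org.apache.log4j.RollingFileAppender
# The property set by this patch; log4j substitutes it from the JVM system properties.
log4j.appender.rolling_file.File=${spark.yarn.app.container.log.dir}/spark.log
log4j.appender.rolling_file.MaxFileSize=50MB
log4j.appender.rolling_file.MaxBackupIndex=5
log4j.appender.rolling_file.layout=org.apache.log4j.PatternLayout
log4j.appender.rolling_file.layout.ConversionPattern=%d{yy/MM/dd HH:mm:ss} %p %c{1}: %m%n
```

With the file written under the container log directory, YARN should be able to display it in the UI and include it in log aggregation, which is the behavior described above.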

SparkQA · 2014-07-24T13:53:24Z

QA tests have started for PR 1573. This patch merges cleanly.
View progress: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/17119/consoleFull

SparkQA · 2014-07-24T14:37:35Z

QA results for PR 1573:
- This patch PASSES unit tests.
- This patch merges cleanly
- This patch adds no public classes

For more information see test output:
https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/17119/consoleFull

tgravescs · 2014-09-09T15:29:57Z

@renozhang sorry for the delay on this, could you upmerge to the latest?

renozhang · 2014-09-11T03:36:07Z

@tgravescs I've updated to the latest, thanks for the review.

tgravescs · 2014-09-11T14:16:46Z

yarn/common/src/main/scala/org/apache/spark/deploy/yarn/ClientBase.scala

can you rename this to be spark.yarn.app.container.log.dir

Can you also update the documentation in docs/running-on-yarn.md to have this config and a good description.

renozhang · 2014-09-12T07:26:55Z

Sorry, I forgot to write a description in the PR. I'll fill it in later.

As mentioned in the description of JIRA SPARK-2668:

This variable is added so that users can reference it in a custom log4j.properties, e.g.:

log4j.appender.rolling_file.File = ${spark.yarn.log.dir}/spark.log

From: andrewor14 <notifications@github.com>
Reply-To: apache/spark <reply@reply.github.com>
Date: Thu, 11 Sep 2014 18:08:37 -0700
To: apache/spark <spark@noreply.github.com>
Cc: Peng <peng.zhang@xiaomi.com>
Subject: Re: [spark] [YARN] SPARK-2668: Add variable of yarn log directory for reference from the log4j configuration (#1573)

Where is this config being consumed? Am I missing something obvious?


andrewor14 · 2014-09-12T18:14:58Z

Yup, thanks for answering my deleted question. I realized this afterwards after reading the JIRA.

tgravescs · 2014-09-18T14:18:52Z

@renozhang can you address my comments

renozhang · 2014-09-19T02:58:43Z

Sorry @tgravescs, I've been very busy these days. I'll address them this weekend.

SparkQA · 2014-09-21T12:34:22Z

QA tests have started for PR 1573 at commit c56aba6.

This patch merges cleanly.

renozhang · 2014-09-21T12:37:40Z

@tgravescs patch updated, thanks for your review.

SparkQA · 2014-09-21T13:42:06Z

QA tests have finished for PR 1573 at commit c56aba6.

This patch passes unit tests.
This patch merges cleanly.
This patch adds no public classes.

tgravescs · 2014-09-22T14:53:33Z

docs/running-on-yarn.md

Since this could be used in more than just streaming applications, perhaps we should reword this a little. Perhaps put the information about ${spark.yarn.app.container.log.dir} first and then give the example using RollingFileAppender with streaming.

Something more like: (note feel free to change the exact wording)

If you need a reference to the proper location to put log files in YARN so that YARN can properly display and aggregate them, use "${spark.yarn.app.container.log.dir}" in your log4j.properties. For example... (then explain the streaming example).

tgravescs · 2014-09-22T14:54:37Z

thanks @renozhang, minor request about the documentation, otherwise looks good.

SparkQA · 2014-09-23T08:54:19Z

QA tests have started for PR 1573 at commit 16c5cb8.

This patch merges cleanly.

SparkQA · 2014-09-23T09:46:02Z

QA tests have finished for PR 1573 at commit 16c5cb8.

This patch fails unit tests.
This patch merges cleanly.
This patch adds no public classes.

SparkQA · 2014-09-23T09:46:06Z

Test FAILed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/20697/

tgravescs · 2014-09-23T13:06:54Z

The test failure is unrelated to this PR.

tgravescs · 2014-09-23T13:46:24Z

+1 thanks @renozhang !


renozhang changed the title ~~[YARN] SPARK-2668: Support log4j log to yarn container dir~~ [YARN] SPARK-2668: Add variable of yarn log diectory to reference from the log4j configuration Jul 25, 2014

renozhang changed the title ~~[YARN] SPARK-2668: Add variable of yarn log diectory to reference from the log4j configuration~~ [YARN] SPARK-2668: Add variable of yarn log directory to reference from the log4j configuration Jul 25, 2014

renozhang changed the title ~~[YARN] SPARK-2668: Add variable of yarn log directory to reference from the log4j configuration~~ [YARN] SPARK-2668: Add variable of yarn log directory for reference from the log4j configuration Jul 25, 2014

renozhang force-pushed the yarn-log-dir branch from 0b4028a to f70f581 Compare September 10, 2014 08:01

tgravescs reviewed Sep 11, 2014

renozhang force-pushed the yarn-log-dir branch from f70f581 to c56aba6 Compare September 21, 2014 12:30

tgravescs reviewed Sep 22, 2014

renozhang force-pushed the yarn-log-dir branch from c56aba6 to 16c5cb8 Compare September 23, 2014 08:48

renozhang added 3 commits September 23, 2014 16:49

Support log4j log to yarn container dir

503ea2d

Change variable's name, and update running-on-yarn.md

f2b5e2a

Update doc

16c5cb8

asfgit closed this in 14f8c34 Sep 23, 2014

renozhang deleted the yarn-log-dir branch June 28, 2016 10:02

4 participants