Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[SPARK-8405][Doc] Add how to view logs on Web UI when yarn log aggregation is enabled #7463

Closed
wants to merge 4 commits into from

Conversation

carsonwang
Copy link
Contributor

Some users may not be aware that the logs are available on Web UI even if Yarn log aggregation is enabled. Update the doc to make this clear and what need to be configured.

@SparkQA
Copy link

SparkQA commented Jul 17, 2015

Test build #37617 has finished for PR 7463 at commit e5775c1.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@@ -68,9 +68,9 @@ In YARN terminology, executors and application masters run inside "containers".

yarn logs -applicationId <app ID>

will print out the contents of all log files from all containers from the given application. You can also view the container log files directly in HDFS using the HDFS shell or API. The directory where they are located can be found by looking at your YARN configs (`yarn.nodemanager.remote-app-log-dir` and `yarn.nodemanager.remote-app-log-dir-suffix`).
will print out the contents of all log files from all containers from the given application. You can also view the container log files directly in HDFS using the HDFS shell or API. The directory where they are located can be found by looking at your YARN configs (`yarn.nodemanager.remote-app-log-dir` and `yarn.nodemanager.remote-app-log-dir-suffix`). The logs are also available on Web UI. You need have both the Spark history server and the MR history server running and configure `yarn.log.server.url` in yarn-site.xml properly. The log url on the Spark history server UI will redirect you to the MR history server to show the aggregated logs.
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Minor text updates: MR history server -> MapReduce History Server. I'd hyperlink one of them too. url -> URL. back-tick yarn-site.xml It would be nice to link through to YARN docs that would describe these parameters too, if possible.

@carsonwang
Copy link
Contributor Author

Thanks @srowen . I updated the text. But I didn't find proper docs for MapReduce job history server and the yarn.log.server.url parameter on apache site. So I didn't add links. If a user search these, he will find necessary information on other websites.

@SparkQA
Copy link

SparkQA commented Jul 20, 2015

Test build #37789 has finished for PR 7463 at commit 5a95046.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@@ -68,9 +68,9 @@ In YARN terminology, executors and application masters run inside "containers".

yarn logs -applicationId <app ID>

will print out the contents of all log files from all containers from the given application. You can also view the container log files directly in HDFS using the HDFS shell or API. The directory where they are located can be found by looking at your YARN configs (`yarn.nodemanager.remote-app-log-dir` and `yarn.nodemanager.remote-app-log-dir-suffix`).
will print out the contents of all log files from all containers from the given application. You can also view the container log files directly in HDFS using the HDFS shell or API. The directory where they are located can be found by looking at your YARN configs (`yarn.nodemanager.remote-app-log-dir` and `yarn.nodemanager.remote-app-log-dir-suffix`). The logs are also available on Web UI. You need have both the Spark history server and the MapReduce history server running and configure `yarn.log.server.url` in `yarn-site.xml` properly. The log URL on the Spark history server UI will redirect you to the MapReduce history server to show the aggregated logs.
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

"The logs are also available on Web UI" is pretty generic. Can we make this more specific like on the Spark Web UI under the Executors Tab

@carsonwang
Copy link
Contributor Author

Sounds great, @tgravescs . The text was updated as you suggested.

@SparkQA
Copy link

SparkQA commented Jul 22, 2015

Test build #38044 has finished for PR 7463 at commit 74df3a1.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@@ -68,9 +68,9 @@ In YARN terminology, executors and application masters run inside "containers".

yarn logs -applicationId <app ID>

will print out the contents of all log files from all containers from the given application. You can also view the container log files directly in HDFS using the HDFS shell or API. The directory where they are located can be found by looking at your YARN configs (`yarn.nodemanager.remote-app-log-dir` and `yarn.nodemanager.remote-app-log-dir-suffix`).
will print out the contents of all log files from all containers from the given application. You can also view the container log files directly in HDFS using the HDFS shell or API. The directory where they are located can be found by looking at your YARN configs (`yarn.nodemanager.remote-app-log-dir` and `yarn.nodemanager.remote-app-log-dir-suffix`). The logs are also available on the Spark Web UI under the Executors Tab. You need have both the Spark history server and the MapReduce history server running and configure `yarn.log.server.url` in `yarn-site.xml` properly. The log URL on the Spark history server UI will redirect you to the MapReduce history server to show the aggregated logs.
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

can you change "You need have" to "You need to" have"

@tgravescs
Copy link
Contributor

one minor nit, otherwise looks good. @srowen do you have any other comments, I want to not sure your previous comment was addressed?

@srowen
Copy link
Member

srowen commented Jul 22, 2015

OK by me

@carsonwang
Copy link
Contributor Author

Sorry for the delay. Updated!

@SparkQA
Copy link

SparkQA commented Jul 25, 2015

Test build #38427 has finished for PR 7463 at commit 274c054.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@tgravescs
Copy link
Contributor

+1. Thanks!

asfgit pushed a commit that referenced this pull request Jul 27, 2015
…gation is enabled

Some users may not be aware that the logs are available on Web UI even if Yarn log aggregation is enabled. Update the doc to make this clear and what need to be configured.

Author: Carson Wang <carson.wang@intel.com>

Closes #7463 from carsonwang/YarnLogDoc and squashes the following commits:

274c054 [Carson Wang] Minor text fix
74df3a1 [Carson Wang] address comments
5a95046 [Carson Wang] Update the text in the doc
e5775c1 [Carson Wang] Update doc about how to view the logs on Web UI when yarn log aggregation is enabled

(cherry picked from commit 6228381)
Signed-off-by: Tom Graves <tgraves@yahoo-inc.com>
@asfgit asfgit closed this in 6228381 Jul 27, 2015
@carsonwang carsonwang deleted the YarnLogDoc branch August 17, 2015 01:22
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
4 participants