[SPARK-2299] Consolidate various stageIdTo* hash maps in JobProgressListener #1262

rxin · 2014-06-29T08:34:45Z

This should reduce memory usage for the web ui as well as slightly increase its speed in draining the UI event queue.

@andrewor14

…istener to speed it up.

AmplabJenkins · 2014-06-29T08:35:32Z

Merged build triggered.

AmplabJenkins · 2014-06-29T08:35:40Z

Merged build started.

rxin · 2014-06-29T08:51:03Z

core/src/main/scala/org/apache/spark/ui/jobs/StagePage.scala

@@ -67,28 +61,28 @@ private[ui] class StagePage(parent: JobProgressTab) extends WebUIPage("stage") {
          <ul class="unstyled">
            <li>
              <strong>Total task time across all tasks: </strong>
-              {UIUtils.formatDuration(listener.stageIdToTime.getOrElse(stageId, 0L) + activeTime)}
+              {UIUtils.formatDuration(stageData.executorRunTime)}


Note that I dropped activeTime here (time taken for currently active tasks) because I'm not sure if the extra data structure required to track this is worth the benefit (I don't know if anybody really looks at this ...)

For tasks that have already finished, doesn't activeTime already include executorRunTime? If so, aren't we double counting for those tasks? It seems to me that what it was before was plain wrong, though maybe I'm misunderstanding something.

AmplabJenkins · 2014-06-29T09:22:21Z

Merged build finished. All automated tests passed.

AmplabJenkins · 2014-06-29T09:22:21Z

All automated tests passed.
Refer to this link for build results: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/16240/

andrewor14 · 2014-07-07T17:45:26Z

core/src/main/scala/org/apache/spark/ui/jobs/ExecutorTable.scala

@@ -17,6 +17,8 @@

 package org.apache.spark.ui.jobs

+import org.apache.spark.ui.jobs.UIData.StageUIData
+


nit: mind grouping this with other org.apache.spark imports?

tsudukim · 2014-07-12T01:12:32Z

Should we have only "stageId" as key of these HashMap?
Related to SPARK-2298 at JIRA, I think both stage and attemptId should be included into key in order to discriminate the original stage from re-submitted (attemptId is incremented) stage.

pwendell · 2014-07-16T21:15:55Z

Yeah I think @rxin is going to change this so that we index on both the stage and attempt. Also, we'll need to extend the listener interface to give both the attempt and stage for a task.

Conflicts: core/src/main/scala/org/apache/spark/ui/jobs/ExecutorSummary.scala core/src/main/scala/org/apache/spark/ui/jobs/ExecutorTable.scala core/src/main/scala/org/apache/spark/ui/jobs/JobProgressListener.scala core/src/main/scala/org/apache/spark/ui/jobs/StagePage.scala core/src/main/scala/org/apache/spark/ui/jobs/StageTable.scala

rxin · 2014-07-17T06:38:40Z

I pushed a new version. I'd first merge this and then have a separate PR to index the hash table by stageId + attempt.

Now it includes @kayousterhout's change. Please take another look.

SparkQA · 2014-07-17T06:42:55Z

QA tests have started for PR 1262. This patch merges cleanly.
View progress: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/16768/consoleFull

SparkQA · 2014-07-17T08:21:40Z

QA results for PR 1262:
- This patch PASSES unit tests.
- This patch merges cleanly
- This patch adds no public classes

For more information see test ouptut:
https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/16768/consoleFull

SparkQA · 2014-07-18T00:18:04Z

QA tests have started for PR 1262. This patch merges cleanly.
View progress: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/16795/consoleFull

SparkQA · 2014-07-18T01:56:34Z

QA results for PR 1262:
- This patch PASSES unit tests.
- This patch merges cleanly
- This patch adds no public classes

For more information see test ouptut:
https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/16795/consoleFull

rxin · 2014-07-18T01:58:33Z

Merging in master. Thanks for reviewing.

@andrewor14

…istener This should reduce memory usage for the web ui as well as slightly increase its speed in draining the UI event queue. @andrewor14 Author: Reynold Xin <rxin@apache.org> Closes apache#1262 from rxin/ui-consolidate-hashtables and squashes the following commits: 1ac3f97 [Reynold Xin] Oops. Properly handle description. f5736ad [Reynold Xin] Code review comments. b8828dc [Reynold Xin] Merge branch 'master' into ui-consolidate-hashtables 7a7b6c4 [Reynold Xin] Revert css change. f959bb8 [Reynold Xin] [SPARK-2299] Consolidate various stageIdTo* hash maps in JobProgressListener to speed it up. 63256f5 [Reynold Xin] [SPARK-2320] Reduce <pre> block font size.

rxin added 3 commits June 29, 2014 01:26

[SPARK-2320] Reduce <pre> block font size.

63256f5

[SPARK-2299] Consolidate various stageIdTo* hash maps in JobProgressL…

f959bb8

…istener to speed it up.

Revert css change.

7a7b6c4

rxin reviewed Jun 29, 2014
View reviewed changes

andrewor14 reviewed Jul 7, 2014
View reviewed changes

rxin mentioned this pull request Jul 16, 2014

SPARK-2298: Show stage attempt in UI #1384

Closed

rxin added 2 commits July 16, 2014 22:43

Code review comments.

f5736ad

Oops. Properly handle description.

1ac3f97

asfgit closed this in 72e9021 Jul 18, 2014

rxin deleted the ui-consolidate-hashtables branch July 18, 2014 02:45

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[SPARK-2299] Consolidate various stageIdTo* hash maps in JobProgressListener #1262

[SPARK-2299] Consolidate various stageIdTo* hash maps in JobProgressListener #1262

rxin commented Jun 29, 2014

AmplabJenkins commented Jun 29, 2014

AmplabJenkins commented Jun 29, 2014

rxin Jun 29, 2014

andrewor14 Jul 7, 2014

AmplabJenkins commented Jun 29, 2014

AmplabJenkins commented Jun 29, 2014

andrewor14 Jul 7, 2014

tsudukim commented Jul 12, 2014

pwendell commented Jul 16, 2014

rxin commented Jul 17, 2014

SparkQA commented Jul 17, 2014

SparkQA commented Jul 17, 2014

SparkQA commented Jul 18, 2014

SparkQA commented Jul 18, 2014

rxin commented Jul 18, 2014

		@@ -17,6 +17,8 @@

		package org.apache.spark.ui.jobs

		import org.apache.spark.ui.jobs.UIData.StageUIData

[SPARK-2299] Consolidate various stageIdTo* hash maps in JobProgressListener #1262

[SPARK-2299] Consolidate various stageIdTo* hash maps in JobProgressListener #1262

Conversation

rxin commented Jun 29, 2014

AmplabJenkins commented Jun 29, 2014

AmplabJenkins commented Jun 29, 2014

rxin Jun 29, 2014

Choose a reason for hiding this comment

andrewor14 Jul 7, 2014

Choose a reason for hiding this comment

AmplabJenkins commented Jun 29, 2014

AmplabJenkins commented Jun 29, 2014

andrewor14 Jul 7, 2014

Choose a reason for hiding this comment

tsudukim commented Jul 12, 2014

pwendell commented Jul 16, 2014

rxin commented Jul 17, 2014

SparkQA commented Jul 17, 2014

SparkQA commented Jul 17, 2014

SparkQA commented Jul 18, 2014

SparkQA commented Jul 18, 2014

rxin commented Jul 18, 2014