Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[SPARK-16827] Stop reporting spill metrics as shuffle metrics #15347

Closed
wants to merge 1 commit into from

Conversation

bchocho
Copy link
Contributor

@bchocho bchocho commented Oct 4, 2016

What changes were proposed in this pull request?

Fix a bug where spill metrics were being reported as shuffle metrics. Eventually these spill metrics should be reported (SPARK-3577), but separate from shuffle metrics. The fix itself basically reverts the line to what it was in 1.6.

How was this patch tested?

Tested on a job that was reporting shuffle writes even for the final stage, when no shuffle writes should take place. After the change the job no longer shows these writes.

Before:
screen shot 2016-10-03 at 6 39 59 pm

After:
screen shot 2016-10-03 at 11 44 44 pm

@rxin
Copy link
Contributor

rxin commented Oct 4, 2016

Can you also work on reporting spill metrics?

@bchocho
Copy link
Contributor Author

bchocho commented Oct 4, 2016

Sure, having the actual spill metrics is something we're interested in as well. I'd like to work on it, but I might not get to it immediately.

@rxin
Copy link
Contributor

rxin commented Oct 4, 2016

Jenkins, add to whitelist.

@SparkQA
Copy link

SparkQA commented Oct 4, 2016

Test build #3296 has finished for PR 15347 at commit 92f3b89.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

this.writeMetrics = taskContext.taskMetrics().shuffleWriteMetrics();
// The spill metrics are stored in a new ShuffleWriteMetrics, and then discarded (this fixes SPARK-16827).
// TODO: Instead, separate spill metrics should be stored and reported (tracked in SPARK-3577).
this.writeMetrics = new ShuffleWriteMetrics();
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

so this isn't actually used anywhere right now? Is that what the TODO is about?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yes, that's correct.

@andrewor14
Copy link
Contributor

OK, this change by itself LGTM. @dafrista would you mind creating a separate JIRA (or point me to an existing one) about the TODO then? Merging this into master

@asfgit asfgit closed this in e56614c Oct 7, 2016
@bchocho
Copy link
Contributor Author

bchocho commented Oct 7, 2016

@andrewor14 sure I think the one mentioned in the TODO covers it (SPARK-3577)

@rxin
Copy link
Contributor

rxin commented Oct 12, 2016

@dafrista can you create a backport for branch-2.0?

@bchocho
Copy link
Contributor Author

bchocho commented Oct 12, 2016

@rxin Sure, I added the PR #15455

@dreamworks007
Copy link

@rxin , "Can you also work on reporting spill metrics?" --> here , you mean reporting spill time metrics , right ? Since I think write spill bytes is already included - https://github.com/facebook/FB-Spark/blob/fb-2.0/core/src/main/java/org/apache/spark/util/collection/unsafe/sort/UnsafeExternalSorter.java#L232

asfgit pushed a commit that referenced this pull request Oct 13, 2016
…trics

## What changes were proposed in this pull request?

Fix a bug where spill metrics were being reported as shuffle metrics. Eventually these spill metrics should be reported (SPARK-3577), but separate from shuffle metrics. The fix itself basically reverts the line to what it was in 1.6.

## How was this patch tested?

Cherry-picked from master (#15347)

Author: Brian Cho <bcho@fb.com>

Closes #15455 from dafrista/shuffle-metrics-2.0.
uzadude pushed a commit to uzadude/spark that referenced this pull request Jan 27, 2017
## What changes were proposed in this pull request?

Fix a bug where spill metrics were being reported as shuffle metrics. Eventually these spill metrics should be reported (SPARK-3577), but separate from shuffle metrics. The fix itself basically reverts the line to what it was in 1.6.

## How was this patch tested?

Tested on a job that was reporting shuffle writes even for the final stage, when no shuffle writes should take place. After the change the job no longer shows these writes.

Before:
![screen shot 2016-10-03 at 6 39 59 pm](https://cloud.githubusercontent.com/assets/1514239/19085897/dbf59a92-8a20-11e6-9f68-a978860c0d74.png)

After:
<img width="1052" alt="screen shot 2016-10-03 at 11 44 44 pm" src="https://cloud.githubusercontent.com/assets/1514239/19085903/e173a860-8a20-11e6-85e3-d47f9835f494.png">

Author: Brian Cho <bcho@fb.com>

Closes apache#15347 from dafrista/shuffle-metrics.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

5 participants