Fixed resources used/wasted computation for spark jobs - (Depends on Custom SHS - Requires peakJvmUsedMemory metric) #287
Conversation
val totalExecutorTaskTimeMillis = totalExecutorTaskTimeMillisOf(data)

val resourcesAllocatedForUse =
  aggregateresourcesAllocatedForUse(executorInstances, executorMemoryBytes, applicationDurationMillis)
Allocated resources should take care of the dynamic allocation.
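One way to read this review comment: under dynamic allocation, multiplying `executorInstances` by the full application duration overstates the allocation, since executors come and go. A minimal sketch of the per-executor alternative, where `ExecutorSummarySketch` and `totalDurationMillis` are hypothetical stand-ins for the real executor summary type, not the PR's actual identifiers:

```scala
// Hypothetical sketch: account for dynamic allocation by summing each
// executor's own active time instead of assuming all executors live for
// the whole application. Units are byte-milliseconds.
final case class ExecutorSummarySketch(totalDurationMillis: Long)

def resourcesAllocatedForUse(
    executorSummaries: Seq[ExecutorSummarySketch],
    executorMemoryBytes: Long): BigInt =
  executorSummaries
    .map(s => BigInt(executorMemoryBytes) * BigInt(s.totalDurationMillis))
    .sum
```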
var sumResourceUsage: BigInt = 0
executorSummaries.foreach(
  executorSummary => {
    var memUsed: Long = executorSummary.peakJvmUsedMemory.getOrElse(JVM_USED_MEMORY, 0) // + MemoryFormatUtils.stringToBytes("300M")
We might want to add a buffer on top of the peak memory.
The variable `resourcesActuallyUsedWithBuffer` (line 64) is the resource usage with the buffer included, and that value is what is used when calculating wasted resources.
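The buffer idea discussed above can be sketched as follows; the constant mirrors the commented-out `300M` value in the diff, and the helper name is hypothetical rather than the PR's actual identifier:

```scala
// Hypothetical sketch: pad peak JVM used memory with a fixed safety buffer
// (300 MB here, matching the commented-out value in the diff) before using
// it to compute wasted resources.
val BufferBytes: Long = 300L * 1024 * 1024

def peakMemoryWithBuffer(peakJvmUsedMemoryBytes: Long): Long =
  peakJvmUsedMemoryBytes + BufferBytes
```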
Force-pushed from 7f7717f to fc9b866
I think it would be good if you could provide a brief description of this PR. It helps with the review.
@shkhrgpt: Done
Peak JVM Used Memory is not part of the upstream public Spark Release. Otherwise this looks good.
Force-pushed from b81a2b0 to 9dfcc1e
…Custom SHS - Requires peakJvmUsedMemory metric) (#287)
Fixed the calculation of resources used/wasted. Resources used is now calculated by summing the resources actually used by each executor, instead of assuming each executor uses all of the memory allocated to it. Resources allocated for use is calculated by multiplying each executor's active time by the memory allocated per executor, then summing across executors.
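The description above can be sketched end to end as follows. This is an illustrative reading of the PR description under assumed names (`ExecSummary`, `peakJvmUsedMemoryBytes`, `totalDurationMillis`), not the PR's actual implementation:

```scala
// Hypothetical sketch of the corrected computation, in byte-milliseconds:
// used      = sum over executors of (peak memory actually used * active time)
// allocated = sum over executors of (memory allocated per executor * active time)
// wasted    = allocated - used
final case class ExecSummary(peakJvmUsedMemoryBytes: Long, totalDurationMillis: Long)

def resourcesUsed(execs: Seq[ExecSummary]): BigInt =
  execs.map(e => BigInt(e.peakJvmUsedMemoryBytes) * BigInt(e.totalDurationMillis)).sum

def resourcesAllocated(execs: Seq[ExecSummary], executorMemoryBytes: Long): BigInt =
  execs.map(e => BigInt(executorMemoryBytes) * BigInt(e.totalDurationMillis)).sum

def resourcesWasted(execs: Seq[ExecSummary], executorMemoryBytes: Long): BigInt =
  resourcesAllocated(execs, executorMemoryBytes) - resourcesUsed(execs)
```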