
[SPARK-24578][Core] Cap sub-region's size of returned nio buffer #21593

Closed
wants to merge 1 commit

Conversation

WenboZhao
Contributor

@WenboZhao WenboZhao commented Jun 19, 2018

What changes were proposed in this pull request?

This PR tries to fix the performance regression introduced by SPARK-21517.

In our production jobs we perform many parallel computations, so it is quite likely that some task gets scheduled to host-2 and needs to read cached block data from host-1. This large transfer often causes the cluster to hit timeouts (it retries 3 times, each with a 120s timeout, and then recomputes the block to put it into the local MemoryStore).

The root cause is that we no longer do `consolidateIfNeeded`, because we now use

Unpooled.wrappedBuffer(chunks.length, getChunks(): _*)

in `ChunkedByteBuffer`. If the buffer consists of many small chunks, `buf.nioBuffer(...)` can perform very badly, since `copyByteBuf(...)` has to be called many times.
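The capping idea can be sketched with plain `java.nio`. Everything below is illustrative, not Spark's actual code: the class and method names, the 256 KB limit, and the buffer sizes are assumptions made for the example; the real fix applies the same cap to the `length` passed to `buf.nioBuffer(...)` on a Netty `ByteBuf`.

```java
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.nio.ByteBuffer;
import java.nio.channels.Channels;
import java.nio.channels.WritableByteChannel;

public class CappedCopy {
    // Illustrative cap on how many bytes we hand to the channel per call.
    static final int NIO_BUFFER_LIMIT = 256 * 1024;

    // Write at most NIO_BUFFER_LIMIT bytes of src to target, advancing src
    // by however many bytes the channel actually accepted.
    static int copyCapped(ByteBuffer src, WritableByteChannel target) throws IOException {
        int length = Math.min(src.remaining(), NIO_BUFFER_LIMIT);
        ByteBuffer slice = src.duplicate();
        slice.limit(slice.position() + length);
        int written = target.write(slice);
        src.position(src.position() + written);
        return written;
    }

    public static void main(String[] args) throws IOException {
        ByteBuffer src = ByteBuffer.wrap(new byte[1_000_000]);
        ByteArrayOutputStream out = new ByteArrayOutputStream();
        WritableByteChannel ch = Channels.newChannel(out);
        while (src.hasRemaining()) {
            copyCapped(src, ch); // each call copies at most 256 KB
        }
        System.out.println(out.size()); // 1000000
    }
}
```

Capping each write keeps the per-call cost bounded regardless of how many components back the source buffer.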

How was this patch tested?

Existing unit tests, plus testing in production.

@squito
Contributor

squito commented Jun 20, 2018

Jenkins, ok to test

Contributor

@squito squito left a comment


lgtm

I made a suggestion for another improvement we could do while we're at it, but it's small and this is a really important fix, so it's OK to leave it.

// SPARK-24578: cap the sub-region's size of returned nio buffer to improve the performance
// for the case that the passed-in buffer has too many components.
int length = Math.min(buf.readableBytes(), NIO_BUFFER_LIMIT);
ByteBuffer buffer = buf.nioBuffer(buf.readerIndex(), length);
Contributor


I think you can go one step further here, and call buf.nioBuffers(int, int) (plural)
https://github.com/netty/netty/blob/4.1/buffer/src/main/java/io/netty/buffer/ByteBuf.java#L2355

that will avoid the copying required to create the merged buffer (though it's a bit complicated, as you have to check for incomplete writes from any single target.write() call).

Also OK to leave this for now as this is a pretty important fix.

Contributor Author


Sure, I will make a follow up PR to address this.

Member


Why not do this in this PR, since it's a small change and we haven't had a new release recently?

Contributor


This PR is fixing a pretty serious issue, we know Wenbo is going to roll this out immediately, and I suspect even more users will. This fix is also "obviously correct" -- the followup here is not super complicated, but it is more prone to bugs. So I'm inclined to just get this in.

Anyway, if @WenboZhao can do the other part today, then sure, but I think we should get this in quickly.

Contributor Author


Thanks @squito and @zsxwing. I would prefer to do it in a different PR with more careful benchmarking and testing. As @squito said, that change is more prone to bugs.

Member


Fair enough.

I just spent a few minutes writing the following code:

  private int copyByteBuf(ByteBuf buf, WritableByteChannel target) throws IOException {
    // SPARK-24578: cap the sub-region's size of returned nio buffer to improve the performance
    // for the case that the passed-in buffer has too many components.
    int length = Math.min(buf.readableBytes(), NIO_BUFFER_LIMIT);
    ByteBuffer[] buffers = buf.nioBuffers(buf.readerIndex(), length);
    int totalWritten = 0;
    for (ByteBuffer buffer : buffers) {
      int remaining = buffer.remaining();
      int written = target.write(buffer);
      totalWritten += written;
      if (written < remaining) {
        // The channel accepted only part of this buffer; stop and let the caller retry later.
        break;
      }
    }
    buf.skipBytes(totalWritten);
    return totalWritten;
  }

Feel free to use it in your follow-up PR.

Contributor


@WenboZhao did you ever follow up on this, or at least file another JIRA for it? Sorry if I missed it.

Contributor


For anybody watching this: SPARK-25115 was eventually opened (it currently has a PR).

@SparkQA

SparkQA commented Jun 20, 2018

Test build #92114 has finished for PR 21593 at commit a30d4de.

  • This patch fails due to an unknown error code, -9.
  • This patch merges cleanly.
  • This patch adds no public classes.

@cloud-fan
Contributor

retest this please

@SparkQA

SparkQA commented Jun 20, 2018

Test build #92122 has finished for PR 21593 at commit a30d4de.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@gatorsmile
Member

cc @zsxwing @JoshRosen

@zsxwing
Member

zsxwing commented Jun 20, 2018

Thanks! Merging to master and 2.3.

asfgit pushed a commit that referenced this pull request Jun 20, 2018
Author: Wenbo Zhao <wzhao@twosigma.com>

Closes #21593 from WenboZhao/spark-24578.

(cherry picked from commit 3f4bda7)
Signed-off-by: Shixiong Zhu <zsxwing@gmail.com>
@asfgit asfgit closed this in 3f4bda7 Jun 20, 2018
curtishoward pushed a commit to twosigma/spark that referenced this pull request Jun 20, 2018
curtishoward pushed a commit to twosigma/spark that referenced this pull request Jun 20, 2018
@nebi-frame

Which release has this commit?

@Anubisxcw

> which release does have this commit?

Since 2.3.2.
https://issues.apache.org/jira/browse/SPARK-24578
