
OutOfDirectMemoryError for large uploads using HttpPostMultipartRequestDecoder #10973

Closed
danielflower opened this issue Jan 28, 2021 · 7 comments · Fixed by #10989

@danielflower

Expected behavior

With an HttpDataFactory that has useDisk=true, I thought files of any size could potentially be uploaded.
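
For context, DefaultHttpDataFactory can be configured for pure disk mode or for mixed mode; the 16 KiB threshold below is just an illustrative value:

HttpDataFactory onDisk = new DefaultHttpDataFactory(true);      // always stream uploads to disk
HttpDataFactory mixed = new DefaultHttpDataFactory(16 * 1024);  // keep in memory up to 16 KiB, then switch to disk

With either of these, the decoder should not need to hold the whole upload in memory at once.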

Actual behavior

Example error from the unit test below:

io.netty.util.internal.OutOfDirectMemoryError: failed to allocate 4194304 byte(s) of direct memory (used: 62914560, max: 64487424)
	at io.netty.util.internal.PlatformDependent.incrementMemoryCounter(PlatformDependent.java:775)
	at io.netty.util.internal.PlatformDependent.reallocateDirectNoCleaner(PlatformDependent.java:748)
	at io.netty.buffer.UnpooledUnsafeNoCleanerDirectByteBuf.reallocateDirect(UnpooledUnsafeNoCleanerDirectByteBuf.java:34)
	at io.netty.buffer.UnpooledByteBufAllocator$InstrumentedUnpooledUnsafeNoCleanerDirectByteBuf.reallocateDirect(UnpooledByteBufAllocator.java:194)
	at io.netty.buffer.UnpooledUnsafeNoCleanerDirectByteBuf.capacity(UnpooledUnsafeNoCleanerDirectByteBuf.java:52)
	at io.netty.buffer.AbstractByteBuf.ensureWritable0(AbstractByteBuf.java:307)
	at io.netty.buffer.AbstractByteBuf.ensureWritable(AbstractByteBuf.java:282)
	at io.netty.buffer.AbstractByteBuf.writeBytes(AbstractByteBuf.java:1105)
	at io.netty.buffer.AbstractByteBuf.writeBytes(AbstractByteBuf.java:1098)
	at io.netty.buffer.AbstractByteBuf.writeBytes(AbstractByteBuf.java:1089)
	at io.netty.handler.codec.http.multipart.HttpPostMultipartRequestDecoder.offer(HttpPostMultipartRequestDecoder.java:351)
	at NettyUploadTest.itCanProcessLargeFiles(NettyUploadTest.java:46)
// snip

Steps to reproduce

Set -Xmx64m and run the unit test below.

Minimal yet complete reproducer code (or URL to code)

import io.netty.buffer.ByteBuf;
import io.netty.buffer.Unpooled;
import io.netty.handler.codec.http.*;
import io.netty.handler.codec.http.multipart.DefaultHttpDataFactory;
import io.netty.handler.codec.http.multipart.FileUpload;
import io.netty.handler.codec.http.multipart.HttpDataFactory;
import io.netty.handler.codec.http.multipart.HttpPostMultipartRequestDecoder;
import org.junit.Test;

import java.nio.charset.StandardCharsets;
import java.util.Arrays;

import static org.hamcrest.MatcherAssert.assertThat;
import static org.hamcrest.Matchers.is;

public class NettyUploadTest {

    @Test
    public void itCanProcessLargeFiles() throws Exception {

        int fileSize = 100_000_000; // set Xmx to a number lower than this and it crashes
        int bytesPerChunk = 1_000_000;

        String prefix = "--861fbeab-cd20-470c-9609-d40a0f704466\n" +
            "Content-Disposition: form-data; name=\"image\"; filename=\"guangzhou.jpeg\"\n" +
            "Content-Type: image/jpeg\n" +
            "Content-Length: " + fileSize + "\n" +
            "\n";

        String suffix = "\n" +
            "--861fbeab-cd20-470c-9609-d40a0f704466--\n";

        HttpRequest request = new DefaultHttpRequest(HttpVersion.HTTP_1_1, HttpMethod.POST, "/upload");
        request.headers().set("content-type", "multipart/form-data; boundary=861fbeab-cd20-470c-9609-d40a0f704466");
        request.headers().set("content-length", prefix.length() + fileSize + suffix.length());

        HttpDataFactory factory = new DefaultHttpDataFactory(true);
        HttpPostMultipartRequestDecoder decoder = new HttpPostMultipartRequestDecoder(factory, request);
        decoder.offer(new DefaultHttpContent(Unpooled.wrappedBuffer(prefix.getBytes(StandardCharsets.UTF_8))));

        byte[] body = new byte[bytesPerChunk];
        Arrays.fill(body, (byte)1);
        for (int i = 0; i < fileSize / bytesPerChunk; i++) {
            ByteBuf content = Unpooled.wrappedBuffer(body, 0, bytesPerChunk);
            decoder.offer(new DefaultHttpContent(content)); // OutOfDirectMemoryError thrown here
            content.release();
        }

        decoder.offer(new DefaultHttpContent(Unpooled.wrappedBuffer(suffix.getBytes(StandardCharsets.UTF_8))));
        decoder.offer(new DefaultLastHttpContent());
        FileUpload data = (FileUpload) decoder.getBodyHttpDatas().get(0);
        assertThat((int)data.length(), is(fileSize));
        assertThat(data.get().length, is(fileSize));

        factory.cleanAllHttpData();

    }

}

Netty version

Tested on 4.1.56 and 4.1.58.

JVM version (e.g. java -version)

jdk1.8.0_162 and 12

OS version (e.g. uname -a)

Windows 10

@franz1981
Contributor

@fredericBregier Could it be related to #10623, given that right now the file isn't written until a delimiter is found?
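
If that is the case, the failure mode would be the cumulation buffer growing to the full file size before anything is flushed, regardless of useDisk. Roughly (a simplified sketch of the suspected pattern, not the actual decoder code; findDelimiter and flushToHttpData are placeholder names):

// every offered chunk is appended to one cumulation buffer...
undecodedChunk.writeBytes(content.content());
// ...but nothing reaches the (possibly disk-backed) HttpData until the
// closing delimiter is found, so a 100 MB upload needs ~100 MB of direct memory
if (findDelimiter(undecodedChunk)) {
    flushToHttpData(undecodedChunk);
}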

@normanmaurer
Member

I think so too... I think we should just revert the commit in question for now: while it fixes some "perf issues" when running with paranoid leak detection, I don't think it has any other benefits in real-world use cases. @fredericBregier WDYT?

@fredericBregier
Member

Hi, the issue is only partially related, but it is still an issue.

  • the file is kept entirely in the buffer while reading: previously it was also in memory, but chunk by chunk
  • once the file is complete, if the "disk" based HttpData is used, the buffer is written to a temporary file, therefore freeing the memory
  • note that another bug was fixed at the same time, since previously buffers were freed (discardReadBytes) wrongly

I agree that this should be changed to adapt the solution to the new way of finding the delimiter. Perhaps this?
Currently:

If it is written (in Disk mode, or in Mixed mode when the size is greater than the limit), the buffer might be cleared (freed), so one could try to change this such that:

Not sure it will work or will be easy to implement, but I think the idea is there...
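
A minimal sketch of that idea (placeholder names such as safeLength, not the real HttpPostMultipartRequestDecoder internals): flush what has already been scanned to the HttpData and reclaim the consumed bytes, instead of waiting for the delimiter:

// hand over everything before a possible partial delimiter, then shrink the buffer
ByteBuf safe = undecodedChunk.readRetainedSlice(safeLength(undecodedChunk));
currentFileUpload.addContent(safe, false); // addContent takes ownership and releases
undecodedChunk.discardReadBytes();         // give the consumed memory back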

@fredericBregier
Member

Hi all, I propose a fix for this (adding a test inspired by the one given, testing both the in-Memory and on-Disk behaviors)

normanmaurer added a commit that referenced this issue Feb 3, 2021
…oder to e5951d4

Motivation:

The changes introduced in 1c23040 caused various issues, while the fix itself is not considered critical. For now it is considered best to just roll back and investigate more.

Modifications:

Revert changes done in 1c23040 (and later) for
the post decoders.

Result:

Fixes #10973
fredericBregier added a commit to fredericBregier/netty that referenced this issue Feb 3, 2021
Motivation:

While improvements were made to delimiter finding, memory was exhausted when HttpDatas did not fit
in memory while using Mixed or Disk mode in the HttpDataFactory.

Modifications:

Change the way `loadDataMultipart` tries to add contents. Instead of waiting to find the delimiter,
it will add the current buffer and, if possible, will try to reuse the currently allocated buffer.
As the chunk is a retained buffer from the original one, if the `refCnt()` is 1 after the
`addContent`, then it can be "virtually" released (by changing the reader and writer indexes).

Note: if the HttpDataFactory is in Memory only, this will not prevent OOME since the full
data will be kept in memory.

Result:

Now the memory consumption in Mixed or Disk mode is kept to the minimum.

A test is added to check both the in-Memory and on-Disk behaviors.
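
The buffer reuse described above might look roughly like this (a sketch with assumed names; the actual change lives in HttpPostMultipartRequestDecoder):

ByteBuf chunk = undecodedChunk.retainedSlice(offset, length);
httpData.addContent(chunk, false); // addContent releases the retained slice
if (undecodedChunk.refCnt() == 1) {
    // no outstanding slice points into the buffer anymore, so it can be
    // "virtually" released: rewind the indexes and reuse the allocation
    undecodedChunk.setIndex(0, 0);
}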
normanmaurer added a commit that referenced this issue Feb 3, 2021
…oder to e5951d4 (#10989)

Motivation:

The changes introduced in 1c23040 caused various issues, while the fix itself is not considered critical. For now it is considered best to just roll back and investigate more.

Modifications:

- Revert changes done in 1c23040 (and later) for
the post decoders.
- Ensure we give memory back to the system as soon as possible in a safe manner

Result:

Fixes #10973
normanmaurer added this to the 4.1.59.Final milestone Feb 3, 2021
@danielflower
Author

Thanks to both of you. The new version is working well now.

The performance degradation with paranoid detection is very noticeable (e.g. a test that moves a lot of data took 1 second on 4.1.58 and takes 30 seconds on 4.1.59). For a while I thought there was a performance issue in the new version, before I remembered what Frederic said about leak detection performance.

So if anyone else is experiencing this and wondering what is going on, try disabling PARANOID for the slow tests.
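
The level can be lowered either with a JVM flag or programmatically:

// JVM flag: -Dio.netty.leakDetection.level=simple
// or in code, before any buffers are allocated:
ResourceLeakDetector.setLevel(ResourceLeakDetector.Level.SIMPLE);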

@fredericBregier
Member

@danielflower or maybe wait for #11001, which also fixes this PARANOID issue and improves performance (by about 4x) even without that level... ;-)

@chrisvest
Contributor

The #11001 PR is merged now.
