
Aggressively remove PoolThreadCache references from its finalizer object #14155

Merged — 1 commit merged into netty:4.1 on Jul 9, 2024

Conversation

@franz1981 (Contributor) commented Jul 2, 2024

Motivation:
If a cache's FastThreadLocalThread owner wins the race to remove the cache, its finalizer (kept for debugging purposes) will still retain a strong reference to it, causing a few classes to leak (and eventually, their ClassLoader). Although we cannot stop finalizers from waiting for the finalization pass, we can reduce the memory footprint of "leaked" instances before finalization happens.

Modification:
Non-debug early cache removal now clears the cache's strong reference within FreeOnFinalize, making it an empty shell, eligible for GC.

Result:
Smaller memory footprint while waiting for finalization to happen
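The pattern described above can be sketched in plain Java. This is a hypothetical model, not Netty's actual FreeOnFinalize implementation: the finalizer object holds the only strong reference to a heavy cache (modeled here as a byte array), and clearing that reference early leaves an empty shell awaiting finalization while the cache itself becomes GC-eligible immediately.

```java
import java.util.concurrent.atomic.AtomicReference;

public class FreeOnFinalizeSketch {

    // Hypothetical stand-in for FreeOnFinalize: holds the only strong
    // reference to the cache so that finalize() can free it if the owner
    // thread never does.
    static final class CacheShell {
        // In Netty this would be the PoolThreadCache; modeled as a byte[].
        private final AtomicReference<byte[]> cache =
                new AtomicReference<>(new byte[1024 * 1024]);

        // Early-removal path: the owner thread won the race, so drop the
        // strong reference instead of waiting for finalize() to run.
        byte[] clearEarly() {
            return cache.getAndSet(null);
        }

        @Override
        protected void finalize() {
            // Slow path: only frees the cache if the owner never did.
            cache.getAndSet(null);
        }
    }

    public static void main(String[] args) {
        CacheShell shell = new CacheShell();
        System.out.println("freed early: " + (shell.clearEarly() != null));
        // Second clear is a no-op: the shell is already empty.
        System.out.println("second clear: " + (shell.clearEarly() != null));
    }
}
```

The `AtomicReference.getAndSet(null)` makes the clear race-safe between the owner thread and the finalizer thread, which is the core of the fix: whoever loses the race sees null and does nothing.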


franz1981 commented Jul 2, 2024

@normanmaurer @chrisvest It would be great if we could just let FastThreadLocalThreads with cleanupFastThreadLocals == true skip finalizers entirely, trusting them, but maybe I'm oversimplifying and there's still a reason to use finalizers for those. Wdyt?
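The opt-in being proposed could look roughly like the following. This is an illustrative sketch only: the `FastThreadLocalThread` stub, `willCleanupFastThreadLocals()` accessor and `needsFinalizerGuard` helper are modeled here, not taken from Netty's internals.

```java
public class SkipFinalizerSketch {

    // Minimal stand-in for Netty's FastThreadLocalThread.
    static class FastThreadLocalThread extends Thread {
        private final boolean cleanup;

        FastThreadLocalThread(Runnable r, boolean cleanup) {
            super(r);
            this.cleanup = cleanup;
        }

        // True when the thread promises to remove its own
        // FastThreadLocals on exit.
        boolean willCleanupFastThreadLocals() {
            return cleanup;
        }
    }

    // Decide whether a finalizer guard is needed for the given thread:
    // a trusted thread cleans up deterministically on exit, so when
    // finalizers are disabled it can skip the guard object entirely.
    static boolean needsFinalizerGuard(Thread t, boolean finalizersEnabled) {
        if (!finalizersEnabled
                && t instanceof FastThreadLocalThread
                && ((FastThreadLocalThread) t).willCleanupFastThreadLocals()) {
            return false;
        }
        return true;
    }

    public static void main(String[] args) {
        Thread trusted = new FastThreadLocalThread(() -> { }, true);
        Thread plain = new Thread(() -> { });
        System.out.println("trusted needs guard: " + needsFinalizerGuard(trusted, false));
        System.out.println("plain needs guard: " + needsFinalizerGuard(plain, false));
    }
}
```

Plain threads (or trusted threads with finalizers enabled) keep the guard, so existing behavior is preserved unless the user explicitly opts in.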

@franz1981 (Contributor, Author) commented Jul 2, 2024

FYI @normanmaurer @chrisvest I've added 72c4731, which pushes the approach further by adding an opt-in to NOT use finalizers for FastThreadLocalThreads that take care of cleaning up their own FastThreadLocal instances.

@chrisvest (Contributor) left a comment

I ran the non-broken NettyLeak1 reproducer from #12749, and it looks like the issue is resolved. So… controversial suggestion: maybe we don't need to make the leak check and can always clear the FreeOnFinalize.cache field.

@chrisvest (Contributor) commented:

Here's a fixed version of NettyLeak0 from #12749 which also shows no leaks:

import io.netty.buffer.ByteBuf;
import io.netty.buffer.PoolChunkMetric;
import io.netty.buffer.PooledByteBufAllocator;

import java.util.ArrayList;
import java.util.Collections;
import java.util.List;
import java.util.concurrent.ArrayBlockingQueue;
import java.util.concurrent.atomic.AtomicInteger;

public class NettyLeak0 {

    public static void main(String[] args) throws Exception {
        System.setProperty("io.netty.allocator.useCacheForAllThreads", "true");
        System.setProperty("io.netty.allocator.maxOrder", "9");
        Thread.setDefaultUncaughtExceptionHandler((t, e) -> {
            e.printStackTrace();
            Runtime.getRuntime().halt(42);
        });

        int bufSize = 32 * 1024;

        AtomicInteger longLiving = new AtomicInteger();
        List<ByteBuf> longLivingSink = Collections.synchronizedList(new ArrayList<ByteBuf>());
        ArrayBlockingQueue<Thread> threads = new ArrayBlockingQueue<>(64);
        for (int i = 0; true; i++) {
            Thread thread = new Thread(() -> {
                for (int x = 0; x < 256; x++) {
                    if (x != 0) {
                        // allocate different size classes to fill more thread caches
                        for (int j = 0; j <= 10; j++) {
                            ByteBuf buf = PooledByteBufAllocator.DEFAULT.directBuffer(bufSize >> j);
                            buf.release();
                        }
                    } else {
                        // only one buf is alive after thread termination
                        ByteBuf buf = PooledByteBufAllocator.DEFAULT.directBuffer(1);
                        longLiving.incrementAndGet();
                        longLivingSink.add(buf);
                    }
                }
            });
            if (!threads.offer(thread)) {
                threads.take().join();
                threads.put(thread);
            }
            thread.start();

            if (i % 1000 == 0) {
                System.gc(); // to get a chance for io.netty.buffer.PoolThreadCache.finalize
                System.out.println(PooledByteBufAllocator.DEFAULT.metric());
                System.out.println(
                        "chunks count: " +
                                PooledByteBufAllocator.DEFAULT.metric().directArenas().stream()
                                        .flatMap(m -> m.chunkLists().stream())
                                        .mapToInt(m -> {
                                            int count = 0;
                                            for (PoolChunkMetric __ : m) {
                                                count += 1;
                                            }
                                            return count;
                                        })
                                        .sum()
                );
                int ll = longLiving.get();
                System.out.println("long living: " + ll);
                // real size of a buf will be > 1b & < 1kb, but just estimate it as 1kb
                System.out.println("long living size (MB): " + ll / 1024);
                System.out.println();
                longLivingSink.removeIf(buf -> buf.release() && decrement(longLiving));
            }
        }
    }

    private static boolean decrement(AtomicInteger longLiving) {
        longLiving.addAndGet(-1);
        return true;
    }
}

@franz1981 (Contributor, Author) commented:

@normanmaurer wdyt about the @chrisvest suggestion?

maybe we don't need to make the leak check and can always clear the FreeOnFinalize.cache field.

@normanmaurer (Member) commented:

@normanmaurer wdyt about the @chrisvest suggestion?

maybe we don't need to make the leak check and can always clear the FreeOnFinalize.cache field.

Sounds good to me

@franz1981 franz1981 marked this pull request as ready for review July 4, 2024 09:58
@franz1981 (Contributor, Author) commented:

PTAL @chrisvest @normanmaurer

Not very happy that I've added a new sys prop here, but I want to keep it safe for existing users...
Anyone who needs to keep creating/stopping event loop threads (e.g. the dev mode in Quarkus) can decide to avoid finalization and just set it to speed up dropping the previous classloader (@gsmet please try this for your issue)

@normanmaurer (Member) commented:

PTAL @chrisvest @normanmaurer

Not very happy that I've added a new sys prop here, but I want to keep it safe for existing users... Anyone who needs to keep creating/stopping event loop threads (e.g. the dev mode in Quarkus) can decide to avoid finalization and just set it to speed up dropping the previous classloader (@gsmet please try this for your issue)

That's good to me

@franz1981 (Contributor, Author) commented:

Let's see what the CI thinks @normanmaurer, and thanks again!

@gsmet commented Jul 4, 2024

Let me have a look for my use case before merging. Thanks!

@gsmet commented Jul 4, 2024

I can confirm that I don't see the finalizers trying to load classes after my CL is closed, which looks promising!

gsmet added a commit to gsmet/quarkus that referenced this pull request Jul 4, 2024
…eads

This is going to be useful once
netty/netty#14155 lands in Quarkus.
@franz1981 (Contributor, Author) commented:

So @normanmaurer, now we just need CI and @chrisvest to be happy about this

@franz1981 (Contributor, Author) commented:

Windows seems to be the only unhappy one, likely for unrelated reasons; if you're good to go, I'm done here @normanmaurer

@normanmaurer (Member) commented:

I am happy if @chrisvest is happy... @chrisvest PTAL again

@chrisvest (Contributor) left a comment

Looks good.

@franz1981 (Contributor, Author) commented:

@normanmaurer I can merge it myself, ok?
My first time merging a PR here (no jokes!)

@normanmaurer (Member) commented:

@franz1981 sure... let's go. Please also cherry-pick it to the other related branches.

@normanmaurer normanmaurer added this to the 4.1.112.Final milestone Jul 9, 2024
@franz1981 franz1981 merged commit 8039232 into netty:4.1 Jul 9, 2024
17 checks passed
franz1981 added a commit that referenced this pull request Jul 9, 2024
…ect (#14155)

normanmaurer pushed a commit to yawkat/netty that referenced this pull request Jul 17, 2024
…ect (netty#14155)
