
fix(core): Remove Python code node memory leak using Workers #13648

Open
wants to merge 2 commits into master

Conversation

@riseandignite commented on Mar 3, 2025

Summary

This PR addresses issue #7939, where users report significant memory growth when using Python in Code nodes. The solution runs each Python execution in a Worker, which completely resolves the memory leak.

Memory usage comparison charts: https://n8n-memory-git-master-nikitas-projects-2b098508.vercel.app/

[Screenshots: memory usage charts showing the growth pattern with the current implementation]

Problem

Users have documented persistent memory growth issues when running Python code nodes:

  • Memory increases with each execution and never fully recovers
  • Baseline memory usage continually grows over time
  • Eventually leads to high memory consumption and OOM errors

Looking at the memory charts from my testing, the pattern is clear: our current implementation shows steadily increasing memory usage that never fully recovers. This matches exactly what users have reported in issue #7939, where memory grows from around 200MB to over 1GB and continues climbing with each Python execution.

Investigation

The root cause is in how Pyodide (the WebAssembly-based Python runtime used by the Code node) manages memory: when Python code executes through Pyodide, it creates WebAssembly memory allocations that aren't fully released afterward. Even when JavaScript references are cleared and Python's garbage collector runs, portions of this WASM heap remain allocated.
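
For illustration, a minimal Node.js reproduction of this pattern might look like the sketch below (an assumed sketch, not code from n8n or this PR; the printed numbers vary by machine):

// Hypothetical reproduction sketch: load Pyodide, run a script that allocates a
// large list, drop every JavaScript reference, and compare the process RSS.
// The WebAssembly heap that Pyodide grew for the allocation is never shrunk,
// so the "after" number stays well above the "before" number.
import { loadPyodide, type PyodideInterface } from 'pyodide';

const rssMb = () => Math.round(process.memoryUsage().rss / 1e6);

async function main(): Promise<void> {
  console.log('before:', rssMb(), 'MB');

  let pyodide: PyodideInterface | undefined = await loadPyodide();
  pyodide.runPython('my_list = [0] * 1000000');
  pyodide = undefined; // drop the only reference; there is no API to tear the instance down

  (globalThis as { gc?: () => void }).gc?.(); // only has an effect with --expose-gc
  console.log('after:', rssMb(), 'MB'); // remains elevated: the WASM memory is retained
}

main();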

Solutions Tested

I tested three approaches:

  1. Removing the singleton pattern: This helped somewhat by creating a fresh Pyodide instance each time, but still left significant memory unreleased.
  2. Manual cleanup with sys.modules.clear() and gc.collect(): This provided minor improvements but couldn't reach all the memory being held (a sketch of this cleanup follows the list).
  3. Worker implementation: This completely solved the issue by isolating each execution in its own worker thread and terminating it afterward.
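
For reference, the manual cleanup from approach 2 looked roughly like this (a sketch, assuming access to the shared Pyodide instance):

import type { PyodideInterface } from 'pyodide';

// Approach 2 (sketch): clear Python's module cache and force a garbage
// collection pass after each execution. This frees some Python-side objects,
// but the WebAssembly heap Pyodide has already grown is not returned.
function cleanupAfterExecution(pyodide: PyodideInterface): void {
  pyodide.runPython('import sys, gc\nsys.modules.clear()\ngc.collect()');
}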

The charts clearly show that both removing the singleton and using Workers address the memory growth, but the Worker approach is significantly more effective at reclaiming memory: https://n8n-memory-git-master-nikitas-projects-2b098508.vercel.app/

The test workflow executed two Python scripts one after the other every 20 seconds for ~10-15 minutes, then was turned off for ~10 minutes.

# Allocate a list of one million integers and return it from the Code node
my_list = [0] * 1000000
return {"a": my_list}

[Screenshots: memory usage charts comparing the singleton removal and Worker approaches]

Why Workers Work Better

The Worker solution is superior because terminating a worker forces the JavaScript runtime to reclaim all resources associated with it, including the WebAssembly heap that normal garbage collection can't reach. This creates a complete reset between executions without relying on Pyodide's internal cleanup mechanisms.

This implementation should resolve the frustrating experience users like @pablorq and @merlinxcy have reported where their n8n instances continuously consume more memory until they're forced to restart.

The implementation creates a new worker for each Python execution (a simplified sketch follows the list):

  1. Launch worker with Pyodide environment
  2. Send Python code and context to worker
  3. Receive results from worker
  4. Terminate worker completely, releasing all memory
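
As a rough illustration of these steps (a simplified sketch under assumed shapes, not the PR's exact code; in practice the worker body would live in its own file):

// Each Python execution gets its own worker_threads Worker running Pyodide,
// and the worker is terminated afterwards so its entire WASM heap is released.
import { Worker } from 'node:worker_threads';

// Inline worker body for illustration only.
const workerScript = `
const { parentPort, workerData } = require('node:worker_threads');
const { loadPyodide } = require('pyodide');

(async () => {
  const pyodide = await loadPyodide();
  pyodide.globals.set('_context', pyodide.toPy(workerData.context));
  const result = pyodide.runPython(workerData.code);
  parentPort.postMessage({ result: result?.toJs ? result.toJs() : result });
})().catch((error) => parentPort.postMessage({ error: String(error) }));
`;

export async function runPythonInWorker(code: string, context: unknown): Promise<unknown> {
  // 1. Launch a worker with the Pyodide environment (code and context travel via workerData).
  const worker = new Worker(workerScript, { eval: true, workerData: { code, context } });
  try {
    // 2. + 3. Wait for the worker to post back the result of the Python code.
    return await new Promise((resolve, reject) => {
      worker.once('message', (msg) => ('error' in msg ? reject(new Error(msg.error)) : resolve(msg.result)));
      worker.once('error', reject);
    });
  } finally {
    // 4. Terminate the worker completely, releasing all of its memory.
    await worker.terminate();
  }
}

Terminating the worker in a finally block ensures the memory is released even when the Python code throws.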

Implementation Tradeoffs

This implementation creates some tradeoffs worth mentioning:

  1. Performance overhead: Creating a worker and initializing a new Pyodide environment for each execution adds up to ~150ms of latency compared to reusing an existing instance, depending on the machine. This is typically negligible for most workflows, especially since the Python code execution itself is usually the more time-consuming part.

  2. Resource usage: Each worker temporarily increases memory usage by approximately 40-60MB during initialization, though these resources are completely released afterward. This temporary spike is far preferable to the permanent memory growth pattern we were seeing.

However, these tradeoffs are well justified given:

  • The complete resolution of memory leaks
  • Prevention of OOM crashes in production systems
  • More consistent performance over time
  • Elimination of need for periodic restarts

In testing, the initialization overhead proved to be a worthwhile tradeoff for the stability benefits. For most Python Code node use cases, the slight performance impact will be negligible compared to the benefits of reliable memory management.

Future Optimizations

While this implementation completely solves the memory leak issue, there are potential optimizations we could explore in the future:

  1. Worker pooling: If Python execution becomes performance-critical, we could implement a small pool of workers that are reused for a limited number of executions before being recycled. This would balance memory management with startup performance.

  2. Selective serialization: We could optimize the data passing between the main thread and worker with smarter serialization that includes only the minimum required context (sketched below).

These optimizations aren't necessary for the current fix but represent potential future enhancements if needed.
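
As an illustration of the selective-serialization idea, something like the following could trim what crosses the thread boundary (the item shape and names are placeholders, not n8n's actual internal types):

// Hypothetical helper: send only the plain JSON payload of each item to the
// worker instead of cloning helpers, proxies, or binary buffers.
interface NodeItem {
  json: Record<string, unknown>;
  binary?: unknown;
}

function buildWorkerContext(items: NodeItem[]): Array<Record<string, unknown>> {
  return items.map((item) => item.json);
}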

Related Linear tickets, Github issues, and Community forum posts

Fixes: #7939

Review / Merge checklist

  • PR title and summary are descriptive. (conventions)
  • Docs updated or follow-up ticket created.
  • Tests included.
  • PR Labeled with release/backport (if the PR is an urgent fix that needs to be backported)

@CLAassistant commented on Mar 3, 2025

CLA assistant check
All committers have signed the CLA.

@riseandignite riseandignite changed the title fix(core): execute runCodeInPython in Worker to prevent memory leak fix(core): Remove Python code node memory leak using Web Workers Mar 3, 2025
@riseandignite riseandignite marked this pull request as ready for review March 3, 2025 16:12
@riseandignite riseandignite changed the title fix(core): Remove Python code node memory leak using Web Workers fix(core): Remove Python code node memory leak using Workers Mar 3, 2025
@n8n-assistant n8n-assistant bot added community Authored by a community member node/improvement New feature or request in linear Issue or PR has been created in Linear for internal review labels Mar 3, 2025
@Joffcom (Member) commented on Mar 3, 2025

Hey @riseandignite,

Thanks for the PR. We have created "GHC-1035" as the internal reference to get this reviewed.

One of us will be in touch if any changes are needed; in most cases this happens within a couple of weeks, but it depends on the team's current workload.

@netroy (Member) left a comment

I've experimented with something like this before, but had to drop the idea because loading Pyodide for every execution becomes very resource-intensive as instances scale up.

Maybe instead of completely removing the singleton, we should have a pool of workers that we can recycle after a fixed number of code executions, or after a certain amount of time.

That won't solve the memory leak properly, but every time a worker thread is recycled, its memory should be released.

I think it might make a lot more sense to migrate Python support to Task-Runners, switch to real Python, and move away from Pyodide completely.

@riseandignite (Author) commented

Thanks for the feedback @netroy! You raise an excellent point about scale.

Let me share some numbers to put this in context:

Current memory growth issue:

  • Users are reporting memory growth from ~200MB to over 1GB with standard Python usage
  • @pablorq had to restart their instance every few days due to this growth
  • The memory never decreases after Python code runs, forcing restarts

Worker initialization costs:

  • Each Pyodide initialization: ~40-60MB memory + ~150-200ms startup time
  • 10 simultaneous Python executions: ~400-600MB temporary memory. 100 executions: 4-6GB.
  • However, this memory is fully released afterward (unlike the current approach)

You're absolutely right that a worker pool would be a better compromise. I actually mentioned this as a future optimization in my PR description, but it makes sense to implement it now if scalability is a concern.

I can modify the PR to implement a pool that:

  • Maintains a small pool of workers (configurable, default 3-5)
  • Recycles each worker after X executions (configurable, default 10-20)
  • Gives us ~80-90% of the memory-leak-prevention benefit
  • Reduces resource usage by 80-95% compared to creating a worker per execution

This approach would significantly reduce the initialization overhead while still preventing the unbounded memory growth that's currently happening. Would that be a better approach? I can make these changes if you think this strikes the right balance.
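
A rough sketch of the pool shape I have in mind (names, defaults, and the worker file are placeholders, not final code):

// Hypothetical worker pool: reuse a small set of Pyodide workers and recycle
// each one after a fixed number of executions so its WASM heap is released.
import { Worker } from 'node:worker_threads';

const POOL_SIZE = 3;        // configurable, e.g. default 3-5
const MAX_EXECUTIONS = 20;  // configurable, e.g. default 10-20
const WORKER_FILE = new URL('./pythonWorker.js', import.meta.url); // assumed worker script

interface PooledWorker {
  worker: Worker;
  executions: number;
}

const pool: PooledWorker[] = [];

function acquireWorker(): PooledWorker {
  // Lazily fill the pool up to POOL_SIZE, then hand out the least-used entry.
  if (pool.length < POOL_SIZE) {
    const entry = { worker: new Worker(WORKER_FILE), executions: 0 };
    pool.push(entry);
    return entry;
  }
  return pool.reduce((a, b) => (a.executions <= b.executions ? a : b));
}

async function releaseWorker(entry: PooledWorker): Promise<void> {
  entry.executions += 1;
  if (entry.executions >= MAX_EXECUTIONS) {
    // Recycling: terminate the worker (freeing its Pyodide memory); the next
    // acquireWorker() call will create a fresh replacement.
    pool.splice(pool.indexOf(entry), 1);
    await entry.worker.terminate();
  }
}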

And I agree that migrating to real Python via Task-Runners would be the ideal long-term solution.
