Describe the issue
There's a memory leak when creating an inference session with the runtime in Node: each time a session is created and released, memory usage grows. We run inference on an edge device with limited memory, so after a few sessions the device runs out of memory. Because this is a realtime system, we need to be able to reload the session without restarting the whole application.
Disabling "enableCpuMemArena" and "enableMemPattern", as suggested for similar issues, does not stop the leak.
To reproduce
I've made a minimal JavaScript example that reproduces the issue:
import * as ort from "onnxruntime-node";

const MODEL_FILE_PATH = "./model.onnx";

// Create a session with the CPU memory arena and memory pattern
// optimizations disabled, as suggested for similar issues.
const getModel = async (filename) =>
  await ort.InferenceSession.create(filename, {
    enableCpuMemArena: false,
    enableMemPattern: false,
  });

// Load a session and release it immediately.
const loadSession = async () => {
  const session = await getModel(MODEL_FILE_PATH);
  await session.release();
};

const main = async () => {
  for (let i = 0; i < 10; i++) {
    await loadSession();
  }
};

(async () => {
  await main();
})();
When testing locally, the application's memory usage sits at 325 MB after loading the session once. After 10 load/release cycles it grows to 994 MB, and after 100 cycles it reaches 9.12 GB.
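For reference, here is a sketch of how the per-cycle growth can be tracked from within the script itself, logging process.memoryUsage().rss after each cycle and forcing a GC pass (requires running Node with --expose-gc) to rule out ordinary JS-heap growth. The logRss helper is illustrative, not part of any API, and RSS readings may not exactly match the figures above, which were read from the OS:

import * as ort from "onnxruntime-node";

const MODEL_FILE_PATH = "./model.onnx";

// Illustrative helper: print the resident set size in MB.
const logRss = (label) => {
  const rssMb = (process.memoryUsage().rss / 1024 / 1024).toFixed(1);
  console.log(`${label}: rss = ${rssMb} MB`);
};

(async () => {
  for (let i = 0; i < 100; i++) {
    const session = await ort.InferenceSession.create(MODEL_FILE_PATH, {
      enableCpuMemArena: false,
      enableMemPattern: false,
    });
    await session.release();
    // Force a GC pass when available (run with: node --expose-gc repro.mjs)
    // so pending JS garbage doesn't inflate the reading.
    if (global.gc) global.gc();
    logRss(`after cycle ${i + 1}`);
  }
})();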
Urgency
MEDIUM
Platform
Mac
OS Version
14.5
ONNX Runtime Installation
Released Package
ONNX Runtime Version or Commit ID
1.22.0-rev
ONNX Runtime API
JavaScript
Architecture
X64
Execution Provider
Default CPU
Execution Provider Library Version
No response
Model File
No response
Is this a quantized model?
Unknown