Fixed memory leaks in topic reader when creating a large number of instances in sequence #547
Conversation
Force-pushed from 2868d02 to 8b2c440
Pull request overview
This PR addresses memory leaks in the topic reader that occur when creating and destroying reader instances frequently. The fixes focus on proper cleanup of resources including abort signals, event listeners, message buffers, partition sessions, and queue items.
Key Changes:
- Replaced `AbortSignal.any()` with manual signal merging and proper listener cleanup to prevent memory accumulation (a sketch of this pattern follows the list)
- Added a `reset()` method to AsyncPriorityQueue and enhanced `dispose()` to properly clear queue state
- Enhanced resource cleanup in `destroy()` methods to clear partition sessions, message buffers, and pending commits
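For illustration, the manual signal-merging pattern from the first bullet might look roughly like this; the helper name `mergeSignals` and its exact shape are assumptions, not code from the PR:

```ts
// Sketch of manual signal merging with explicit listener cleanup.
function mergeSignals(...signals: AbortSignal[]): { signal: AbortSignal; cleanup: () => void } {
  let controller = new AbortController()
  let onAbort = () => controller.abort()
  for (let signal of signals) {
    if (signal.aborted) {
      controller.abort()
      break
    }
    signal.addEventListener('abort', onAbort, { once: true })
  }
  // Calling cleanup() when the reader goes away removes the listeners from the
  // long-lived parent signals, so short-lived readers stop accumulating there.
  let cleanup = () => {
    for (let signal of signals) {
      signal.removeEventListener('abort', onAbort)
    }
  }
  return { signal: controller.signal, cleanup }
}
```

The important part is that `cleanup()` runs on teardown (the `_read.ts` row below describes doing this in a finally block), so each short-lived reader detaches from the parent signal instead of leaving listeners behind.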
Reviewed changes
Copilot reviewed 10 out of 10 changed files in this pull request and generated 10 comments.
| File | Description |
|---|---|
| packages/topic/vitest.config.ts | Increased test timeout to 15 seconds for integration tests |
| packages/topic/tests/memory-leak.test.ts | Added comprehensive memory leak test that creates/destroys 50,000 readers to validate stable memory consumption |
| packages/topic/src/reader/types.ts | Extended TopicReader interface to support both sync and async disposal patterns |
| packages/topic/src/reader/index.ts | Added stream promise tracking, enhanced cleanup in destroy methods, implemented disposal symbols for both sync and async cleanup |
| packages/topic/src/reader/_shared.ts | Updated documentation for background token refresher parameter |
| packages/topic/src/reader/_read.ts | Replaced AbortSignal.any() with manual signal merging and added proper event listener cleanup in finally block |
| packages/topic/src/reader/_consume_stream_tx.ts | Updated retry logic to call close() and reset() on queue for clean state on reconnection |
| packages/topic/src/reader/_consume_stream.ts | Updated retry logic and fixed signal checking to use retry signal instead of controller signal |
| packages/topic/src/queue.ts | Added reset() method to clear queue state and enhanced dispose() to prevent memory leaks |
    state.outgoingQueue.close()
    state.outgoingQueue.reset()
Copilot AI · Dec 8, 2025
Potential race condition: `close()` and `reset()` are called sequentially on `state.outgoingQueue` at the start of each retry. However, the queue is shared state that could be accessed by concurrent operations (e.g., the read generator trying to send read requests, or the background token refresher). This could lead to:
- `push()` being called between `close()` and `reset()`, which would throw a "Queue closed" error
- Iterator operations happening during the reset
Consider either:
- Using a new queue instance for each retry instead of resetting the existing one (a rough sketch of this option follows), or
- Adding proper synchronization to prevent concurrent access during the close/reset sequence
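For illustration, the first option could look roughly like this; the minimal queue interface and the `state` shape here are assumptions, not code from the PR:

```ts
// Sketch: swap in a fresh queue on every retry so the close()/reset() window disappears.
interface QueueLike<T> {
  push(item: T, priority?: number): void
  close(): void
  dispose(): void
}

function swapQueueForRetry<T>(
  state: { outgoingQueue: QueueLike<T> },
  makeQueue: () => QueueLike<T>,
) {
  let staleQueue = state.outgoingQueue
  // Producers that read the queue through `state` pick up the fresh instance,
  state.outgoingQueue = makeQueue()
  // while the stale one is closed and disposed, so any late push() fails fast
  // instead of landing in a queue that nothing drains anymore.
  staleQueue.close()
  staleQueue.dispose()
}
```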
    state.outgoingQueue.close()
    state.outgoingQueue.reset()
Copilot AI · Dec 8, 2025
Potential race condition: Same issue as in _consume_stream.ts - close() and reset() are called sequentially on a shared queue that could be accessed concurrently by the read generator or background operations. This could cause "Queue closed" errors or iterator issues during the reset sequence.
packages/topic/src/reader/index.ts (Outdated)
    state.controller.abort(reason)
    state.outgoingQueue.close()
    state.outgoingQueue.dispose()
    state.readOffsets.clear()
    state.partitionSessions.clear()
    state.buffer.length = 0
    state.disposed = true
Copilot AI · Dec 8, 2025
The order of cleanup operations in destroy() is inconsistent with the regular TopicReader. In TopicReader.destroy(), state.disposed is set to true before calling state.controller.abort(), while here it's set after.
Setting disposed = true first is safer because it prevents re-entry issues - if any code checks state.disposed during the abort process, it will correctly see that the reader is already being disposed. This should match the pattern used in the regular reader for consistency.
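For reference, the ordering the comment suggests could look like this; a sketch built from the fields shown in the diff above, with an assumed minimal state shape, not the actual reader code:

```ts
// Assumed minimal shape, for illustration only.
interface ReaderState {
  disposed: boolean
  controller: AbortController
  outgoingQueue: { close(): void; dispose(): void }
  readOffsets: Map<unknown, unknown>
  partitionSessions: Map<unknown, unknown>
  buffer: unknown[]
}

function destroy(state: ReaderState, reason?: Error) {
  if (state.disposed) return // guard added for illustration
  // Mark disposed first so anything that checks state.disposed while the abort
  // propagates already sees the reader as shutting down.
  state.disposed = true
  state.controller.abort(reason)
  state.outgoingQueue.close()
  state.outgoingQueue.dispose()
  state.readOffsets.clear()
  state.partitionSessions.clear()
  state.buffer.length = 0
}
```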
packages/topic/src/reader/index.ts (Outdated)
    // Wait for the stream consumption to finish (with timeout)
    if (streamPromise) {
      try {
        await Promise.race([
          streamPromise,
          new Promise<void>((resolve) =>
            setTimeout(resolve, 1000)
          ), // 1s timeout
        ])
      } catch {
        // Ignore errors from stream - it's already being closed
      }
    }
Copilot AI · Dec 8, 2025
[nitpick] The async disposal waits for close() to complete but then also waits for streamPromise with a timeout. However, close() internally calls destroy() at the end (line 138), which aborts the controller signal that _consume_stream is waiting on. This means that streamPromise should complete quickly after close() returns.
The additional wait for streamPromise with a 1-second timeout may be unnecessary since the stream should already be finishing due to the abort signal. Consider either:
- Removing the streamPromise wait since destroy was already called, or
- Adding a comment explaining why this additional wait is needed despite close() calling destroy()
Suggested change: replace the `streamPromise` wait block above with:

    // No need to wait for streamPromise: close() already calls destroy(), which aborts the stream.
packages/topic/src/reader/types.ts (Outdated)
    disposed: boolean

    // Data structures
    // Note: outgoingQueue is mutable because it needs to be recreated on each retry
Copilot AI · Dec 8, 2025
The comment states "outgoingQueue is mutable because it needs to be recreated on each retry" but the field is marked as readonly and the implementation doesn't recreate it - it calls close() and reset() on the same instance instead.
Either the comment should be updated to reflect the actual implementation, or if the queue truly should be recreated on retry (which might be safer to avoid shared state issues), the implementation should be changed to match the comment.
Suggested change: replace the comment with:

    // Note: outgoingQueue is reset (not recreated) on each retry
packages/topic/src/queue.ts (Outdated)
    reset() {
      dbg.log('resetting queue, clearing %d items', this.heap.length)
      this.heap.length = 0
      this.closed = false
Copilot AI · Dec 8, 2025
The reset() method doesn't clear pending operations (pendingShift and pendingResume). When called after close(), there may be pending promises that were created during the next() call that will never be resolved. This could cause memory leaks if callers are waiting on these promises.
Consider adding cleanup for these pending operations:
    reset() {
      dbg.log('resetting queue, clearing %d items', this.heap.length)
      this.heap.length = 0
      this.closed = false
      // Clean up any pending operations
      if (this.pendingShift) {
        this.pendingShift({ value: undefined as any, done: true })
        delete this.pendingShift
      }
      if (this.pendingResume) {
        this.pendingResume()
        delete this.pendingResume
      }
    }

Suggested change: after `this.closed = false`, add:

    // Clean up any pending operations
    if (this.pendingShift) {
      dbg.log('resolving pending shift with done: true (reset)')
      let resolve = this.pendingShift
      delete this.pendingShift
      resolve({ value: undefined as any, done: true })
    }
    if (this.pendingResume) {
      dbg.log('resolving pending resume (reset)')
      let resolve = this.pendingResume
      delete this.pendingResume
      resolve()
    }
    if (messageCount >= limit) {
      ctx.buffer.unshift(response) // Put the response back to the front of the buffer
      break
    }
Copilot AI · Dec 8, 2025
The condition `messageCount >= limit` is always false.
Suggested change: remove these lines.
    if (messageCount >= limit) {
      batch.messageData.unshift(msg) // Put the message back to the front of the batch
      break
    }
Copilot AI · Dec 8, 2025
The condition `messageCount >= limit` is always false.
Suggested change: remove these lines.
Signed-off-by: Vladislav Polyakov <polRk@ydb.tech>
Force-pushed from cd2154f to 9e9fe6e
What
This PR addresses several memory leaks in the topic reader implementation that could cause excessive memory consumption when creating and destroying readers frequently.
Why
These changes fix memory leaks that could cause applications using the topic reader to consume increasing amounts of memory over time, especially in scenarios where readers are frequently created and destroyed. The fixes ensure proper cleanup of resources including:
- Abort signal event listeners
- Message buffers
- Partition sessions
- Pending commits and queued items
Changes
Testing
The memory leak test validates that memory consumption remains stable even after creating and destroying 50,000 readers.
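For context, a test of that kind generally follows this shape; a sketch only, where the reader factory and the growth threshold are assumptions, and only the 50,000 create/destroy cycles come from the description above:

```ts
import { test, expect } from 'vitest'

// Hypothetical stand-in for however the test constructs a reader.
declare function createTopicReader(): { close(): Promise<void> }

test('creating and destroying readers keeps the heap stable', async () => {
  globalThis.gc?.() // only has an effect under node --expose-gc
  let baseline = process.memoryUsage().heapUsed

  for (let i = 0; i < 50_000; i++) {
    let reader = createTopicReader()
    await reader.close()
  }

  globalThis.gc?.()
  let growth = process.memoryUsage().heapUsed - baseline
  expect(growth).toBeLessThan(50 * 1024 * 1024) // bounded growth, not an exact zero
})
```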
Checklist