[offloader] fix async scheduling support with KV cache offloader by AlpinDale · Pull Request #1596 · dphnAI/sonar

AlpinDale · 2025-11-04T10:59:58Z

No description provided.

Signed-off-by: AlpinDale <alpindale@gmail.com>

gemini-code-assist

Code Review

This pull request addresses a crash in the KV cache offloader during asynchronous scheduling by removing an assertion. My review suggests reintroducing this assertion conditionally. This change ensures that the safety check remains active for synchronous operations, thereby enhancing code robustness, while still accommodating the specific requirements of asynchronous scheduling that led to the original issue.

gemini-code-assist · 2025-11-04T11:02:17Z

+            # NOTE: In async scheduling, placeholders may temporarily make
+            # len(req.block_hashes) < num_blocks * self.block_size_factor.


While removing the assertion fixes the crash with async scheduling, it also removes a valuable safety check that can catch other potential bugs. A safer approach would be to make the assertion conditional, so it only applies to requests that are not currently undergoing an asynchronous KV cache load. This preserves the safeguard for synchronous cases.

# The assertion is skipped for requests with an ongoing async load. if req_id not in self._reqs_being_loaded: num_gpu_blocks = num_blocks * self.block_size_factor assert len(req.block_hashes) >= num_gpu_blocks

[offloader] fix async scheduling support with KV cache offloader

6ed5122

Signed-off-by: AlpinDale <alpindale@gmail.com>

AlpinDale merged commit 7016e79 into main Nov 4, 2025
1 check passed

AlpinDale deleted the offloader-async branch November 4, 2025 11:01

gemini-code-assist Bot reviewed Nov 4, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

[offloader] fix async scheduling support with KV cache offloader#1596

[offloader] fix async scheduling support with KV cache offloader#1596
AlpinDale merged 1 commit into
mainfrom
offloader-async

AlpinDale commented Nov 4, 2025

Uh oh!

Uh oh!

gemini-code-assist Bot left a comment

Uh oh!

gemini-code-assist Bot Nov 4, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

		# NOTE: In async scheduling, placeholders may temporarily make
		# len(req.block_hashes) < num_blocks * self.block_size_factor.

Uh oh!

Uh oh!

Conversation

AlpinDale commented Nov 4, 2025

Uh oh!

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

gemini-code-assist Bot Nov 4, 2025

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant