[lora] allow int64 values for LoRA ID to avoid overflow by AlpinDale · Pull Request #1574 · dphnAI/sonar

AlpinDale · 2025-11-04T04:48:18Z

No description provided.

Signed-off-by: AlpinDale <alpindale@gmail.com>

gemini-code-assist

Code Review

This pull request updates the dtype of request_lora_mapping from np.int32 to np.int64 in both gpu_input_batch.py and tpu_input_batch.py to prevent potential overflows with LoRA IDs. While this change is necessary to support larger IDs, it introduces a risk of type mismatch with downstream components like CUDA or TPU kernels that might still expect 32-bit integers. I've added critical comments highlighting the need to ensure all consumers of this array are updated to prevent potential data corruption or crashes.
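To illustrate the overflow this change guards against, here is a minimal sketch. It assumes a LoRA ID just above the int32 maximum; `store_lora_id` and the ID value are hypothetical and not part of the PR. The value goes through an explicit `astype` cast so the wrap-around is the same across NumPy versions.

```python
import numpy as np

INT32_MAX = np.iinfo(np.int32).max  # 2147483647


def store_lora_id(lora_id: int, dtype) -> int:
    """Store one ID in a one-slot mapping array and read it back.

    Hypothetical helper for illustration only; not code from the PR.
    """
    # Build as int64 first, then cast: integer-to-integer astype wraps
    # out-of-range values instead of raising.
    mapping = np.array([lora_id], dtype=np.int64).astype(dtype)
    return int(mapping[0])


big_id = INT32_MAX + 6  # hypothetical ID that does not fit in 32 bits

print(store_lora_id(big_id, np.int32))  # wraps to -2147483643
print(store_lora_id(big_id, np.int64))  # preserved: 2147483653
```

With an int32 mapping the ID silently becomes a negative number, while the int64 mapping round-trips it unchanged.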

gemini-code-assist · 2025-11-04T04:55:12Z


        # lora related
-        self.request_lora_mapping = np.zeros((self.max_num_reqs,), dtype=np.int32)
+        self.request_lora_mapping = np.zeros((self.max_num_reqs,), dtype=np.int64)


This change to np.int64 can cause a critical type mismatch. If downstream consumers of this array (e.g., CUDA kernels) still expect np.int32, it can lead to silent data corruption or crashes due to incorrect memory interpretation. It is crucial that all components using request_lora_mapping are also updated to handle 64-bit integers.
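The "incorrect memory interpretation" mentioned above can be sketched in plain NumPy by viewing an int64 buffer as int32. This only simulates a consumer that reads the raw bytes as 32-bit values; it is not the project's kernel code, `reinterpret_as_int32` is a hypothetical name, and a little-endian layout is fixed explicitly so the result is deterministic.

```python
import numpy as np


def reinterpret_as_int32(mapping: np.ndarray) -> np.ndarray:
    """Read the bytes of an int64 array as if they were int32 values.

    Hypothetical stand-in for a consumer that still assumes 32-bit
    elements. Little-endian dtypes are used explicitly.
    """
    return mapping.astype("<i8").view("<i4")


ids = np.array([1, 2, 3], dtype=np.int64)
print(reinterpret_as_int32(ids).tolist())  # [1, 0, 2, 0, 3, 0]
```

Each 64-bit ID is read as two 32-bit slots, so a consumer indexing by request would see the wrong ID for every request after the first.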

gemini-code-assist · 2025-11-04T04:55:12Z


        # lora related
-        self.request_lora_mapping = np.zeros((self.max_num_reqs,), dtype=np.int32)
+        self.request_lora_mapping = np.zeros((self.max_num_reqs,), dtype=np.int64)


This change to np.int64 can cause a critical type mismatch. If downstream consumers of this array (e.g., TPU kernels) still expect np.int32, it can lead to silent data corruption or crashes due to incorrect memory interpretation. It is crucial that all components using request_lora_mapping are also updated to handle 64-bit integers.
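One way to address the concern, if some consumer must keep receiving 32-bit integers, is a range-checked downcast at the boundary. The sketch below is an assumption about how such a guard could look; `to_int32_checked` is a hypothetical helper and does not appear in the PR.

```python
import numpy as np


def to_int32_checked(mapping: np.ndarray) -> np.ndarray:
    """Downcast an integer array to int32, refusing values that would wrap.

    Hypothetical guard for a 32-bit-only consumer; not code from the PR.
    """
    info = np.iinfo(np.int32)
    if mapping.size and (mapping.min() < info.min or mapping.max() > info.max):
        raise OverflowError("LoRA ID does not fit in int32")
    return mapping.astype(np.int32)


small = np.array([0, 1, 42], dtype=np.int64)
print(to_int32_checked(small).dtype)  # int32

too_big = np.array([2**31], dtype=np.int64)
try:
    to_int32_checked(too_big)
except OverflowError as exc:
    print("rejected:", exc)
```

Failing loudly at the boundary turns a silent wrap-around into an explicit error, at the cost of one extra pass over the array.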

[lora] allow int64 values for LoRA ID to avoid overflow

0dfc8fe

Signed-off-by: AlpinDale <alpindale@gmail.com>

AlpinDale merged commit 253aec1 into main Nov 4, 2025
1 check passed

AlpinDale deleted the int64-lora-id branch November 4, 2025 04:53

gemini-code-assist Bot reviewed Nov 4, 2025
