Refine prefill CUDA graph capture sizes by shihaobai · Pull Request #1331 · ModelTC/LightLLM

shihaobai · 2026-06-04T12:52:32Z

No description provided.

gemini-code-assist

Code Review

This pull request updates the generation of graph_handle_token_nums in prefill_cuda_graph.py to use a combination of specific ranges instead of exponential values. The review flagged the runtime call set[int](...). On Python 3.9 and later a parameterized built-in such as set[int] is callable and simply builds a plain set, while on earlier versions the subscript itself raises TypeError: 'type' object is not subscriptable. In either case the type parameter has no effect at runtime, so the reviewer recommended reverting this to set(...).


gemini-code-assist · 2026-06-04T12:53:06Z

        graph_handle_token_nums.append(self.max_handle_token_num)

-        graph_handle_token_nums = list(set(graph_handle_token_nums))
+        graph_handle_token_nums = list(set[int](graph_handle_token_nums))


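Calling set[int](...) at runtime is discouraged. On Python 3.9+ the resulting types.GenericAlias object is callable and returns an ordinary set without checking element types, so the [int] has no effect; on Python 3.8 and earlier, subscripting set raises TypeError: 'type' object is not subscriptable. Subscripted built-in types like set[...] are intended for type annotations. Use set(...) instead.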

Suggested change

-        graph_handle_token_nums = list(set[int](graph_handle_token_nums))
+        graph_handle_token_nums = list(set(graph_handle_token_nums))
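
For context, the deduplication step in the suggestion can be sketched in isolation as below. This is a minimal illustration, not the code from prefill_cuda_graph.py: the function name `dedupe_capture_sizes` and the sample sizes are hypothetical, and the final sort is an assumption added here because a set does not guarantee iteration order (the hunk does not show whether the list is sorted afterward).

```python
from typing import List


def dedupe_capture_sizes(token_nums: List[int], max_handle_token_num: int) -> List[int]:
    """Hypothetical helper: append the maximum size, drop duplicates, sort ascending."""
    sizes = list(token_nums)
    # Mirror the hunk above: the maximum is always included as a capture size.
    sizes.append(max_handle_token_num)
    # Plain set(...) removes duplicates; the element type belongs in the
    # annotation (List[int]), not in the runtime call.
    unique_sizes = set(sizes)
    # Assumption: sorting is added here so the result is deterministic.
    return sorted(unique_sizes)


print(dedupe_capture_sizes([16, 32, 64, 128, 128], 128))  # [16, 32, 64, 128]
```

Keeping the element type in the annotation rather than in the call keeps the runtime expression identical across Python versions.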

prefill cudagraph capture size

af897dd

gemini-code-assist Bot reviewed Jun 4, 2026


fix

76ee67b

shihaobai merged commit 63269d5 into main Jun 4, 2026
1 check passed

shihaobai deleted the prefill_cudagraph branch June 4, 2026 13:01

shihaobai merged 2 commits into main from prefill_cudagraph
