[CI/Build Don't add FLASHINFER backend in test_cpu_offloading.py #29229

rasmith · 2025-11-22T06:52:00Z

This fixes a test failure where tests/v1/kv_offload/test_cpu_offloading.py adds the FLASHINFER backend to the test, but ROCm platform does not support flashinfer library. Doing this allows the test to be successful in AMD CI. The test runs to completion and the result is:

1 passed, 3 warnings

Signed-off-by: Randall Smith <ransmith@amd.com>

gemini-code-assist

Code Review

The primary change correctly makes the FLASHINFER backend conditional on the CUDA platform, which resolves the described CI failure on ROCm. This is a good fix. However, a debug print statement has been introduced in one of the test files, which should be removed before this pull request is merged.

gemini-code-assist · 2025-11-22T06:52:32Z

tests/v1/sample/test_sampling_params_e2e.py

    params = SamplingParams(temperature=0, bad_words=[bad_words_1, bad_words_2])
    output = llm.generate(PROMPT, params)
    new_text = output[0].outputs[0].text
+    print(f"new_text={new_text}")


This print statement appears to be a leftover from debugging. Such statements should be removed from test code to keep the test output clean and avoid confusion during test runs.

Signed-off-by: Randall Smith <ransmith@amd.com>

…m-project#29229) Signed-off-by: Randall Smith <ransmith@amd.com> Co-authored-by: Randall Smith <ransmith@amd.com>

…m-project#29229) Signed-off-by: Randall Smith <ransmith@amd.com> Co-authored-by: Randall Smith <ransmith@amd.com> Signed-off-by: Runkai Tao <rt572@physics.rutgers.edu>

Randall Smith added 2 commits November 22, 2025 00:48

Don't add FLASHINFER backend in test_cpu_offloading.py

87be73e

Signed-off-by: Randall Smith <ransmith@amd.com>

add the right file

d411c9c

Signed-off-by: Randall Smith <ransmith@amd.com>

gemini-code-assist bot reviewed Nov 22, 2025

View reviewed changes

mergify bot added the v1 label Nov 22, 2025

get rid of change

3dd9f39

Signed-off-by: Randall Smith <ransmith@amd.com>

DarkLight1337 approved these changes Nov 22, 2025

View reviewed changes

DarkLight1337 enabled auto-merge (squash) November 22, 2025 09:15

github-actions bot added the ready ONLY add when PR is ready to merge/full CI is needed label Nov 22, 2025

DarkLight1337 merged commit 8e22da1 into vllm-project:main Nov 22, 2025
19 checks passed

ywang96 pushed a commit to ywang96/vllm that referenced this pull request Nov 23, 2025

[CI/Build Don't add FLASHINFER backend in test_cpu_offloading.py (vll…

9b6f89b

…m-project#29229) Signed-off-by: Randall Smith <ransmith@amd.com> Co-authored-by: Randall Smith <ransmith@amd.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

[CI/Build Don't add FLASHINFER backend in test_cpu_offloading.py #29229

[CI/Build Don't add FLASHINFER backend in test_cpu_offloading.py #29229

Uh oh!

rasmith commented Nov 22, 2025 •

edited by github-actions bot

Loading

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

gemini-code-assist bot Nov 22, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

[CI/Build Don't add FLASHINFER backend in test_cpu_offloading.py #29229

[CI/Build Don't add FLASHINFER backend in test_cpu_offloading.py #29229

Uh oh!

Conversation

rasmith commented Nov 22, 2025 • edited by github-actions bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

gemini-code-assist bot Nov 22, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

rasmith commented Nov 22, 2025 •

edited by github-actions bot

Loading