[CI Failure] Fix test_flashinfer_cutlass_mxfp4_mxfp8_fused_moe #24750
Signed-off-by: mgoin <mgoin64@gmail.com>
Code Review
This pull request addresses a CI failure in test_flashinfer_cutlass_mxfp4_mxfp8_fused_moe by ensuring that the dequantized reference weight tensors are moved to the correct CUDA device. The change is correct and resolves the device mismatch issue. I have provided suggestions to consolidate the tensor conversion and device placement calls for improved code clarity and efficiency.
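As a rough illustration of the consolidation suggested in the review, the sketch below compares two chained `.to()` calls with a single `.to(device=..., dtype=...)` call. The sizes and the `w2_flat` tensor are hypothetical stand-ins, not values from the test, and the snippet falls back to the CPU when no CUDA device is present.

```python
import torch

# Hypothetical sizes; the real test's values are not shown in this thread.
num_experts, hidden_size, intermediate_size = 2, 4, 8
device = "cuda" if torch.cuda.is_available() else "cpu"

# Stand-in for a dequantized reference weight that starts out on the CPU.
w2_flat = torch.arange(
    num_experts * hidden_size * intermediate_size, dtype=torch.float64)

# Two separate calls: dtype conversion first, device placement last.
w2_ref_two_step = w2_flat.to(torch.float32).reshape(
    num_experts, hidden_size, intermediate_size).to(device)

# One consolidated call: dtype and device are handled together.
w2_ref_one_step = w2_flat.to(device=device, dtype=torch.float32).reshape(
    num_experts, hidden_size, intermediate_size)

# Both forms should yield the same tensor on the same device.
same = torch.equal(w2_ref_two_step, w2_ref_one_step)
```

Both forms give the same result; the single call just expresses the conversion and the placement in one step.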
Thanks for the work!
  w2_q.view(torch.uint8),
  w2_scale.view(torch.uint8).reshape(-1)).to(torch.float32).reshape(
- num_experts, hidden_size, intermediate_size)
+ num_experts, hidden_size, intermediate_size).to(device)
What will happen if we don't add the to(device) here?
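For context on this question, here is a minimal sketch of the general PyTorch behavior, not a reproduction of the failing test: combining a CPU tensor with a CUDA tensor in an ordinary elementwise operation raises a `RuntimeError`. The helper `try_add` and the tensor names are hypothetical, and the mismatch branch only runs when a CUDA device is available.

```python
import torch


def try_add(a, b):
    """Return None if a + b succeeds, otherwise the error message."""
    try:
        a + b
        return None
    except RuntimeError as exc:
        return str(exc)


# Stays on the CPU, like a reference tensor created without .to(device).
cpu_ref = torch.ones(2, 3)

# Tensors on the same device combine without error.
same_device_error = try_add(cpu_ref, cpu_ref)

mismatch_error = None
fixed_error = None
if torch.cuda.is_available():
    gpu_out = torch.ones(2, 3, device="cuda")
    # Mixed devices: PyTorch refuses to combine the two tensors.
    mismatch_error = try_add(gpu_out, cpu_ref)
    # Moving the reference to the GPU first removes the mismatch.
    fixed_error = try_add(gpu_out, cpu_ref.to("cuda"))
```

On a CPU-only machine the snippet exercises only the same-device case, so the mismatch does not show up there.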
…project#24750) Signed-off-by: mgoin <mgoin64@gmail.com>
…project#24750) Signed-off-by: mgoin <mgoin64@gmail.com> Signed-off-by: bbartels <benjamin@bartels.dev>
Purpose
Also possibly found the culprit for the failing Blackwell CUTLASS MLA test: https://buildkite.com/vllm/ci/builds/30554/steps/canvas?jid=01993edf-720e-4749-81eb-da58099b7c78
Test Plan
Test Result