LucasWilkinson
Collaborator

#21404 broke DeepGEMM warm-up; this is a small fix.

Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com>
Contributor

@gemini-code-assist gemini-code-assist bot left a comment

Code Review

This pull request provides a critical bug fix for the DeepGEMM warmup process. The change correctly updates the attribute access from m.weight_scale_inv to m.weight_scale in _extract_data_from_linear_base_module. This is necessary because Fp8LinearMethod deletes the weight_scale_inv attribute and creates weight_scale after processing the weights. Without this fix, an AttributeError would occur during model warmup for models using block-wise FP8 quantization, preventing them from being used. The fix is correct and essential.
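
The failure mode described in the review can be illustrated with a minimal, self-contained sketch. It does not use vLLM: `process_weights_after_loading`, `extract_scale_old`, and `extract_scale_fixed` are hypothetical stand-ins, and a `SimpleNamespace` takes the place of the linear module. The only behaviour assumed from the review is that post-load processing removes `weight_scale_inv` and leaves `weight_scale` in its place.

```python
from types import SimpleNamespace


def process_weights_after_loading(layer):
    """Hypothetical stand-in for the post-load step described in the review:
    the scale is re-registered as `weight_scale` and `weight_scale_inv` is deleted."""
    layer.weight_scale = layer.weight_scale_inv
    del layer.weight_scale_inv


def extract_scale_old(layer):
    # Pre-fix access pattern: reads an attribute that no longer exists.
    return layer.weight_scale_inv


def extract_scale_fixed(layer):
    # Post-fix access pattern: reads the attribute that exists after processing.
    return layer.weight_scale


# Toy "module" with placeholder values; real modules would hold tensors.
layer = SimpleNamespace(weight=[[1.0, 2.0]], weight_scale_inv=[0.5])
process_weights_after_loading(layer)

try:
    extract_scale_old(layer)
    old_access_failed = False
except AttributeError:
    # This is the error the review says would surface during warm-up.
    old_access_failed = True

fixed_scale = extract_scale_fixed(layer)
```

In this sketch the old accessor raises `AttributeError` once processing has run, while the fixed accessor returns the scale unchanged, which mirrors the one-attribute rename the review describes.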

@tlrmchlsmth tlrmchlsmth added the "ready" label (ONLY add when PR is ready to merge/full CI is needed) Sep 18, 2025
@tlrmchlsmth tlrmchlsmth enabled auto-merge (squash) September 18, 2025 21:26
@vllm-bot vllm-bot merged commit 9fac6aa into vllm-project:main Sep 18, 2025
9 of 13 checks passed
@mgoin mgoin deleted the lwilkinson/fix-deepgemm-warmup branch September 18, 2025 22:26
ywang96 pushed a commit to ywang96/vllm that referenced this pull request Sep 19, 2025
Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com>
debroy-rh pushed a commit to debroy-rh/vllm that referenced this pull request Sep 19, 2025
Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com>
chenxi-yang pushed a commit to chenxi-yang/vllm that referenced this pull request Sep 20, 2025
Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com>

Signed-off-by: Chenxi Yang <cxyang@fb.com>
FeiDaLI pushed a commit to FeiDaLI/vllm that referenced this pull request Sep 25, 2025
Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com>
charlifu pushed a commit to ROCm/vllm that referenced this pull request Sep 25, 2025
Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com>
Signed-off-by: charlifu <charlifu@amd.com>
3 participants