Skip to content

Conversation

LucasWilkinson
Copy link
Collaborator

@LucasWilkinson LucasWilkinson commented Sep 30, 2025

In the final release DeepseekV32 was being registered separately so the config override was not getting picked-up anymore

Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com>
@mergify mergify bot added the deepseek Related to DeepSeek models label Sep 30, 2025
Copy link
Contributor

@gemini-code-assist gemini-code-assist bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Code Review

This pull request addresses a bug where the specific configuration for DeepseekV3.2 models was not being applied. The changes involve renaming the configuration class to DeepseekV32ForCausalLM and updating the model configuration map accordingly. The logic for applying the custom KV cache settings has also been simplified. While the changes are generally good, I've identified a potential issue in the handling of the bfloat16 cache data type that could lead to unexpected behavior.

@LucasWilkinson LucasWilkinson changed the title [BugFix] Fix default kv-cache-default for DeepseekV3.2 [BugFix] Fix default kv-cache-dtype default for DeepseekV3.2 Sep 30, 2025
@LucasWilkinson LucasWilkinson added this to the v0.11.0 Cherry Picks milestone Sep 30, 2025
Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com>
@heheda12345
Copy link
Collaborator

CC @youkaichao @zyongye

@simon-mo simon-mo merged commit 8d7da92 into main Oct 1, 2025
9 checks passed
@simon-mo simon-mo deleted the lwilkinson/fix-defaults branch October 1, 2025 04:58
simon-mo pushed a commit that referenced this pull request Oct 1, 2025
Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com>
Signed-off-by: simon-mo <simon.mo@hey.com>
pdasigi pushed a commit to pdasigi/vllm that referenced this pull request Oct 2, 2025
yewentao256 pushed a commit that referenced this pull request Oct 3, 2025
Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com>
Signed-off-by: yewentao256 <zhyanwentao@126.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
deepseek Related to DeepSeek models
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants