
Conversation


@yongming-qin yongming-qin commented Nov 18, 2025

Purpose

Fix an inconsistency in _CONFIG_REGISTRY:
deepseek_v32 was the only entry using an eager class import:

deepseek_v32 = DeepseekV3Config

while all other entries use string-based lazy loading. This caused an unnecessary eager import and broke environments with older Transformers versions.

Fix:

  • Change to "DeepseekV3Config"
  • Move the import to vllm/transformers_utils/configs/__init__.py so LazyConfigDict can resolve it
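The lazy-resolution mechanism this fix relies on can be sketched with a minimal stand-in. Note this LazyConfigDict is a simplified illustration of the behavior described above, not vLLM's actual class, and the collections/OrderedDict names are placeholders for the configs module and config class:

```python
import importlib

class LazyConfigDict(dict):
    """Simplified stand-in: string values are resolved to classes on first
    access instead of being imported eagerly when the registry is defined."""

    def __init__(self, module_name, **entries):
        super().__init__(entries)
        self._module_name = module_name

    def __getitem__(self, key):
        value = super().__getitem__(key)
        if isinstance(value, str):
            # Import happens only here, on first lookup.
            module = importlib.import_module(self._module_name)
            # Fails with AttributeError if the module does not export the name.
            value = getattr(module, value)
        return value

# A string entry resolves lazily against the named module:
registry = LazyConfigDict("collections", example="OrderedDict")
print(registry["example"].__name__)  # -> OrderedDict

# A name the module does not export raises AttributeError at lookup time,
# which is why the class must be re-exported from the configs package:
broken = LazyConfigDict("collections", bad="NotExported")
try:
    broken["bad"]
except AttributeError:
    print("unresolved name")
```

This is why the second bullet matters: the string is only useful if the configs package actually exposes an attribute with that name.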

Before:

error message


Test Plan

1. Verify Transformers does not know deepseek_v32:

from transformers.models.auto.configuration_auto import CONFIG_MAPPING
[k for k in CONFIG_MAPPING if "deepseek" in k.lower()]

Result:
['deepseek_v2', 'deepseek_v3', 'deepseek_vl', 'deepseek_vl_hybrid']

2. Verify vLLM now resolves it lazily:

from vllm.transformers_utils.config import get_config
get_config("deepseek-ai/DeepSeek-V3.2-Exp-Base", trust_remote_code=False)

Produces the expected Transformers warning, confirming correct mapping.


Test Result

  • vLLM version: 0.11.1rc7.dev217
  • Transformers: 4.57.1
  • Before: ImportError on older Transformers versions
  • After: Lazy loading works; deepseek_v32 → DeepseekV3Config resolves correctly

Signed-off-by: Luke <yq0536@gmail.com>
@github-actions

👋 Hi! Thank you for contributing to the vLLM project.

💬 Join our developer Slack at https://slack.vllm.ai to discuss your PR in #pr-reviews, coordinate on features in #feat- channels, or join special interest groups in #sig- channels.

Just a reminder: PRs do not trigger a full CI run by default. Instead, only fastcheck CI runs, which covers a small, essential subset of tests to quickly catch errors.

You can ask your reviewers to trigger select CI tests on top of fastcheck CI.

Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging.

To run CI, PR reviewers can either: Add ready label to the PR or enable auto-merge.

If you have any questions, please reach out to us on Slack at https://slack.vllm.ai.

🚀

@mergify mergify bot added the deepseek Related to DeepSeek models label Nov 18, 2025
Contributor

@gemini-code-assist gemini-code-assist bot left a comment


Code Review

This pull request aims to fix an inconsistency in _CONFIG_REGISTRY by using a string reference for DeepseekV3Config to enable lazy loading. While the intention is correct and addresses a potential issue with eager imports, the current implementation is incomplete and will cause a runtime AttributeError. This highlights the importance of testing even for seemingly small changes. My review includes a critical comment that explains why the change will fail and provides guidance on how to correctly implement the lazy loading for this configuration. The fix involves a small change in a different file to make DeepseekV3Config available to the lazy loading mechanism.

  chatglm="ChatGLMConfig",
  deepseek_vl_v2="DeepseekVLV2Config",
- deepseek_v32=DeepseekV3Config,
+ deepseek_v32="DeepseekV3Config",
Contributor


critical

This change to use a string reference for lazy loading is a good idea, but it will cause a runtime AttributeError as implemented.

The LazyConfigDict expects to find the config class within the vllm.transformers_utils.configs module when a string is provided. However, DeepseekV3Config is imported from transformers and is not exposed in that module after your change to remove the direct import in this file.

To fix this, you need to make DeepseekV3Config available in vllm.transformers_utils.configs. You can achieve this by adding from transformers import DeepseekV3Config to vllm/transformers_utils/configs/__init__.py and including "DeepseekV3Config" in its __all__ list. This will preserve the lazy loading behavior, as vllm.transformers_utils.configs is only imported when a config is first accessed from the registry.
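The suggested edit can be sketched as follows. This is an assumed shape for the file, showing only the relevant lines; the existing exports are elided:

```python
# vllm/transformers_utils/configs/__init__.py (sketch of the suggested edit)

# Re-export so LazyConfigDict can resolve the string "DeepseekV3Config".
# This import only runs when the configs package itself is first imported,
# so the registry's lazy-loading behavior is preserved.
from transformers import DeepseekV3Config

__all__ = [
    # ... existing exports ...
    "DeepseekV3Config",
]
```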

Author


Makes sense to me. I have added from transformers import DeepseekV3Config in vllm/transformers_utils/configs/__init__.py and added a comment explaining this case.


@chatgpt-codex-connector chatgpt-codex-connector bot left a comment


💡 Codex Review

Here are some automated review suggestions for this pull request.


Comment on lines 81 to 84
  chatglm="ChatGLMConfig",
  deepseek_vl_v2="DeepseekVLV2Config",
- deepseek_v32=DeepseekV3Config,
+ deepseek_v32="DeepseekV3Config",
  flex_olmo="FlexOlmoConfig",


P1: DeepseekV3 configs now fail to resolve

The registry now stores deepseek_v32 as the string "DeepseekV3Config", but LazyConfigDict.__getitem__ only resolves strings from vllm.transformers_utils.configs. That package does not export DeepseekV3Config, so any model with model_type deepseek_v32 will raise AttributeError when loading the config, even on transformers versions that provide the class. Previously the config loaded correctly when the class was available; after this change Deepseek V3 models cannot be parsed at all.


Author


Solved in the reply to gemini-code-assist[bot]
https://github.com/vllm-project/vllm/pull/28958/files#r2539426365

@yongming-qin
Author

Checked the AI suggestion and updated the PR description.

Signed-off-by: Luke <yq0536@gmail.com>