Support fp4 blockwise load by Jintao-Huang · Pull Request #96 · modelscope/mcore-bridge

Jintao-Huang · 2026-05-26T08:08:43Z

No description provided.

gemini-code-assist

Code Review

This pull request refactors weight loading in GPTBridge by extracting _set_param and introduces state dict key conversions for DeepSeek-V4. It also adds a utility function fp4_to_fp8 to unpack FP4 tensors. However, the newly added fp4_to_fp8 function and _check_fp4 flag are currently unused in gpt_bridge.py, which will lead to shape mismatch errors when loading packed FP4 tensors. Additionally, creating the LUT tensor on the target device during every call to fp4_to_fp8 introduces unnecessary overhead and should be cached.
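The shape mismatch and the LUT caching point can be illustrated with a small framework-free sketch. It is not the PR's fp4_to_fp8; the names get_fp4_lut and unpack_fp4 are hypothetical, the low-nibble-first ordering is an assumption, and the table uses the standard E2M1 value set (0, 0.5, 1, 1.5, 2, 3, 4, 6 with a sign bit). Plain Python tuples stand in for tensors, and the device argument only serves as a cache key.

```python
from functools import lru_cache

# Magnitudes for the three low bits of a 4-bit E2M1 code; bit 3 is the sign.
_E2M1_MAGNITUDES = (0.0, 0.5, 1.0, 1.5, 2.0, 3.0, 4.0, 6.0)


@lru_cache(maxsize=None)
def get_fp4_lut(device: str = "cpu"):
    """Build the 16-entry lookup table once per device key and reuse it.

    Hypothetical helper: in a tensor framework this is where the table
    would be materialized on `device`, so caching avoids repeating that
    work (and the host-to-device copy) on every call.
    """
    negatives = tuple(-m for m in _E2M1_MAGNITUDES)
    return _E2M1_MAGNITUDES + negatives


def unpack_fp4(packed: bytes, device: str = "cpu"):
    """Expand packed FP4 codes (two per byte) into a list of floats.

    Assumption: the low nibble holds the first element of each pair.
    """
    lut = get_fp4_lut(device)
    values = []
    for byte in packed:
        values.append(lut[byte & 0x0F])         # first element: low nibble
        values.append(lut[(byte >> 4) & 0x0F])  # second element: high nibble
    return values


if __name__ == "__main__":
    packed_row = bytes([0x21, 0xF7])
    print(len(packed_row), "packed bytes ->", unpack_fp4(packed_row))
```

The output has twice as many elements as the packed input, which is why a packed FP4 tensor copied directly into a full-size parameter would fail a shape check along the packed dimension.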

Jintao-Huang · 2026-05-26T08:34:59Z

/gemini review

gemini-code-assist

Code Review

This pull request refactors parameter setting logic by introducing a helper method _set_param and adds support for FP4-to-FP8 dequantization, specifically for DeepSeek V4 models. The feedback highlights critical improvements to ensure runtime stability and performance: adding a defensive check to prevent an AttributeError when scale_inv is None, caching the LUT tensor in fp4_to_fp8 to avoid redundant host-to-device transfers, and adding checks to prevent division issues during block size calculation.
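A rough sketch of the kind of guards mentioned above, under stated assumptions: infer_block_size is a hypothetical helper, not a function from mcore-bridge, and shapes are passed as plain tuples rather than tensors. It returns None when no scale_inv is present and rejects scale dimensions that are zero or that do not divide the weight dimension evenly; the real loader may instead allow a ragged final block.

```python
from typing import Optional, Sequence, Tuple


def infer_block_size(
    weight_shape: Sequence[int],
    scale_shape: Optional[Sequence[int]],
) -> Optional[Tuple[int, ...]]:
    """Derive the per-dimension block size from weight and scale shapes.

    Hypothetical helper for illustration only.
    """
    # Guard 1: an unquantized weight has no scale_inv, so there is nothing
    # to read a shape from; return early instead of touching attributes.
    if scale_shape is None:
        return None

    if len(weight_shape) != len(scale_shape):
        raise ValueError(
            f"rank mismatch: weight {tuple(weight_shape)} "
            f"vs scale {tuple(scale_shape)}"
        )

    block = []
    for w_dim, s_dim in zip(weight_shape, scale_shape):
        # Guard 2: avoid dividing by zero on an empty scale dimension.
        if s_dim <= 0:
            raise ValueError(f"scale dimension must be positive, got {s_dim}")
        # Guard 3 (assumption of this sketch): require an exact split.
        if w_dim % s_dim != 0:
            raise ValueError(f"{w_dim} is not divisible by {s_dim}")
        block.append(w_dim // s_dim)
    return tuple(block)


if __name__ == "__main__":
    print(infer_block_size((256, 512), (2, 16)))
    print(infer_block_size((256, 512), None))
```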

Jintao-Huang added 2 commits May 26, 2026 15:47

support fp4 blockwise load

51562d4

update

45307fb

gemini-code-assist Bot reviewed May 26, 2026

Comment thread src/mcore_bridge/bridge/gpt_bridge.py

Comment thread src/mcore_bridge/utils/dequantizer.py

update

da3ceda

hjh0119 approved these changes May 26, 2026

tastelikefeet approved these changes May 26, 2026

update

a91d71e

gemini-code-assist Bot reviewed May 26, 2026

Comment thread src/mcore_bridge/model/gpts/deepseek_v4.py

Comment thread src/mcore_bridge/utils/dequantizer.py

Comment thread src/mcore_bridge/utils/dequantizer.py

Jintao-Huang merged commit b7fab88 into modelscope:main May 26, 2026
1 check passed

Jintao-Huang merged 4 commits into modelscope:main from Jintao-Huang:support_fp4_blockwise_load

3 participants
