[model] support bailing by Jintao-Huang · Pull Request #55 · modelscope/mcore-bridge

Jintao-Huang · 2026-04-29T03:30:58Z

No description provided.

gemini-code-assist

Code Review

This pull request introduces support for the bailing_moe model by adding it to the model constants, exporting it in the GPTs module, and implementing the BailingMoeBridge class. Additionally, the .gitignore was updated and the configuration parser now includes score_function in its mapping. Feedback indicates that the bailing_moe model type should be explicitly handled in the configuration conversion logic to ensure that qk_layernorm is enabled and the router score function is correctly set to sigmoid.

gemini-code-assist · 2026-04-29T03:32:45Z

    'q_lora_rank': ['q_lora_rank'],
    'kv_lora_rank': ['kv_lora_rank'],
-    'moe_router_score_function': ['scoring_func', 'moe_router_use_sigmoid'],
+    'moe_router_score_function': ['scoring_func', 'moe_router_use_sigmoid', 'score_function'],


While adding score_function to the config_mapping is correct, the bailing_moe model type should also be explicitly handled in the hf_to_mcore_config function (around line 120 and 164) to ensure qk_layernorm is enabled and the router score function is set to sigmoid. The bridge definition in bailing_moe.py includes QK normalization keys and expert bias, which strongly suggests these configurations are required for the model to function correctly in Megatron-Core.

Jintao-Huang · 2026-05-08T14:00:54Z

/gemini review

gemini-code-assist

Code Review

This pull request introduces support for the Bailing MoE model, including a specialized attention implementation to handle its specific QKV weight layout, updates to the configuration parser for MoE parameters, and the addition of integration tests. The review feedback identifies missing bias handling in the Bailing MoE attention layer and state dictionary mapping, which could lead to incorrect model behavior. It also points out that an existing test case was accidentally disabled.

support bailing

6168a17

gemini-code-assist Bot reviewed Apr 29, 2026

View reviewed changes

Jintao-Huang added 5 commits April 29, 2026 14:29

fix

796efc5

update

ea099f0

update

d10ced2

fix

d107b02

Merge branch 'main' into support_bailing

c609fbc

Jintao-Huang mentioned this pull request May 8, 2026

期望支持bailing_moe的megatron modelscope/ms-swift#9274

Closed

1 task

support bailing_moe

6355d85

gemini-code-assist Bot reviewed May 8, 2026

View reviewed changes

Comment thread src/mcore_bridge/model/gpts/bailing_moe.py

Comment thread src/mcore_bridge/model/gpts/bailing_moe.py

Comment thread tests/test_llm.py

hjh0119 approved these changes May 8, 2026

View reviewed changes

fix

a7038c5

Jintao-Huang merged commit 62e0100 into modelscope:main May 8, 2026
1 check passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[model] support bailing#55

[model] support bailing#55
Jintao-Huang merged 8 commits into
modelscope:mainfrom
Jintao-Huang:support_bailing

Jintao-Huang commented Apr 29, 2026

Uh oh!

gemini-code-assist Bot left a comment

Uh oh!

gemini-code-assist Bot Apr 29, 2026

Uh oh!

Jintao-Huang commented May 8, 2026

Uh oh!

gemini-code-assist Bot left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

Jintao-Huang commented Apr 29, 2026

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

gemini-code-assist Bot Apr 29, 2026

Choose a reason for hiding this comment

Uh oh!

Jintao-Huang commented May 8, 2026

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants