Skip to content

[bugfix] fix mtp keys#9163

Merged
Jintao-Huang merged 2 commits into
modelscope:mainfrom
Jintao-Huang:fix_mtp_keys
Apr 21, 2026
Merged

[bugfix] fix mtp keys#9163
Jintao-Huang merged 2 commits into
modelscope:mainfrom
Jintao-Huang:fix_mtp_keys

Conversation

@Jintao-Huang
Copy link
Copy Markdown
Collaborator

No description provided.

Copy link
Copy Markdown
Contributor

@gemini-code-assist gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Code Review

This pull request introduces a new get_text_config method in HfConfigFactory to retrieve text-specific configurations and updates the weight-saving logic in swift/megatron/init.py to support multiple MTP layer attribute names. Key feedback includes correcting a typo in the fallback assignment for num_nextn_predict_layers and enhancing get_text_config to handle dictionary-based configurations for better consistency and robustness.

Comment thread swift/megatron/init.py Outdated
Comment thread swift/utils/hf_config.py
@Jintao-Huang Jintao-Huang merged commit e570603 into modelscope:main Apr 21, 2026
2 of 3 checks passed
Jintao-Huang added a commit that referenced this pull request Apr 23, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants