update mlp_padding_free #9262

Merged
Jintao-Huang merged 2 commits into modelscope:main from Jintao-Huang:update_mlp_padding_free on May 5, 2026

Conversation

@Jintao-Huang (Collaborator)

No description provided.

@gemini-code-assist (Bot, Contributor) left a comment

Code Review

This pull request updates the documentation and logic for mlp_padding_free to support its use with sequence_parallel, provided that mcore-bridge>=1.3.0.dev is installed. It also clarifies that mlp_padding_free remains incompatible with context_parallel. Feedback suggests updating the error message in megatron_args.py to remove the mention of sequence_parallel as incompatible and adding a descriptive hint to the require_version call.
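
The compatibility rules summarized in this review can be sketched roughly as follows. This is a minimal illustration, not the code in megatron_args.py: the `Args` dataclass, the `validate_mlp_padding_free` function, the `context_parallel_size` field name, and both message strings are hypothetical. The version checker is passed in as a parameter; in practice it might be something like `require_version(requirement, hint)` from `transformers.utils.versions`, but which helper the PR actually calls is an assumption here.

```python
from dataclasses import dataclass
from typing import Callable, Optional


@dataclass
class Args:
    """Hypothetical stand-in for the Megatron argument object."""
    mlp_padding_free: bool = False
    sequence_parallel: bool = False
    context_parallel_size: int = 1  # assumed field name; > 1 means context parallel is on


def validate_mlp_padding_free(
    args: Args,
    require_version: Callable[[str, Optional[str]], None],
) -> None:
    """Apply the rules described in the review summary (illustrative only)."""
    if not args.mlp_padding_free:
        return
    # context_parallel remains incompatible, so the error names only that option.
    if args.context_parallel_size > 1:
        raise ValueError('mlp_padding_free is not compatible with context_parallel.')
    # sequence_parallel is allowed, but only with a new enough mcore-bridge.
    if args.sequence_parallel:
        require_version(
            'mcore-bridge>=1.3.0.dev',
            'Using mlp_padding_free with sequence_parallel requires '
            'mcore-bridge>=1.3.0.dev.',  # hypothetical hint text
        )


# Usage with a stub checker that records what would be verified.
seen = []
validate_mlp_padding_free(
    Args(mlp_padding_free=True, sequence_parallel=True),
    lambda requirement, hint=None: seen.append((requirement, hint)),
)
```

Injecting the checker keeps the sketch runnable without mcore-bridge installed and makes the version requirement easy to verify in a unit test.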

Comment thread on swift/megatron/arguments/megatron_args.py (outdated)

@Jintao-Huang merged commit ac237dc into modelscope:main on May 5, 2026
1 of 3 checks passed

2 participants