Support zero3 hierarchical gather in the ref sync callback #9170
Conversation
Code Review
This pull request implements reference model weight synchronization for RLHF trainers. It introduces a SyncRefModelCallback and a _sync_ref_model_weights method within the RolloutTrainerMixin to handle weight mixing, including support for DeepSpeed ZeRO-3. Feedback indicates that the initialization of parameter groups happens too early, which may lead to incorrect configurations when LoRA is enabled. Additionally, the synchronization method contains performance inefficiencies due to redundant dictionary creations and iterations inside loops.
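The weight mixing described above can be sketched as follows. This is an illustrative reduction, not the PR's actual code: the helper name `mix_ref_weights` and the `alpha` coefficient are assumptions, plain floats stand in for tensors, and under DeepSpeed ZeRO-3 each parameter would first be gathered from its shards before being read.

```python
def mix_ref_weights(policy_state, ref_state, alpha=0.6):
    """Blend policy weights into the reference model in place:
    ref = alpha * policy + (1 - alpha) * ref.

    Plain floats stand in for parameter tensors here; with ZeRO-3,
    each parameter would be materialized (e.g. inside a
    deepspeed.zero.GatheredParameters context) before mixing.
    """
    for name, w in policy_state.items():
        if name not in ref_state:
            raise KeyError(f"missing parameter in ref model: {name}")
        ref_state[name] = alpha * w + (1.0 - alpha) * ref_state[name]
    return ref_state
```

Note that building the name-to-weight dictionaries once, outside the mixing loop, avoids the redundant per-iteration dictionary creation flagged in the review.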
/gemini review
Code Review
This pull request introduces a custom SyncRefModelCallback and a _sync_ref_model_weights method within RolloutTrainerMixin to support reference model weight synchronization during training. It also refactors the initialization of parameter groups to ensure they are available for this process. The review feedback suggests improving the robustness of the synchronization logic by ensuring empty parameter groups result in a no-op rather than defaulting to all parameters, and by enhancing error diagnostics to identify specific missing parameters when using DeepSpeed.
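The two robustness points above can be sketched together. The helper name `resolve_sync_params` and the `None`-vs-empty convention are illustrative assumptions, not the PR's API: an explicitly empty group returns nothing instead of defaulting to all parameters, and missing names are reported individually rather than failing opaquely under DeepSpeed sharding.

```python
def resolve_sync_params(named_params, group):
    """Select the parameters to synchronize for one parameter group.

    Conventions assumed for this sketch:
      * group == []   -> no-op (sync nothing), NOT "all parameters"
      * group is None -> no group configured, sync everything
      * otherwise     -> sync exactly the named parameters, and list
                         every missing name to ease diagnostics.
    """
    if group is not None and len(group) == 0:
        return []  # explicit empty group: deliberately sync nothing
    if group is None:
        return list(named_params.values())
    missing = [n for n in group if n not in named_params]
    if missing:
        # Naming each absent parameter is far easier to debug than a
        # generic failure once weights are sharded across ranks.
        raise KeyError(f"parameters not found in ref model: {missing}")
    return [named_params[n] for n in group]
```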
/gemini review
Code Review
This pull request introduces a custom SyncRefModelCallback and the _sync_ref_model_weights method within RolloutTrainerMixin to support reference model weight synchronization. It also moves the initialization of parameter groups to prepare_rollout to ensure availability regardless of vLLM usage. A review comment suggests refactoring the _sync_ref_model_weights method to reduce code duplication between the DeepSpeed ZeRO-3 and standard execution paths.
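One way to realize the suggested deduplication is to keep a single mixing loop and vary only the context manager that materializes parameters. This is a sketch under stated assumptions, not the PR's implementation: the ZeRO-3 path would pass a gather context (such as `deepspeed.zero.GatheredParameters` with `modifier_rank=0`), while the standard path uses a no-op context.

```python
import contextlib

def sync_ref_model_weights(policy_state, ref_state, alpha, gather_ctx=None):
    """One shared code path for both execution modes.

    The only difference between ZeRO-3 and standard execution is how
    parameters are materialized: callers supply a gather context for
    ZeRO-3, and the standard path falls back to a no-op nullcontext.
    Floats stand in for tensors in this sketch.
    """
    ctx = gather_ctx if gather_ctx is not None else contextlib.nullcontext()
    with ctx:
        for name, w in policy_state.items():
            ref_state[name] = alpha * w + (1.0 - alpha) * ref_state[name]
    return ref_state
```

Factoring the loop out this way means a fix to the mixing arithmetic applies to both paths at once, which is the maintainability benefit the review comment is after.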
fix #8095