[gkd] support buffers & fix some bugs by hjh0119 · Pull Request #9278 · modelscope/ms-swift

hjh0119 · 2026-05-07T03:06:50Z

1. Support rollout buffers so that a single generation pass can sample data for multiple training batches (generation_batch_size / steps_per_generation)
2. Fix a compatibility issue between GKD and padding_free
3. Fix a bug in GKD multimodal inference when calling the vLLM server API #9222
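
To illustrate item 1, here is a minimal sketch of a rollout buffer. It is not the code from this PR: the `RolloutBuffer` class, `next_batch`, and `generate_fn` are hypothetical names, and the sketch assumes the generated outputs split evenly into `steps_per_generation` chunks. It generates once per cycle and serves one chunk per training step.

```python
from typing import Callable, List, Sequence


class RolloutBuffer:
    """Hypothetical helper: generate once, then serve one chunk per training step."""

    def __init__(self, generate_fn: Callable[[Sequence[str]], List[str]],
                 steps_per_generation: int) -> None:
        if steps_per_generation < 1:
            raise ValueError('steps_per_generation must be >= 1')
        self.generate_fn = generate_fn
        self.steps_per_generation = steps_per_generation
        self._chunks: List[List[str]] = []
        self._step = 0

    def next_batch(self, prompts: Sequence[str]) -> List[str]:
        spg = self.steps_per_generation
        # Regenerate only at the start of each generation cycle;
        # on the other steps `prompts` is ignored and the buffer is reused.
        if self._step % spg == 0:
            outputs = self.generate_fn(prompts)
            if len(outputs) % spg != 0:
                raise ValueError('generation batch must be divisible by steps_per_generation')
            size = len(outputs) // spg
            self._chunks = [outputs[i * size:(i + 1) * size] for i in range(spg)]
        batch = self._chunks[self._step % spg]
        self._step += 1
        return batch
```

In this sketch, `prompts` plays the role of a full generation batch, and each call to `next_batch` corresponds to one training step.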

gemini-code-assist

Code Review

This pull request refactors and standardizes the handling of generation batch parameters across GRPO and GKD trainers by centralizing logic in a shared mixin. Key improvements include the implementation of buffered inputs for GKD to allow sample reuse across training steps and the addition of multimodal support for fetching teacher logprobs via an external API. Feedback focuses on improving debuggability by recommending that the code log warnings when falling back to raw images during multimodal encoding or when retrying API requests without specific processor arguments.
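
The logging feedback could look roughly like the sketch below. This is only an illustration of the suggested pattern, not the trainer's actual code: `encode_image_with_fallback` and the `encoder` callable are hypothetical names, and the fallback behaviour (returning the raw image unchanged) is an assumption.

```python
import logging
from typing import Any, Callable

logger = logging.getLogger(__name__)


def encode_image_with_fallback(image: Any, encoder: Callable[[Any], Any]) -> Any:
    """Hypothetical helper: try to encode an image, warn and return it raw on failure."""
    try:
        return encoder(image)
    except Exception as exc:
        # Log the fallback so that silent degradation is visible when debugging.
        logger.warning('Image encoding failed (%s); falling back to the raw image.', exc)
        return image
```

The same pattern would apply to an API request that is retried without processor arguments: emit a warning at the point where the retry happens.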

hjh0119 added 2 commits May 7, 2026 10:47

fix gkd

d3a6e5b

Merge branch 'main' into fix-gkd

01f741c

gemini-code-assist Bot reviewed May 7, 2026

Comment thread swift/rlhf_trainers/gkd_trainer.py Outdated

Comment thread swift/rlhf_trainers/gkd_trainer.py

fix

042cafc

Jintao-Huang approved these changes May 7, 2026

hjh0119 merged commit b78f05d into modelscope:main May 7, 2026
1 of 3 checks passed

hjh0119 deleted the fix-gkd branch May 7, 2026 06:57

hjh0119 mentioned this pull request May 7, 2026

After updating to 4.3.1, the GKD training teacher API does not accept the "videos" parameter #9222

Closed

1 task

hjh0119 merged 3 commits into modelscope:main from hjh0119:fix-gkd
