[bugfix] fix grpo generate #9183
Conversation
Code Review
This pull request updates the GRPOTrainer to set remove_unused_columns to True for the inference template, which prevents potential crashes in the TransformersEngine. The reviewer suggests extending this change to the vLLM engine path in RolloutTrainerMixin to maintain consistency and prevent similar issues.
```python
infer_template = copy(self.template)
infer_template.padding_free = False
infer_template.sequence_parallel_size = 1
infer_template.remove_unused_columns = True
```
This fix correctly addresses potential crashes in TransformersEngine when remove_unused_columns=False is set in the training configuration. By explicitly setting it to True for the inference template, you ensure that the data_collator filters out non-model inputs before generation.
However, the same issue likely affects the vLLM engine path. In RolloutTrainerMixin._prepare_vllm_engine (line 202 of rollout_mixin.py), a similar template copy is created but remove_unused_columns is not set to True. To maintain consistency and prevent similar issues when using vLLM, you should consider applying this fix there as well.
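A minimal sketch of what mirroring the fix in the vLLM path could look like. The `Template` dataclass here is a hypothetical stand-in for the real swift template object, and `prepare_infer_template` is an illustrative helper, not the actual `RolloutTrainerMixin._prepare_vllm_engine`:

```python
from copy import copy
from dataclasses import dataclass


# Hypothetical stand-in for the swift template; only the attributes
# touched by the diff above are modeled.
@dataclass
class Template:
    padding_free: bool = True
    sequence_parallel_size: int = 2
    remove_unused_columns: bool = False


def prepare_infer_template(template: Template) -> Template:
    # Same recipe as the TransformersEngine fix; the review suggests
    # applying it in the vLLM engine path as well, so the data collator
    # also drops non-model columns before vLLM generation.
    infer_template = copy(template)
    infer_template.padding_free = False
    infer_template.sequence_parallel_size = 1
    infer_template.remove_unused_columns = True
    return infer_template


train_template = Template()
infer_template = prepare_infer_template(train_template)
```

Copying first keeps the training template untouched, so a training run with `remove_unused_columns=False` still sees its extra columns.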
/gemini review
Code Review
This pull request introduces column filtering in the GRPOTrainer: an allow-list of extra keys is defined, and all other entries are pruned from `_extra_kwargs` during batch preparation and model-input construction. It also enables remove_unused_columns on the inference template. The review feedback recommends using a set for O(1) key lookups and making the filtering conditional on the template's configuration to avoid unnecessary processing.
```python
self._filtered_keys = [
    'prompt_id', 'request_id', 'response_token_ids', 'finish_reason', 'is_truncated', 'add_eos'
]
```
Defining a set version of _filtered_keys would allow for O(1) lookup performance in the training loop. Keeping the list version is still useful for compatibility with list concatenations (like in _prepare_model_inputs).
Suggested change:

```diff
 self._filtered_keys = [
     'prompt_id', 'request_id', 'response_token_ids', 'finish_reason', 'is_truncated', 'add_eos'
 ]
+self._filtered_keys_set = set(self._filtered_keys)
```
```python
for encoded_inputs in batch_encoded_inputs:
    extra_kwargs = encoded_inputs.get('_extra_kwargs') or {}
    for k in list(extra_kwargs.keys()):
        if k not in self._filtered_keys:
            extra_kwargs.pop(k)
```
This filtering logic is only necessary when template.remove_unused_columns is False. If it's True (the default), _extra_kwargs won't be merged into the collated batch by the template's data collator. Adding this check avoids unnecessary iterations over the batch samples. Additionally, using the set version of filtered keys improves lookup performance.
Suggested change:

```diff
-for encoded_inputs in batch_encoded_inputs:
-    extra_kwargs = encoded_inputs.get('_extra_kwargs') or {}
-    for k in list(extra_kwargs.keys()):
-        if k not in self._filtered_keys:
-            extra_kwargs.pop(k)
+if not template.remove_unused_columns:
+    for encoded_inputs in batch_encoded_inputs:
+        extra_kwargs = encoded_inputs.get('_extra_kwargs') or {}
+        for k in list(extra_kwargs.keys()):
+            if k not in self._filtered_keys_set:
+                extra_kwargs.pop(k)
```
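The conditional filtering can be exercised in isolation. The sketch below uses plain dicts in place of the real template and encoded inputs, and a module-level set in place of `self._filtered_keys_set`; the `debug_info` key is a made-up example of an extra kwarg that should be pruned:

```python
# Allow-list of extra kwargs to keep (mirrors the trainer's list).
FILTERED_KEYS_SET = {
    'prompt_id', 'request_id', 'response_token_ids', 'finish_reason', 'is_truncated', 'add_eos'
}


def prune_extra_kwargs(batch_encoded_inputs, remove_unused_columns):
    # When the template already removes unused columns, _extra_kwargs never
    # reaches the collated batch, so the whole pass can be skipped.
    if remove_unused_columns:
        return batch_encoded_inputs
    for encoded_inputs in batch_encoded_inputs:
        extra_kwargs = encoded_inputs.get('_extra_kwargs') or {}
        # Snapshot the keys so popping during iteration is safe.
        for k in list(extra_kwargs.keys()):
            if k not in FILTERED_KEYS_SET:
                extra_kwargs.pop(k)
    return batch_encoded_inputs


# Hypothetical sample: 'debug_info' is not on the allow-list and is dropped.
batch = [{'_extra_kwargs': {'prompt_id': 7, 'debug_info': 'x'}}]
prune_extra_kwargs(batch, remove_unused_columns=False)

# With remove_unused_columns=True the pass is a no-op.
batch_skipped = [{'_extra_kwargs': {'debug_info': 'x'}}]
prune_extra_kwargs(batch_skipped, remove_unused_columns=True)
```

The `list(extra_kwargs.keys())` copy matters: popping from a dict while iterating its live key view would raise a RuntimeError.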