Open
Labels
needs more info: Additional information or clarification is required to proceed
Description
Base model: Qwen2.5-VL-7B-Instruct
During GRPO training, enabling sequence_parallel_size > 1 raises the following error:
[rank5]: File "/usr/local/lib/python3.11/site-packages/swift/trainers/rlhf_trainer/grpo_trainer.py", line 1428, in _compute_loss_and_metrics
[rank5]: per_token_loss1 = coef_1 * advantages.unsqueeze(1)
[rank5]: ~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~
[rank5]: RuntimeError: Expected all tensors to be on the same device, but found at least two devices, cuda:5 and cuda:4!
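The traceback shows `coef_1` and `advantages` living on different GPUs (cuda:5 and cuda:4) when they are multiplied. A minimal sketch of a possible local workaround is below, assuming the only problem is tensor placement. The helper name `align_and_scale` is hypothetical and is not part of swift, and this is not the maintainers' fix: if `advantages` ends up on another rank's device because of how it is gathered or split under sequence parallelism, moving it may hide a deeper bug.

```python
import torch


def align_and_scale(coef_1: torch.Tensor, advantages: torch.Tensor) -> torch.Tensor:
    """Hypothetical helper: multiply per-token coefficients by per-sequence advantages.

    coef_1:     shape (batch, seq_len)
    advantages: shape (batch,)
    """
    # Report the mismatch so it is visible in the logs instead of silently patched.
    if advantages.device != coef_1.device:
        print(f"device mismatch: coef_1 on {coef_1.device}, advantages on {advantages.device}")
        # Assumption: copying advantages to coef_1's device is semantically safe here.
        advantages = advantages.to(coef_1.device)
    # Same expression as the failing line in the traceback.
    return coef_1 * advantages.unsqueeze(1)


if __name__ == "__main__":
    # CPU-only demonstration; on a multi-GPU node the two inputs could sit on different cuda devices.
    coef = torch.ones(2, 3)
    adv = torch.tensor([1.0, -1.0])
    print(align_and_scale(coef, adv))
```

Printing `coef_1.device` and `advantages.device` just before the failing line should confirm which tensor is misplaced. That information would also help when answering the "needs more info" request.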