GRPO MCORE Path: Improve MoE Training MFU #985

Closed

Assignees

Labels

Performancedeepseekqa_rcca_donet-mcore

opened

on Aug 26, 2025

Opening this issue to compare and align GRPO mcore-training MFU in RL with pre-training MFU in Megatron-LM, for MoE models: DeepSeek V3, Qwen 30B, Qwen 235 B.

Metadata

Assignees

guyueh1

Labels

Performancedeepseekqa_rcca_donet-mcore

Type

No type

Fields

No fields configured for issues without a type.

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests