You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
This commit was created on GitHub.com and signed with GitHub’s verified signature.
Introduced a new flexible and robust multi-backend mechanism and updated vendor adaptation methods.
Enabled heterogeneous prefill-decoding disaggregation across vendor chips within a single instance via FlagCX (beta).
Upgraded DeepSeek-v3 pre-training with the new Megatron-LM and added heterogeneous pre-training across different chips for MoE models like DeepSeek-v3.