Skip to content

v0.8.0

Choose a tag to compare

@aoyulong aoyulong released this 30 Apr 11:24
· 1 commit to release/v0.8.0 since this release
779a1e3
  • Introduced a new flexible and robust multi-backend mechanism and updated vendor adaptation methods.
  • Enabled heterogeneous prefill-decoding disaggregation across vendor chips within a single instance via FlagCX (beta).
  • Upgraded DeepSeek-v3 pre-training with the new Megatron-LM and added heterogeneous pre-training across different chips for MoE models like DeepSeek-v3.