Skip to content

v1.4.1

Choose a tag to compare

@Jintao-Huang Jintao-Huang released this 27 May 15:23
· 18 commits to main since this release

中文版

新特性

  1. 新增 model_type 支持:gemma4、deepseek_v4。
  2. README 新增使用 Mcore-Bridge 创建模型并执行 forward、计算损失的最简示例。
  3. 兼容 megatron-core main 与 dev 分支。

English Version

New Features

  1. Added model_type support for: gemma4, deepseek_v4.
  2. Added a minimal example in README demonstrating how to create a model using Mcore-Bridge to perform forward pass and compute loss.
  3. Compatible with both megatron-core main and dev branches.

What's Changed

Full Changelog: v1.4.0...v1.4.1