v1.4.2
New Features
- Add model_type support: bailing_hybrid.
- Fix abnormal loss for olmoe/bailing_moe when TP > 1.
What's Changed
- [bugfix] fix bug by @Jintao-Huang in #99
- [bugfix] fix qwen3_next norm sp by @Jintao-Huang in #100
- [model] Support bailing_hybrid by @Jintao-Huang in #85
- refactor olmoe by @Jintao-Huang in #101
- [bugfix] fix npu GDN by @Jintao-Huang in #103
Full Changelog: v1.4.1...v1.4.2