Highlights
- New model support: Kimi-K2.5 / K2.6 VLM training and MCore↔HF checkpoint conversion
- Wan2.2 Diffusion: Data packing training support; FSDP + context parallelism (ring & ulysses)
- GR00T-N1.6 VLA: Full-iteration / per-microbatch CUDA graph for accelerated training
- INT4 checkpoint dequantization: Support HF compressed-tensors INT4 format
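As a rough illustration of what INT4 dequantization involves, here is a minimal pure-Python sketch. It is not the LoongForge implementation and may not match the exact compressed-tensors layout. It assumes eight 4-bit values packed into one 32-bit word (low nibble first), stored with an offset of 8, and one symmetric scale per group. The function names `pack_int4`, `unpack_int4`, and `dequantize` are hypothetical.

```python
def pack_int4(values):
    """Pack eight signed 4-bit values (-8..7) into one 32-bit word.

    Assumed layout: low nibble first, stored as value + 8 (offset binary).
    """
    if len(values) != 8:
        raise ValueError("expected exactly 8 values per 32-bit word")
    word = 0
    for i, v in enumerate(values):
        if not -8 <= v <= 7:
            raise ValueError("value out of INT4 range")
        word |= ((v + 8) & 0xF) << (4 * i)
    return word


def unpack_int4(word):
    """Split one 32-bit word back into eight signed 4-bit values."""
    return [((word >> (4 * i)) & 0xF) - 8 for i in range(8)]


def dequantize(words, scales, group_size=8):
    """Symmetric group-wise dequantization: weight = q * scale[group]."""
    quantized = []
    for word in words:
        quantized.extend(unpack_int4(word))
    return [q * scales[i // group_size] for i, q in enumerate(quantized)]


# Round-trip one group of eight weights with a single scale.
packed = pack_int4([-8, -1, 0, 1, 7, 3, -4, 2])
weights = dequantize([packed], scales=[0.5])
```

Real checkpoints typically store the packed weights and scales as separate tensors and may also carry zero points. This sketch omits those details.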
What's Changed
- docs(readme): add LoongForge banner by @nullnonenilNULL in #38
- Standard sft_dataset_config.yaml: Remove history from the Alpaca data… by @gsl322 in #52
- [bridge] Enable fp8 force pow2on for bridge by @pengxiangyu in #50
- [convert] Support getting HF checkpoint file names by expert_ids during checkpoint conversion by @pengxiangyu in #53
- [groot_n1.6] Support full-iteration / per-microbatch CUDA graph by @Dana57740 in #54
- [Task] Wan FSDP + context parallelism (ring + ulysses) by @haichengv12 in #56
- [ci, docs] feat: add initial GitHub Actions CI surface by @Zachary-wW in #51
- [ckpt] feat: support HF INT4 compressed-tensors dequantization by @kaimo455 in #60
- [vlm, data, train] fix: align packed FP8 padding and media broadcast by @kaimo455 in #61
- [data] fix: log skipped VLM dataloader samples as warnings by @kaimo455 in #62
- [data, vlm] fix: skip overlong Kimi VLM samples by @kaimo455 in #63
- [Task] Wan2.2 data packing training support by @VEGA1215 in #64
- [ckpt] feat: support Kimi K2.x checkpoint conversion by @kaimo455 in #67
- [docs] docs: split bilingual docs into separate source trees by @Zachary-wW in #65
- [docs] docs: add Kimi-K2.6 support model mapping by @Zachary-wW in #68
- [docs] docs: fix diffusion docs structure and align ZH/EN toctree by @Zachary-wW in #69
New Contributors
- @nullnonenilNULL made their first contribution in #38
- @Dana57740 made their first contribution in #54
- @haichengv12 made their first contribution in #56
- @VEGA1215 made their first contribution in #64
Full Changelog: v0.1.0...v0.1.1