Release v0.1.1 · baidu-baige/LoongForge

Highlights

New model support: Kimi-K2.5 / K2.6 VLM training and MCore↔HF checkpoint conversion
Wan2.2 Diffusion: Data packing training support; FSDP + context parallelism (ring & ulysses)
GR00T-N1.6 VLA: Full-iteration / per-microbatch CUDA graph for accelerated training
INT4 checkpoint dequantization: Support HF compressed-tensors INT4 format

docs(readme): add LoongForge banner by @nullnonenilNULL in #38
Standard sft_dataset_config.yaml: Remove history from the Alpaca data… by @gsl322 in #52
[bridge] Enable fp8 force pow2on for bridge by @pengxiangyu in #50
[convert] Convert checkpoint support get hf checkpoint file names by expert_ids by @pengxiangyu in #53
[groot_n1.6] Support full-iteration / per-microbatch CUDA graph by @Dana57740 in #54
[Task] wan fsdp + cp(ring + ulysses) by @haichengv12 in #56
[ci, docs] feat: add initial GitHub Actions CI surface by @Zachary-wW in #51
[ckpt] feat: support HF INT4 compressed-tensors dequantization by @kaimo455 in #60
[vlm, data, train] fix: align packed FP8 padding and media broadcast by @kaimo455 in #61
[data] fix: log skipped VLM dataloader samples as warnings by @kaimo455 in #62
[data, vlm] fix: skip overlong Kimi VLM samples by @kaimo455 in #63
[Task] Wan2.2 data packing training support by @VEGA1215 in #64
[ckpt] feat: support Kimi K2.x checkpoint conversion by @kaimo455 in #67
[docs] docs: split bilingual docs into separate source trees by @Zachary-wW in #65
[docs] docs: add Kimi-K2.6 support model mapping by @Zachary-wW in #68
[docs] docs: fix diffusion docs structure and align ZH/EN toctree by @Zachary-wW in #69

Full Changelog: v0.1.0...v0.1.1