Skip to content

v0.1.1

Latest

Choose a tag to compare

@Zachary-wW Zachary-wW released this 26 May 09:03
· 52 commits to master since this release
Immutable release. Only release title and notes can be modified.
c5c4205

Highlights

  • New model support: Kimi-K2.5 / K2.6 VLM training and MCore↔HF checkpoint conversion
  • Wan2.2 Diffusion: Data packing training support; FSDP + context parallelism (ring & ulysses)
  • GR00T-N1.6 VLA: Full-iteration / per-microbatch CUDA graph for accelerated training
  • INT4 checkpoint dequantization: Support HF compressed-tensors INT4 format

What's Changed

  • docs(readme): add LoongForge banner by @nullnonenilNULL in #38
  • Standard sft_dataset_config.yaml: Remove history from the Alpaca data… by @gsl322 in #52
  • [bridge] Enable fp8 force pow2on for bridge by @pengxiangyu in #50
  • [convert] Convert checkpoint support get hf checkpoint file names by expert_ids by @pengxiangyu in #53
  • [groot_n1.6] Support full-iteration / per-microbatch CUDA graph by @Dana57740 in #54
  • [Task] wan fsdp + cp(ring + ulysses) by @haichengv12 in #56
  • [ci, docs] feat: add initial GitHub Actions CI surface by @Zachary-wW in #51
  • [ckpt] feat: support HF INT4 compressed-tensors dequantization by @kaimo455 in #60
  • [vlm, data, train] fix: align packed FP8 padding and media broadcast by @kaimo455 in #61
  • [data] fix: log skipped VLM dataloader samples as warnings by @kaimo455 in #62
  • [data, vlm] fix: skip overlong Kimi VLM samples by @kaimo455 in #63
  • [Task] Wan2.2 data packing training support by @VEGA1215 in #64
  • [ckpt] feat: support Kimi K2.x checkpoint conversion by @kaimo455 in #67
  • [docs] docs: split bilingual docs into separate source trees by @Zachary-wW in #65
  • [docs] docs: add Kimi-K2.6 support model mapping by @Zachary-wW in #68
  • [docs] docs: fix diffusion docs structure and align ZH/EN toctree by @Zachary-wW in #69

New Contributors

Full Changelog: v0.1.0...v0.1.1