v0.1.1
What's Changed
- [Docs] Add CLI docs by @SolitaryThinker in #406
- [Docs] Fix image by @SolitaryThinker in #407
- [Teacache] allow None for forward_context batch when using teacache by @SolitaryThinker in #412
- [V1] Remove vLLM dependency by @SolitaryThinker in #413
- Fulfill worker response on interrupt by @kevin314 in #417
- [bug] fix bs > 1 by @SolitaryThinker in #418
- Fix version number by @Edenzzzz in #422
- [Tests] don't run 3.10 and 3.11 for SSIM by @SolitaryThinker in #427
- Use version.py by @Edenzzzz in #424
- Unify env report script in issue template by @Edenzzzz in #423
- Set device for encode by @kevin314 in #420
- [Misc] Small fixes to Torch code by @applesaucethebun in #395
- misc: Trigger transformers CI for layers and attention code change by @Edenzzzz in #434
- [Training] [2/n] add bwd for all2all and all_gather by @SolitaryThinker in #439
- [Training] [3/n] Add training args and dependencies by @SolitaryThinker in #440
- [Training] [4/n] add training save checkpoint by @SolitaryThinker in #441
- [Training] [1/n] Add latent datasets by @SolitaryThinker in #438
- Update STA mask strategy downloading by @BrianChen1129 in #445
- [Training] [5/n] Add single gpu training pipeline by @SolitaryThinker in #447
- [Training] [0/n] Add preprocessing pipeline by @JerryZhou54 in #442
- [Training] [6/n]Mixed precision training by @SolitaryThinker in #448
- [Training] [7/n] gradient clipping by @SolitaryThinker in #449
- [Training] [8/n] SP Training by @SolitaryThinker in #450
- misc: add remote pdb for debugging workers by @Edenzzzz in #456
- [Misc] Remove InferenceEngine by @Edenzzzz in #455
- [Misc] disable cast_forward_inputs by @SolitaryThinker in #460
- Bring back mask files under asset/ and update new Wan mask strategy file by @BrianChen1129 in #462
- Fix WanVideo by @JerryZhou54 in #461
- [Training] Add distributed checkpointing by @kevin314 in #458
- Update v1 inference scripts by @JerryZhou54 in #467
- [Training] Support Multi-Node training with FSDP + SP by @SolitaryThinker in #459
- [misc] Polish V1 training code by @Edenzzzz in #469
- [misc] Find unused port in distributed init by @Edenzzzz in #475
- [LoRA] Support V1 LoRA inference by @Edenzzzz in #451
- [bugfix] fix bz >1 for training by @SolitaryThinker in #477
- [Issue template] Move env report to the end for readability by @Edenzzzz in #476
- [Preprocess] I2V dataset by @BrianChen1129 in #473
- [Distill] support distill for wan by @AliceChenyy in #444
- [STA] Implement mask search and update mask strategy for V1's Wan2.1 by @KevinZeng08 in #415
- [bugfix] [training] Add negative prompt to preprocessing and validation by @jzhang38 in #479
- [bugfix] [misc] fix denoising stage init; rename distributed env function; fix logging by @jzhang38 in #481
- Add torch.compile for all small ops by @Edenzzzz in #432
- Revert "Add torch.compile for all small ops" by @Edenzzzz in #484
- [Bug] Fix multi gpus issues in v1 scripts by @BrianChen1129 in #489
- [misc] Improve distributed related env variables and setup by @jzhang38 in #487
- [bugfix][Cli Inference] Resolve runtime errors when running fastvideo generate by @JerryZhou54 in #493
- Fix pre-commit CI by @Edenzzzz in #494
- [bugfix][Cli Inference] Resolve runtime errors when running fastvideo generate by @JerryZhou54 in #495
- [Feature] Adding VSA inference by @BrianChen1129 in #478
- [misc] Add missing license headers by @SolitaryThinker in #499
- [Feat][Dataloader] 1/n Refactor parquet map-style dataloader by @jzhang38 in #492
- [Feature][VSA]Update STA publish workflow by @BrianChen1129 in #498
- [misc] rename dp_size to hdsp_replicate_dim by @jzhang38 in #491
- [CI] [Training] Initial e2e small training test by @SolitaryThinker in #504
- [feat] Add parquet iterable dataset by @jzhang38 in #506
- [Refactor][Configurations] clean config organization by @Eigensystem in #505
- fix logging by @jzhang38 in #509
- [CI] Restrict training CI to v1 by @Edenzzzz in #508
- [misc] Fix preprocessing and dataloader extra padding by @jzhang38 in #514
- [Feature][Training]vsa for t2v training ready by @BrianChen1129 in #513
- [Feature][Preprocess]Add Readme doc for preprocess by @BrianChen1129 in #518
- [CI] [Training] drop negative prompt in validation dataset and CI test for preprocess + training overfit by @SolitaryThinker in #519
- [Bugfix][Preprocess]fix mini dataset name by @BrianChen1129 in #520
- [Refactor] Fix attn backend selection not correctly setting env variable by @Edenzzzz in #516
- [misc] [ci] fix e2e preprocess+training data path by @SolitaryThinker in #521
- [bugfix] [Training] use diffusers fp32layernorm for wan2.1 by @SolitaryThinker in #490
- [CI] Update Docker image to flash-attn 2.8.0 / CUDA 12.8 by @kevin314 in #524
- [CI] Add current PR test workflow to Buildkite/Modal by @kevin314 in #512
- [CI][bugfix] Use new 3.12 docker image by @SolitaryThinker in #526
- [Bugfix][Inference]Fix envs.attn_backend by @BrianChen1129 in #525
- [Ci] add sta and vsa install to docker image by @SolitaryThinker in #528
- [Feature][CI]Add STA-inference/VSA-training test by @BrianChen1129 in #527
- [Bugfix][Readme]Fix readme website bugs and add VSA finetune docs by @BrianChen1129 in #531
- [Refactor] Move dict_to_3d_list under utils by @Edenzzzz in #507
- Specify cu128 Pytorch installation by @kevin314 in #530
- [Feat] Add Stage input and output verification by @SolitaryThinker in #523
- [misc] Remove gradient checking code by @SolitaryThinker in #532
- [bugfix] Fix stage validator for multi text encoder models by @SolitaryThinker in #535
- [bugfix] [VSA] Fix layernorm type for VSA Wan2.1 TransformerBlock by @SolitaryThinker in #534
- [misc] [training] Reorganize training pipeline by @SolitaryThinker in #533
- [chore] Bump torch to 2.7.1 to support Blackwell by @Edenzzzz in #483
- [Training] Refactor and improve validation datasets by @SolitaryThinker in #539
- [Feature][Training]Add diffusers format checkpoint saving for inference by @BrianChen1129 in #542
- [Kernel] Remove all syncs from STA & VSA kernels by @Edenzzzz in #517
- [CI] Fix CI checks by @Edenzzzz in #553
- [Feature][Training] Add cfg rate for dataset loader by @BrianChen1129 in #556
- [Training] Fixes SP for training; Improve Datasets and schema by @SolitaryThinker in #555
- [CI] fix vae and ssim tests by @SolitaryThinker in #557
- [Feature] [Training] Add i2v training by @SolitaryThinker in #559
- [Feature][Inference] Add VSA inference script by @BrianChen1129 in #561
- [Docs] [Training] add readme for example training by @SolitaryThinker in #563
- [Feature] Load weights from distributed by @Edenzzzz in #470
- [CI] Fix SSIM and transformers CI by @Edenzzzz in #564
- [CI] Add tests to Modal by @kevin314 in #562
- Fix CLIP config by @Edenzzzz in #568
- [Revert] "[Feature] Load weights from distributed" by @SolitaryThinker in #571
- [ci] [misc] fix training test threshold by @SolitaryThinker in #573
- [Bugfix]Fix VSA sp for training/inference by @BrianChen1129 in #574
- fully_shard usage on RMSNorm by @weifengpy in #577
- [CI] Fix pre-commit CI by @Edenzzzz in #578
- [misc] [training] Add --video_length_tolerance_range 10 to preprocessing scripts by @SolitaryThinker in #581
- [Training] add caption to validation log by @SolitaryThinker in #582
- [Feat] activation checkpointing by @Eigensystem in #584
- [Feat][Training] Rename weight conversion function and update gradient checkpoint in scripts by @BrianChen1129 in #589
- [Feature]Update Readme by @BrianChen1129 in #591
- [CI] Fix fork builds by @kevin314 in #590
- [chore] Release 0.1.1 by @SolitaryThinker in #592
New Contributors
- @Edenzzzz made their first contribution in #422
- @AliceChenyy made their first contribution in #444
- @KevinZeng08 made their first contribution in #415
- @Eigensystem made their first contribution in #505
- @weifengpy made their first contribution in #577
Full Changelog: v0.1.0...v0.1.1