Skip to content

Wan 2.1 / Wan2.2 Lightning & Sparse Distillation & Tiny VAE Acceleration Methods #12146

@ghunkins

Description

@ghunkins
Image

Is your feature request related to a problem? Please describe.
Some incredible acceleration methods have been released for both Wan2.1 and Wan2.2. These are currently appear to be unsupported with the official Diffusers pipelines.

  • Wan2.2-Lightning enables 4-step inference for Wan2.2 via LoRAs
  • LightX2V is related and also includes 4-step inference for Wan2.1 via LoRAs
  • FastVideo enables 3-step inference via distillation for Wan2.1 and Wan2.2 variants
  • TAEHV enables 4-6x increase in VAE decoding speed (2-3s to 0.5s)

Describe the solution you'd like.

  1. The ability to load lightning LoRAs with Wan2.1 and Wan2.2 pipelines
  2. The ability to load FastWan distilled models
  3. The ability to load TAEHV as a Wan VAE alternative

Describe alternatives you've considered.

The FastVideo and LightX2V official inference scripts.

TAEHV has a method for retro-fitting a Diffusers pipeline.

There are also various ComfyUI implementations.

Additional context.
Some of the LoRA acceleration might be supported already but is broken by recent changes to the LoRA loading ecosystem. Not certain. Also could use some guidance on editing the scheduler to allow for the time-step distillation.

I am currently investigating this space and figured I would open this issue to allow myself and others to post working examples and any relevant PRs. Thanks to all who do in advance.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions