Skip to content

0.1.5

Choose a tag to compare

@Saganaki22 Saganaki22 released this 14 Jun 09:42
· 3 commits to main since this release

Zonos2 TTS ComfyUI v0.1.5

Added

  • Added mixed FP8 E4M3 checkpoint support.
  • Added the drbaph/ZONOS-FP8 model catalog entry.
  • Added automatic download support for zonos2-fp8-mixed.safetensors.
  • Added FP8 metadata, policy, tensor-shape, and runtime validation.
  • Added native ComfyUI quantized tensor integration.

FP8 Policy

  • MoE expert gate/up (w13) projections use FP8 E4M3.
  • Expert down (w2), attention, LM head, routers, embeddings, norms, and other sensitive paths remain BF16.
  • FP8 checkpoints support dtype: auto and dtype: bf16.
  • Unsupported FP16 execution is rejected with a clear error.

AIMDO

  • Added actual-size-aware static and DynamicVRAM selection for FP8 models.
  • Mixed FP8 automatically selects AIMDO VBAR below approximately 12.78 GiB total VRAM when DynamicVRAM is enabled.
  • FP8 w13 and BF16 w2 expert projections are independently pageable.
  • Routed expert projections are loaded into VRAM on demand.
  • VBAR residency, page faults, and eviction use real AIMDO operations.
  • Systems with sufficient VRAM continue using the faster static loading path.

Downloads

  • Main checkpoints, DAC assets, and speaker encoder assets are checked independently.
  • Existing complete asset directories are not downloaded again.
  • A missing selected checkpoint is still downloaded when shared assets already exist.
  • BF16 and FP8 presets download from their respective Hugging Face repositories.

Validation

  • Retired all-layer FP8 checkpoint layouts are rejected.
  • Malformed 3D expert tensors are rejected before inference.
  • Clear compatibility errors are provided for unsupported FP8 checkpoints.
  • Added native and loader regression coverage.

Documentation

  • Updated English and Chinese documentation.
  • Added FP8 installation, memory usage, dtype, AIMDO, model structure, and troubleshooting information.
  • Added the mixed FP8 Hugging Face badge and model link.

Verification

  • 33 automated tests passing.
  • Static FP8 generation verified.
  • Real AIMDO VBAR generation verified with all expert w13 and w2 projections registered.