Skip to content

b8b69f7

Latest

Choose a tag to compare

@github-actions github-actions released this 31 May 15:39
· 28 commits to main since this release

What's Changed

  • UMA backend detection: new cosyvoice_is_backend_uma API + --inference-buffer-policy CLI option
  • Concurrency support: cosyvoice_duplicate_context API, reference counting, multi-worker inference, server concurrency
  • TTS split & fade-in: text split context flags, fade-in postprocess, exposed in CLI and server
  • Model refactoring: split into shared/worker context structures for better encapsulation
  • fix: occasional frontend crash (#21)
  • misc: Ctrl+C exit, timing info enhancement, nucleus probabilities management, default LLM seq len → 4096, documentation updates

Full Changelog: ee825ac...3e46dc4

Release Packages

These packages only include the core cosyvoice binaries and tools, plus ONNX Runtime.
GGML, ICU, and (for FFmpeg backend) FFmpeg are not included.

To use cosyvoice, you need to download the matching backend libraries separately:

  • GGML:
    Download from llama.cpp releases and extract the GGML library.
    Place GGML files in the same directory as cosyvoice binaries.

  • ICU:
    Required unless using a "No ICU" build.
    On Linux/macOS: install via your package manager.
    On Windows: Download ICU.

  • FFmpeg:
    Required only for FFmpeg backend builds (not needed for miniaudio).
    On Linux/macOS: install via your package manager.
    On Windows: Download FFmpeg
    For Windows, use stable n8.1 builds, e.g.
    ffmpeg-n8.1-latest-win64-lgpl-shared-8.1.zip.

Windows

Windows x64 (FFmpeg)
Windows x64 (miniaudio)
Windows x64 (miniaudio, No ICU)
Windows x64 (No Frontend, No Audio, No ICU)

Linux

Ubuntu x86_64 (FFmpeg)
Ubuntu x86_64 (miniaudio)
Ubuntu x86_64 (miniaudio, No ICU)
Ubuntu x86_64 (No Frontend, No Audio, No ICU)

macOS

macOS arm64 (FFmpeg)
macOS arm64 (miniaudio)
macOS arm64 (miniaudio, No ICU)
macOS arm64 (No Frontend, No Audio, No ICU)

Notes