Skip to content

FunASR llama.cpp runtime v0.1.2

Latest

Choose a tag to compare

@LauraGPT LauraGPT released this 21 Jun 17:55
· 6 commits to main since this release
f1a55b0

Prebuilt self-contained binaries for running Fun-ASR-Nano (and SenseVoice / Paraformer) locally with the FunASR llama.cpp / GGUF runtime — built-in FSMN-VAD, whisper.cpp-style on-device ASR, strong on Chinese.

New: q8 GGUF models are ~half the size of f16 with the same accuracy.

bash download-funasr-model.sh nano ./gguf
llama-funasr-cli --enc ./gguf/funasr-encoder-f16.gguf -m ./gguf/qwen3-0.6b-q8_0.gguf -a audio.wav --vad ./gguf/fsmn-vad.gguf

No Python, no build. Linux (x64/arm64), macOS (arm64), Windows (x64). Docs: runtime/llama.cpp/README.md.