Prebuilt, self-contained binaries for running Fun-ASR-Nano (and SenseVoice / Paraformer) locally with the FunASR llama.cpp / GGUF runtime: built-in FSMN-VAD, whisper.cpp-style on-device ASR, strong on Chinese.
New: q8 GGUF models are ~half the size of f16 with the same accuracy.
bash download-funasr-model.sh nano ./gguf
llama-funasr-cli --enc ./gguf/funasr-encoder-f16.gguf -m ./gguf/qwen3-0.6b-q8_0.gguf -a audio.wav --vad ./gguf/fsmn-vad.gguf

No Python, no build. Linux (x64/arm64), macOS (arm64), Windows (x64). Docs: runtime/llama.cpp/README.md.
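To transcribe a whole folder rather than a single audio.wav, the command above can be wrapped in a small loop. This is only a sketch: the function name `transcribe_dir` and the `GGUF_DIR` / `FUNASR_CLI` variables are made up for illustration and are not part of the release. It assumes the models were downloaded to ./gguf as shown above and that the CLI writes its transcript to standard output.

```shell
#!/usr/bin/env bash
# Hypothetical batch wrapper around the single-file command from these notes.
# GGUF_DIR and FUNASR_CLI are illustrative names, not options of the release.
GGUF_DIR="${GGUF_DIR:-./gguf}"
FUNASR_CLI="${FUNASR_CLI:-llama-funasr-cli}"

# Run the CLI once per .wav file found directly inside the given directory.
transcribe_dir() {
  local dir="$1" wav status=0
  for wav in "$dir"/*.wav; do
    # If nothing matches, the glob stays literal; skip that non-existent path.
    [ -e "$wav" ] || continue
    echo "== $wav"
    # Same flags as the single-file example: encoder, LLM, audio input, VAD.
    "$FUNASR_CLI" \
      --enc "$GGUF_DIR/funasr-encoder-f16.gguf" \
      -m "$GGUF_DIR/qwen3-0.6b-q8_0.gguf" \
      -a "$wav" \
      --vad "$GGUF_DIR/fsmn-vad.gguf" || status=1
  done
  # Non-zero if any file failed, so callers can detect partial failures.
  return "$status"
}
```

After sourcing the file, `transcribe_dir ./recordings` processes every .wav in ./recordings in turn. Setting `FUNASR_CLI=echo` first prints the commands instead of running them, which is a cheap way to check the paths.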