Release FunASR llama.cpp runtime v0.1.2 · FunAudioLLM/Fun-ASR

Prebuilt, self-contained binaries for running Fun-ASR-Nano (and SenseVoice / Paraformer) locally with the FunASR llama.cpp / GGUF runtime: built-in FSMN-VAD, whisper.cpp-style on-device ASR, and strong Chinese recognition.

New: q8 GGUF models are ~half the size of f16 with the same accuracy.

bash download-funasr-model.sh nano ./gguf
llama-funasr-cli --enc ./gguf/funasr-encoder-f16.gguf -m ./gguf/qwen3-0.6b-q8_0.gguf -a audio.wav --vad ./gguf/fsmn-vad.gguf
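
To run the same command over several recordings, a small wrapper loop may help. The following is a minimal sketch, not part of the release: the `transcribe_dir` helper and the `FUNASR_CLI` and `GGUF_DIR` variables are hypothetical names introduced here, and it uses only the flags shown above (`--enc`, `-m`, `-a`, `--vad`). It assumes the CLI prints its transcript to stdout.

```shell
#!/usr/bin/env bash
# Sketch: transcribe every .wav file in a directory with the flags from the release notes.
# Assumptions (not from the release notes): the FUNASR_CLI and GGUF_DIR variables,
# the transcribe_dir helper, and that the CLI writes its transcript to stdout.
FUNASR_CLI="${FUNASR_CLI:-llama-funasr-cli}"
GGUF_DIR="${GGUF_DIR:-./gguf}"

transcribe_dir() {
  local dir="$1" wav
  for wav in "$dir"/*.wav; do
    [ -e "$wav" ] || continue   # skip when the glob matches nothing
    echo "== $wav"              # header line so each file's output can be told apart
    "$FUNASR_CLI" \
      --enc "$GGUF_DIR/funasr-encoder-f16.gguf" \
      -m "$GGUF_DIR/qwen3-0.6b-q8_0.gguf" \
      -a "$wav" \
      --vad "$GGUF_DIR/fsmn-vad.gguf"
  done
}
```

After sourcing the script, `transcribe_dir ./recordings` would process each `.wav` file in `./recordings` in turn, assuming the models were downloaded to `./gguf` as above.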

No Python, no build. Linux (x64/arm64), macOS (arm64), Windows (x64). Docs: runtime/llama.cpp/README.md.
