3.10.0 (2025-06-12)
Features
- JSON Schema Grammar:
$defs
and$ref
support with full inferred types (#472) (9cdbce9) inspect gguf
command: format and print the Jinja chat template with--key .chatTemplate
(#472) (9cdbce9)
Bug Fixes
JinjaTemplateChatWrapper
: first function call prefix detection (#472) (9cdbce9)QwenChatWrapper
: improve Qwen chat template detection (#472) (9cdbce9)- apply
maxTokens
on function calling parameters (#472) (9cdbce9) - adjust default prompt completion length based on SWA size when relevant (#472) (9cdbce9)
- improve thought segmentation syntax extraction (#472) (9cdbce9)
- adapt to
llama.cpp
changes (#472) (9cdbce9)
Shipped with llama.cpp
release b5640
To use the latest
llama.cpp
release available, runnpx -n node-llama-cpp source download --release latest
. (learn more)