v3.9.0
3.9.0 (2025-06-04)
Features
- reasoning budget (#468) (ea8d904) (documentation: Set Reasoning Budget)
- SWA (Sliding Window Attention) support - greatly reduced context memory consumption on supported models (#468) (ea8d904)
- documentation: LLM-friendly llms.md and llms-full.md files (#468) (ea8d904)
Bug Fixes
Shipped with llama.cpp release b5590
To use the latest llama.cpp release available, run npx -n node-llama-cpp source download --release latest. (learn more)