Skip to content

v3.10.0

Latest
Compare
Choose a tag to compare
@github-actions github-actions released this 12 Jun 01:14
59cf309

3.10.0 (2025-06-12)

Features

  • JSON Schema Grammar: $defs and $ref support with full inferred types (#472) (9cdbce9)
  • inspect gguf command: format and print the Jinja chat template with --key .chatTemplate (#472) (9cdbce9)

Bug Fixes

  • JinjaTemplateChatWrapper: first function call prefix detection (#472) (9cdbce9)
  • QwenChatWrapper: improve Qwen chat template detection (#472) (9cdbce9)
  • apply maxTokens on function calling parameters (#472) (9cdbce9)
  • adjust default prompt completion length based on SWA size when relevant (#472) (9cdbce9)
  • improve thought segmentation syntax extraction (#472) (9cdbce9)
  • adapt to llama.cpp changes (#472) (9cdbce9)

Shipped with llama.cpp release b5640

To use the latest llama.cpp release available, run npx -n node-llama-cpp source download --release latest. (learn more)