Skip to content

v4.0.0

Latest

Choose a tag to compare

@github-actions github-actions released this 11 Jun 02:44
· 2 commits to main since this release
8d33613

Features

Breaking changes

  • Inference parameters: batch_n_tokens renamed to n_batch
  • Inference parameters now requires embedding_batch_size
  • Streamed tokens now arrive under kind-specific keys (ContentToken, ReasoningToken, ToolCallToken, UndeterminableToken) instead of Token, and the stream ends with a Done event carrying token usage
  • Minimum supported Rust version is now 1.95.0