manifest v6.8.3

@github-actions github-actions released this 01 Jun 06:05
· 271 commits to main since this release
288f0ed

🐛 Patch Changes

  • 65bca08: Fix token-cost calculation for providers that report cache-read prompt tokens. Manifest now uses models.dev cache-read/cache-write prices when available, so cached DeepSeek input tokens are billed at the cache-hit rate instead of the full input-token rate.
  • 08f9c6f: Fix /v1/responses streaming to emit the full OpenAI Responses API event lifecycle when bridging a Chat Completions upstream. The converter now opens a message item and content part (response.output_item.added / response.content_part.added) before the text deltas, then closes them (response.output_text.done / response.content_part.done / response.output_item.done) and finishes with a populated response.completed. Previously it emitted only bare response.output_text.delta events followed by response.completed{output:[]}, so strict Responses API clients (Pi, OpenClaw-style) dropped the deltas and rendered empty assistant messages.
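
The cache-read fix in 65bca08 comes down to splitting prompt tokens into cached and uncached portions and pricing each at its own rate. The sketch below illustrates that arithmetic only. The `ModelPricing` and `Usage` shapes, the field names, and `estimateCostUsd` are hypothetical and are not Manifest's actual types, and the prices in the usage note are made-up illustration values, not real DeepSeek or models.dev prices. Cache-write pricing is left out for brevity.

```typescript
// Hypothetical shapes for illustration; not Manifest's real types.
interface ModelPricing {
  inputPerMTok: number;      // USD per 1M uncached input tokens
  outputPerMTok: number;     // USD per 1M output tokens
  cacheReadPerMTok?: number; // USD per 1M cache-hit input tokens, if known
}

interface Usage {
  promptTokens: number;       // total prompt tokens, cached ones included
  cachedPromptTokens: number; // subset of promptTokens served from cache
  completionTokens: number;
}

function estimateCostUsd(usage: Usage, price: ModelPricing): number {
  // Clamp so a malformed report can never produce negative uncached tokens.
  const cached = Math.min(Math.max(usage.cachedPromptTokens, 0), usage.promptTokens);
  const uncached = usage.promptTokens - cached;

  // Fall back to the full input rate when no cache-read price is available.
  const cacheRate = price.cacheReadPerMTok ?? price.inputPerMTok;

  const total =
    uncached * price.inputPerMTok +
    cached * cacheRate +
    usage.completionTokens * price.outputPerMTok;
  return total / 1_000_000;
}
```

With an assumed input price of 1.00 and cache-read price of 0.10 per million tokens, a 1,000,000-token prompt of which 800,000 tokens were cache hits costs about 0.28 under this sketch, compared with 1.00 if every prompt token were charged at the full input rate.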
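
The event ordering described in 08f9c6f can be sketched as a small function that wraps a list of text deltas in the opening and closing events. This is a simplified illustration, not Manifest's converter: `bridgeTextDeltas`, the `ResponsesEvent` type, and the `msg_example` item id are hypothetical, the payload fields are an assumed reduction of what the Responses API sends, and events outside the ones named in the changelog entry are omitted.

```typescript
// Loose event shape for illustration; real payloads carry more fields.
type ResponsesEvent = { type: string; [key: string]: unknown };

function bridgeTextDeltas(deltas: string[], itemId = "msg_example"): ResponsesEvent[] {
  const position = { item_id: itemId, output_index: 0, content_index: 0 };
  const events: ResponsesEvent[] = [];

  // Open the message item and its text content part before any delta.
  events.push({
    type: "response.output_item.added",
    output_index: 0,
    item: { id: itemId, type: "message", role: "assistant", content: [] },
  });
  events.push({
    type: "response.content_part.added",
    ...position,
    part: { type: "output_text", text: "" },
  });

  // Forward each upstream text chunk as a delta and accumulate the full text.
  let text = "";
  for (const delta of deltas) {
    text += delta;
    events.push({ type: "response.output_text.delta", ...position, delta });
  }

  // Close in the reverse order of opening: text, content part, then item.
  const part = { type: "output_text", text };
  const item = { id: itemId, type: "message", role: "assistant", content: [part] };
  events.push({ type: "response.output_text.done", ...position, text });
  events.push({ type: "response.content_part.done", ...position, part });
  events.push({ type: "response.output_item.done", output_index: 0, item });

  // The final event carries the populated output instead of an empty array.
  events.push({ type: "response.completed", response: { output: [item] } });
  return events;
}
```

Calling `bridgeTextDeltas(["Hel", "lo"])` in this sketch yields the two opening events, two delta events, the three closing events, and a final `response.completed` whose `output` holds the assembled message, which is the shape a strict client needs in order to attach the deltas to an open item.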