Skip to content

Extension 0.5.8

Choose a tag to compare

@github-actions github-actions released this 27 Apr 15:25
· 62 commits to main since this release

Thinking Mode for Chinese AI Providers

✨ New Feature

  • Raw streaming pipeline for thinking-capable providers: Qwen, Zhipu (GLM), and MiniMax now use a direct fetch-based streaming path instead of the Vercel AI SDK. This allows request-body flags such as enable_thinking to be passed explicitly, unlocking native thinking/reasoning mode for these providers without requiring an API format override.
  • Improved loading indicator: The "preparing response" dots now stay visible throughout the entire reasoning phase — not just before the first token arrives — so the UI always reflects that the model is actively thinking.

Full Changelog: ext/v0.5.7...ext/v0.5.8