Three new providers — Kimi (Moonshot), Grok (xAI), and Qwen (Alibaba) — covering the major coding models. Kimi and Qwen include their flat-fee coding-plan subscriptions alongside pay-per-use; Grok adds graded
/thinkingeffort.
Added
- Kimi (Moonshot AI).
kimidrives the Kimi Code subscription
(api.kimi.com/coding, model aliaskimi-for-coding);kimi-apiis
pay-per-use (api.moonshot.ai, defaultkimi-k2.7-code);kimi-cnfor
mainland China. Keys:KIMI_CODE_API_KEY/MOONSHOT_API_KEY. - Qwen (Alibaba Model Studio).
qwendrives the Coding Plan
subscription (coding-intl.dashscope…,sk-sp-key);qwen-apiis
pay-per-use (DashScope, defaultqwen3-coder-plus); plusqwen-cn/
qwen-cn-apiand a free ModelScope tier (modelscope). Keys:
BAILIAN_CODING_PLAN_API_KEY/DASHSCOPE_API_KEY/MODELSCOPE_API_KEY. - Grok (xAI).
grok— pay-per-use (api.x.ai), defaultgrok-build-0.1
plusgrok-4.3and the fast/reasoning variants. Key:XAI_API_KEY.
Changed
/thinkingnow covers Grok (reasoning_effort— low/medium/high). Kimi
and the Qwen coder models have no graded knob, so they stay out of the picker.- Qwen tool turns are sent non-streamed. DashScope rejects
toolswith
stream:true, so agent turns that carry tools buffer the reply (handled
transparently); other providers keep streaming. - Kimi K2.x code models fix temperature internally — Codeep withholds the
sampling params so they don't 400.