feat: add neuralwatt provider with 14 models#1610
Merged
rekram1-node merged 3 commits intoanomalyco:devfrom May 2, 2026
Merged
feat: add neuralwatt provider with 14 models#1610rekram1-node merged 3 commits intoanomalyco:devfrom
rekram1-node merged 3 commits intoanomalyco:devfrom
Conversation
Add Neuralwatt as an OpenAI-compatible inference provider with energy-aware GPU optimization. Includes 14 models across 6 sub-providers (Mistral, ZAI, OpenAI, Moonshot, MiniMax, Qwen). Models include reasoning variants (Kimi K2.5/K2.6, GLM 5.1 FP8, MiniMax M2.5, Qwen3.5 397B, GPT OSS 20B) and fast non-reasoning variants (Kimi K2.5/K2.6 Fast, GLM 5/5.1 Fast, Qwen3.5/3.6 Fast), plus Devstral Small 2 and Qwen3.6 35B A3B. Logo derived from official Neuralwatt favicon (currentColor variant). Pricing sourced from Neuralwatt's published rates.
The Neuralwatt API now returns accurate pricing and capabilities, eliminating the need for manual patches (patch.json is now empty). Changes: - Update pricing for all 14 models from API (significant changes for GLM, GPT-OSS, Qwen, and MiniMax models) - Devstral Small 2 now supports image input (vision) - kimi-k2.5-fast now supports image input (vision) - kimi-k2.6-fast now supports reasoning + image input (was non-reasoning) - Qwen3.6-35B-A3B now supports reasoning (was non-reasoning) - GLM models context window: 202,752 → 200,000 - Rename fast variant model IDs to match API (dropped org prefix): zai-org/glm-5-fast → glm-5-fast zai-org/glm-5.1-fast → glm-5.1-fast moonshotai/kimi-k2.5-fast → kimi-k2.5-fast moonshotai/kimi-k2.6-fast → kimi-k2.6-fast Qwen/qwen3.5-397b-fast → qwen3.5-397b-fast Qwen/qwen3.6-35b-fast → qwen3.6-35b-fast
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Summary
Adds Neuralwatt as an OpenAI-compatible inference provider with energy-aware GPU optimization.
Update: The Neuralwatt models API was updated and now returns accurate pricing, capabilities, and limits. Model IDs, pricing, and capabilities in this PR follow the upstream API faithfully — no manual patches needed (
patch.jsonis now empty).Provider Details
https://api.neuralwatt.com/v1(OpenAI-compatible)@ai-sdk/openai-compatibleNEURALWATT_API_KEYModels (14 total)
Reasoning models (with
[interleaved] field = "reasoning_content")moonshotai/Kimi-K2.5moonshotai/Kimi-K2.6kimi-k2.6-fastzai-org/GLM-5.1-FP8MiniMaxAI/MiniMax-M2.5Qwen/Qwen3.5-397B-A17B-FP8Qwen/Qwen3.6-35B-A3Bopenai/gpt-oss-20bFast variants (optimized for speed, non-reasoning unless noted)
kimi-k2.5-fastglm-5-fastglm-5.1-fastqwen3.5-397b-fastqwen3.6-35b-fastOther
mistralai/Devstral-Small-2-24B-Instruct-2512Files Added
providers/neuralwatt/provider.toml— Provider configproviders/neuralwatt/logo.svg— Logo from official Neuralwatt favicon (adapted tocurrentColor)providers/neuralwatt/README.md— Provider documentationproviders/neuralwatt/models/**/*.toml— 14 model definition filesNotes
/v1/models) — now fully accurate, no manual patches requiredglm-5-fast)kimi-k2.6-fastalso supports reasoning[interleaved] field = "reasoning_content"per the existing patternfill="currentColor"for theme compatibilityValidation
bun validatepassesbun run build(web) succeeds