Skip to content

feat: add neuralwatt provider with 14 models#1610

Merged
rekram1-node merged 3 commits intoanomalyco:devfrom
monotykamary:feat/neuralwatt-provider
May 2, 2026
Merged

feat: add neuralwatt provider with 14 models#1610
rekram1-node merged 3 commits intoanomalyco:devfrom
monotykamary:feat/neuralwatt-provider

Conversation

@monotykamary
Copy link
Copy Markdown
Contributor

@monotykamary monotykamary commented Apr 27, 2026

Summary

Adds Neuralwatt as an OpenAI-compatible inference provider with energy-aware GPU optimization.

Update: The Neuralwatt models API was updated and now returns accurate pricing, capabilities, and limits. Model IDs, pricing, and capabilities in this PR follow the upstream API faithfully — no manual patches needed (patch.json is now empty).

Provider Details

Models (14 total)

Reasoning models (with [interleaved] field = "reasoning_content")

Model ID Name Vision Input $/M Output $/M
moonshotai/Kimi-K2.5 Kimi K2.5 $0.52 $2.59
moonshotai/Kimi-K2.6 Kimi K2.6 $0.69 $3.22
kimi-k2.6-fast Kimi K2.6 Fast $0.69 $3.22
zai-org/GLM-5.1-FP8 GLM 5.1 FP8 $1.10 $3.60
MiniMaxAI/MiniMax-M2.5 MiniMax M2.5 $0.35 $1.38
Qwen/Qwen3.5-397B-A17B-FP8 Qwen3.5 397B A17B FP8 $0.69 $4.14
Qwen/Qwen3.6-35B-A3B Qwen3.6 35B A3B $0.05 $0.10
openai/gpt-oss-20b GPT OSS 20B $0.03 $0.16

Fast variants (optimized for speed, non-reasoning unless noted)

Model ID Name Vision Input $/M Output $/M
kimi-k2.5-fast Kimi K2.5 Fast $0.52 $2.59
glm-5-fast GLM 5 Fast $1.10 $3.60
glm-5.1-fast GLM 5.1 Fast $1.10 $3.60
qwen3.5-397b-fast Qwen3.5 397B Fast $0.69 $4.14
qwen3.6-35b-fast Qwen3.6 35B Fast $0.05 $0.10

Other

Model ID Name Vision Input $/M Output $/M
mistralai/Devstral-Small-2-24B-Instruct-2512 Devstral Small 2 24B Instruct 2512 $0.12 $0.35

Files Added

  • providers/neuralwatt/provider.toml — Provider config
  • providers/neuralwatt/logo.svg — Logo from official Neuralwatt favicon (adapted to currentColor)
  • providers/neuralwatt/README.md — Provider documentation
  • providers/neuralwatt/models/**/*.toml — 14 model definition files

Notes

  • All model IDs, pricing, capabilities, and limits are sourced directly from the Neuralwatt API (/v1/models) — now fully accurate, no manual patches required
  • Model IDs match the Neuralwatt API identifier format (some fast variants use bare names without org prefix, e.g. glm-5-fast)
  • "Fast" variants are optimized for lower latency; kimi-k2.6-fast also supports reasoning
  • Vision models support image input via OpenAI-compatible API
  • Reasoning models use [interleaved] field = "reasoning_content" per the existing pattern
  • Logo SVG adapted from the official favicon with fill="currentColor" for theme compatibility

Validation

  • bun validate passes
  • bun run build (web) succeeds

Add Neuralwatt as an OpenAI-compatible inference provider with
energy-aware GPU optimization. Includes 14 models across 6
sub-providers (Mistral, ZAI, OpenAI, Moonshot, MiniMax, Qwen).

Models include reasoning variants (Kimi K2.5/K2.6, GLM 5.1 FP8,
MiniMax M2.5, Qwen3.5 397B, GPT OSS 20B) and fast non-reasoning
variants (Kimi K2.5/K2.6 Fast, GLM 5/5.1 Fast, Qwen3.5/3.6 Fast),
plus Devstral Small 2 and Qwen3.6 35B A3B.

Logo derived from official Neuralwatt favicon (currentColor variant).
Pricing sourced from Neuralwatt's published rates.
The Neuralwatt API now returns accurate pricing and capabilities,
eliminating the need for manual patches (patch.json is now empty).

Changes:
- Update pricing for all 14 models from API (significant changes for
  GLM, GPT-OSS, Qwen, and MiniMax models)
- Devstral Small 2 now supports image input (vision)
- kimi-k2.5-fast now supports image input (vision)
- kimi-k2.6-fast now supports reasoning + image input (was non-reasoning)
- Qwen3.6-35B-A3B now supports reasoning (was non-reasoning)
- GLM models context window: 202,752 → 200,000
- Rename fast variant model IDs to match API (dropped org prefix):
  zai-org/glm-5-fast → glm-5-fast
  zai-org/glm-5.1-fast → glm-5.1-fast
  moonshotai/kimi-k2.5-fast → kimi-k2.5-fast
  moonshotai/kimi-k2.6-fast → kimi-k2.6-fast
  Qwen/qwen3.5-397b-fast → qwen3.5-397b-fast
  Qwen/qwen3.6-35b-fast → qwen3.6-35b-fast
@rekram1-node rekram1-node merged commit d2e16ba into anomalyco:dev May 2, 2026
1 check passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants