feat: add neuralwatt provider with 14 models by monotykamary · Pull Request #1610 · anomalyco/models.dev

monotykamary · 2026-04-27T08:03:23Z

Summary

Adds Neuralwatt as an OpenAI-compatible inference provider with energy-aware GPU optimization.

Update: The Neuralwatt models API was updated and now returns accurate pricing, capabilities, and limits. Model IDs, pricing, and capabilities in this PR follow the upstream API faithfully — no manual patches needed (patch.json is now empty).

Provider Details

API: https://api.neuralwatt.com/v1 (OpenAI-compatible)
SDK: @ai-sdk/openai-compatible
Auth: NEURALWATT_API_KEY
Docs: https://portal.neuralwatt.com/docs
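
As a rough illustration of what "OpenAI-compatible" means for a client, the sketch below builds (but does not send) a chat request against the base URL listed above. The `/chat/completions` path, the `Bearer` authorization scheme, and the request body shape are assumptions taken from the OpenAI chat completions convention, not confirmed by this PR; `buildChatRequest` is a hypothetical helper name.

```typescript
// Base URL comes from the PR. The /chat/completions path, the Bearer scheme,
// and the body shape are ASSUMED from the OpenAI chat completions convention.
const NEURALWATT_BASE_URL = "https://api.neuralwatt.com/v1";

interface ChatRequest {
  url: string;
  init: { method: string; headers: Record<string, string>; body: string };
}

// Hypothetical helper: assembles the request without performing any network I/O.
function buildChatRequest(apiKey: string, model: string, prompt: string): ChatRequest {
  return {
    url: `${NEURALWATT_BASE_URL}/chat/completions`,
    init: {
      method: "POST",
      headers: {
        "Content-Type": "application/json",
        Authorization: `Bearer ${apiKey}`,
      },
      body: JSON.stringify({
        model,
        messages: [{ role: "user", content: prompt }],
      }),
    },
  };
}
```

A caller would typically read the key from `NEURALWATT_API_KEY`, call `buildChatRequest(key, "glm-5.1-fast", "Hello")`, and pass the resulting `url` and `init` to `fetch`.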

Models (14 total)

Reasoning models (with `[interleaved] field = "reasoning_content"`)

Model ID	Name	Vision	Input $/M	Output $/M
`moonshotai/Kimi-K2.5`	Kimi K2.5	✅	$0.52	$2.59
`moonshotai/Kimi-K2.6`	Kimi K2.6	✅	$0.69	$3.22
`kimi-k2.6-fast`	Kimi K2.6 Fast	✅	$0.69	$3.22
`zai-org/GLM-5.1-FP8`	GLM 5.1 FP8	❌	$1.10	$3.60
`MiniMaxAI/MiniMax-M2.5`	MiniMax M2.5	❌	$0.35	$1.38
`Qwen/Qwen3.5-397B-A17B-FP8`	Qwen3.5 397B A17B FP8	❌	$0.69	$4.14
`Qwen/Qwen3.6-35B-A3B`	Qwen3.6 35B A3B	❌	$0.05	$0.10
`openai/gpt-oss-20b`	GPT OSS 20B	❌	$0.03	$0.16

Fast variants (optimized for speed, non-reasoning unless noted)

Model ID	Name	Vision	Input $/M	Output $/M
`kimi-k2.5-fast`	Kimi K2.5 Fast	✅	$0.52	$2.59
`glm-5-fast`	GLM 5 Fast	❌	$1.10	$3.60
`glm-5.1-fast`	GLM 5.1 Fast	❌	$1.10	$3.60
`qwen3.5-397b-fast`	Qwen3.5 397B Fast	❌	$0.69	$4.14
`qwen3.6-35b-fast`	Qwen3.6 35B Fast	❌	$0.05	$0.10

Other

Model ID	Name	Vision	Input $/M	Output $/M
`mistralai/Devstral-Small-2-24B-Instruct-2512`	Devstral Small 2 24B Instruct 2512	✅	$0.12	$0.35
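
To make the per-million-token prices concrete, here is a small sketch that estimates the cost of a request from token counts. The prices are copied from the tables above for three of the models; `PRICING` and `estimateCostUsd` are hypothetical names, and the calculation assumes input and output tokens are billed linearly at the listed rates with no other charges.

```typescript
interface Pricing {
  inputPerMillion: number;  // USD per 1M input tokens
  outputPerMillion: number; // USD per 1M output tokens
}

// Subset of the tables above; extend with other model IDs as needed.
const PRICING: Record<string, Pricing> = {
  "moonshotai/Kimi-K2.6": { inputPerMillion: 0.69, outputPerMillion: 3.22 },
  "Qwen/Qwen3.6-35B-A3B": { inputPerMillion: 0.05, outputPerMillion: 0.10 },
  "openai/gpt-oss-20b": { inputPerMillion: 0.03, outputPerMillion: 0.16 },
};

// ASSUMPTION: cost is linear in token counts at the listed rates.
function estimateCostUsd(modelId: string, inputTokens: number, outputTokens: number): number {
  const price = PRICING[modelId];
  if (price === undefined) {
    throw new Error(`unknown model: ${modelId}`);
  }
  return (
    (inputTokens * price.inputPerMillion + outputTokens * price.outputPerMillion) / 1_000_000
  );
}
```

For example, 2,000,000 input tokens and 500,000 output tokens on `moonshotai/Kimi-K2.6` come to roughly 2 × $0.69 + 0.5 × $3.22 = $2.99 under these assumptions.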

Files Added

providers/neuralwatt/provider.toml — Provider config
providers/neuralwatt/logo.svg — Logo from official Neuralwatt favicon (adapted to currentColor)
providers/neuralwatt/README.md — Provider documentation
providers/neuralwatt/models/**/*.toml — 14 model definition files

Notes

All model IDs, pricing, capabilities, and limits are sourced directly from the Neuralwatt API (/v1/models) — now fully accurate, no manual patches required
Model IDs match the Neuralwatt API identifier format (some fast variants use bare names without org prefix, e.g. glm-5-fast)
"Fast" variants are optimized for lower latency; kimi-k2.6-fast also supports reasoning
Vision models support image input via OpenAI-compatible API
Reasoning models use [interleaved] field = "reasoning_content" per the existing pattern
Logo SVG adapted from the official favicon with fill="currentColor" for theme compatibility
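
The `reasoning_content` field mentioned in the notes above can be handled on the client roughly as follows. This is a sketch only: the message shape is assumed to be an OpenAI-style assistant message carrying an extra `reasoning_content` string, and only the field name itself comes from this PR. `AssistantMessage` and `splitReasoning` are hypothetical names.

```typescript
// ASSUMED shape: an OpenAI-style assistant message with an additional
// `reasoning_content` string. Only the field name is taken from the PR.
interface AssistantMessage {
  role: "assistant";
  content: string | null;
  reasoning_content?: string | null;
}

// Separates the model's reasoning trace from its final answer,
// falling back to empty strings when either part is absent.
function splitReasoning(message: AssistantMessage): { reasoning: string; answer: string } {
  return {
    reasoning: message.reasoning_content ?? "",
    answer: message.content ?? "",
  };
}
```

Non-reasoning models would simply omit the field, in which case `reasoning` comes back as an empty string.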

Validation

✅ bun validate passes
✅ bun run build (web) succeeds

Add Neuralwatt as an OpenAI-compatible inference provider with energy-aware GPU optimization.

Includes 14 models across 6 sub-providers (Mistral, ZAI, OpenAI, Moonshot, MiniMax, Qwen). Models include reasoning variants (Kimi K2.5/K2.6, GLM 5.1 FP8, MiniMax M2.5, Qwen3.5 397B, GPT OSS 20B) and fast non-reasoning variants (Kimi K2.5/K2.6 Fast, GLM 5/5.1 Fast, Qwen3.5/3.6 Fast), plus Devstral Small 2 and Qwen3.6 35B A3B.

Logo derived from official Neuralwatt favicon (currentColor variant). Pricing sourced from Neuralwatt's published rates.

The Neuralwatt API now returns accurate pricing and capabilities, eliminating the need for manual patches (patch.json is now empty).

Changes:
- Update pricing for all 14 models from API (significant changes for GLM, GPT-OSS, Qwen, and MiniMax models)
- Devstral Small 2 now supports image input (vision)
- kimi-k2.5-fast now supports image input (vision)
- kimi-k2.6-fast now supports reasoning + image input (was non-reasoning)
- Qwen3.6-35B-A3B now supports reasoning (was non-reasoning)
- GLM models context window: 202,752 → 200,000
- Rename fast variant model IDs to match API (dropped org prefix):
  zai-org/glm-5-fast → glm-5-fast
  zai-org/glm-5.1-fast → glm-5.1-fast
  moonshotai/kimi-k2.5-fast → kimi-k2.5-fast
  moonshotai/kimi-k2.6-fast → kimi-k2.6-fast
  Qwen/qwen3.5-397b-fast → qwen3.5-397b-fast
  Qwen/qwen3.6-35b-fast → qwen3.6-35b-fast
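
The fast-variant renames in this commit all follow one pattern: drop everything up to and including the first `/`. A minimal sketch of that mapping, using the hypothetical helper name `stripOrgPrefix`, might look like this. It is meant only for the fast variants; the other model IDs in this PR keep their org prefix.

```typescript
// Hypothetical helper: removes a leading "org/" segment if present.
// Intended only for the fast variants renamed in this commit.
function stripOrgPrefix(modelId: string): string {
  const slash = modelId.indexOf("/");
  return slash === -1 ? modelId : modelId.slice(slash + 1);
}
```

IDs that already lack a prefix pass through unchanged, so applying the helper twice is harmless.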

monotykamary added 3 commits April 27, 2026 15:01

fix(neuralwatt): sync model capabilities with provider API

051dc62

rekram1-node merged commit d2e16ba into anomalyco:dev May 2, 2026
1 check passed

rekram1-node merged 3 commits into anomalyco:dev from monotykamary:feat/neuralwatt-provider