Skip to content

feat: add Nebius Token Factory models#796

Merged
rekram1-node merged 6 commits intoanomalyco:devfrom
manascb1344:nebius-token-factory-models
Feb 5, 2026
Merged

feat: add Nebius Token Factory models#796
rekram1-node merged 6 commits intoanomalyco:devfrom
manascb1344:nebius-token-factory-models

Conversation

@manascb1344
Copy link
Copy Markdown
Contributor

This PR adds all 45 models available on Nebius Token Factory to the models.dev database. All models are configured with proper pricing, limits, modalities, and capabilities.

New Providers Added (11):

  • MiniMaxAI - MiniMax-M2.1
  • PrimeIntellect - INTELLECT-3
  • black-forest-labs - FLUX.1-schnell, FLUX.1-dev
  • BAAI - bge-multilingual-gemma2, BGE-ICL
  • intfloat - e5-mistral-7b-instruct
  • Google - Gemma-2-2b-it, Gemma-2-9b-it-fast, Gemma-3-27b-it, Gemma-3-27b-it-fast

New Models Added (33 total):

Provider Models
MiniMaxAI MiniMax-M2.1
Z.ai GLM-4.7, GLM-4.7-FP8
DeepSeek DeepSeek-V3.2, DeepSeek-V3-0324 (Base & Fast), DeepSeek-R1-0528 (Base & Fast)
OpenAI gpt-oss-120b, gpt-oss-20b
Moonshot AI Kimi-K2-Instruct, Kimi-K2-Thinking
Qwen Qwen3-Coder-480B-A35B-Instruct, Qwen3-Next-80B-A3B-Thinking, Qwen3-235B-A22B-Thinking-2507, Qwen3-235B-A22B-Instruct-2507, Qwen3-30B-A3B-Thinking-2507, Qwen3-30B-A3B-Instruct-2507, Qwen3-Coder-30B-A3B-Instruct, Qwen3-32B (Base & Fast), Qwen2.5-Coder-7B-fast, Qwen2.5-VL-72B-Instruct, Qwen3-Embedding-8B
NousResearch Hermes-4-405B, Hermes-4-70B
Meta Llama-3.3-70B-Instruct (Base & Fast), Meta-Llama-3.1-8B-Instruct (Base & Fast), Llama-Guard-3-8B
NVIDIA Llama-3.1-Nemotron-Ultra-253B-v1, Nemotron-Nano-V2-12b, NVIDIA-Nemotron-3-Nano-30B-A3B
Prime Intellect INTELLECT-3
BAAI bge-multilingual-gemma2, BGE-ICL
intfloat e5-mistral-7b-instruct
Black Forest Labs FLUX.1-schnell, FLUX.1-dev

Deprecated Models (2):

  • deepseek-ai/DeepSeek-V3 - Replaced by V3.2 and V3-0324
  • meta-llama/Llama-3.1-405B-Instruct - No longer available

Changes:

  • Added 33 new model TOML files
  • Updated 12 existing model files with new pricing and features
  • All models validated with bun validate

Categories Covered:

  • Text-to-text (39 models)
  • Vision (4 models: Gemma-3-27b-it, Gemma-3-27b-it-fast, Qwen2.5-VL-72B-Instruct, Nemotron-Nano-V2-12b)
  • Embeddings (4 models: bge-multilingual-gemma2, BGE-ICL, e5-mistral-7b-instruct, Qwen3-Embedding-8B)
  • Text-to-image (2 models: FLUX.1-schnell, FLUX.1-dev)
  • Safety guardrails (1 model: Llama-Guard-3-8B)

- Add MiniMaxAI provider with MiniMax-M2.1 model
- Add PrimeIntellect provider with INTELLECT-3 model
- Add black-forest-labs provider with FLUX.1-schnell and FLUX.1-dev
- Add BAAI provider with bge-multilingual-gemma2 and BGE-ICL
- Add intfloat provider with e5-mistral-7b-instruct
- Add Google provider with Gemma-2-2b-it, Gemma-2-9b-it-fast, Gemma-3-27b-it, and Gemma-3-27b-it-fast
- Add DeepSeek-V3.2, DeepSeek-V3-0324 (Base & Fast), DeepSeek-R1-0528 (Base & Fast)
- These are new models available on Nebius Token Factory
- Add Qwen3-Next-80B-A3B-Thinking
- Add Qwen3-30B-A3B-Thinking-2507 and Qwen3-30B-A3B-Instruct-2507
- Add Qwen3-Coder-30B-A3B-Instruct
- Add Qwen3-32B (Base & Fast)
- Add Qwen2.5-Coder-7B-fast
- Add Qwen2.5-VL-72B-Instruct
- Add Qwen3-Embedding-8B
- Add GLM-4.7 and GLM-4.7-FP8 (Z.ai)
- Add Kimi-K2-Thinking (Moonshot AI)
- Add Llama-Guard-3-8B, Meta-Llama-3.1-8B-Instruct (Base & Fast) (Meta)
- Add Nemotron-Nano-V2-12b and NVIDIA-Nemotron-3-Nano-30B-A3B (NVIDIA)
- Update GLM-4.5 and GLM-4.5-Air with new pricing
- Update gpt-oss-120b and gpt-oss-20b with new pricing and features
- Update Kimi-K2-Instruct with new pricing and multimodal support
- Update Hermes-4-405B and Hermes-4-70B with new pricing
- Update Llama-3.3-70B-Instruct (Base & Fast) with new pricing
- Mark Llama-3.1-405B-Instruct as deprecated (no longer available)
- Update Llama-3.1-Nemotron-Ultra-253B-v1 with new pricing
- Mark DeepSeek-V3 as deprecated (replaced by V3.2 and V3-0324)
@rekram1-node rekram1-node merged commit 9573a8f into anomalyco:dev Feb 5, 2026
1 check passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants