Ministral 3B
PreviewGive feedback
Model navigation navigation
Source: Un Ministral, des Ministraux - Introducing the world’s best edge models.
We demonstrate the performance of les Ministraux across multiple tasks where they consistently outperform their peers. We re-evaluated all models with our internal framework for fair comparison.
Model | MMLU | AGIEval | Winogrande | Arc-c | TriviaQA |
---|---|---|---|---|---|
Gemma 2 2B | 52.4 | 33.8 | 68.7 | 42.6 | 47.8 |
Llama 3.2 3B | 56.2 | 37.4 | 59.6 | 43.1 | 50.7 |
Ministral 3B | 60.9 | 42.1 | 72.7 | 64.2 | 56.7 |
Mistral 7B | 62.4 | 42.5 | 74.2 | 67.9 | 62.5 |
Llama 3.1 8B | 64.7 | 44.4 | 74.6 | 46.0 | 60.2 |
Ministral 8B | 65.0 | 48.3 | 75.3 | 71.9 | 65.5 |
Model | HumanEval (pass@1) | GSM8K (maj@8) |
---|---|---|
Gemma 2 2B | 20.1 | 35.5 |
Llama 3.2 3B | 29.9 | 37.2 |
Ministral 3B | 34.2 | 50.9 |
Mistral 7B | 26.8 | 51.3 |
Llama 3.1 8B | 37.8 | 61.7 |
Ministral 8B | 34.8 | 64.5 |
Model | French MMLU | German MMLU | Spanish MMLU |
---|---|---|---|
Gemma 2 2B | 41.0 | 40.1 | 41.7 |
Llama 3.2 3B | 42.3 | 42.2 | 43.1 |
Ministral 3B | 49.1 | 48.3 | 49.5 |
Mistral 7B | 50.6 | 49.6 | 51.4 |
Llama 3.1 8B | 50.8 | 52.8 | 54.6 |
Ministral 8B | 57.5 | 57.4 | 59.6 |
Model | MTBench | Arena Hard | Wild bench |
---|---|---|---|
Gemma 2 2B | 7.5 | 51.7 | 32.5 |
Llama 3.2 3B | 7.2 | 46.0 | 27.2 |
Ministral 3B | 8.1 | 64.3 | 36.3 |
Mistral 7B | 6.7 | 44.3 | 33.1 |
Llama 3.1 8B | 7.5 | 62.4 | 37.0 |
Gemma 2 9B | 7.6 | 68.7 | 43.8 |
Ministral 8B | 8.3 | 70.9 | 41.3 |
Model | MBPP (pass@1) | HumanEval (pass@1) | Math (maj@1) |
---|---|---|---|
Gemma 2 2B | 54.5 | 42.7 | 22.8 |
Llama 3.2 3B | 64.6 | 61.0 | 38.4 |
Ministral 3B | 67.7 | 77.4 | 51.7 |
Mistral 7B | 50.2 | 38.4 | 13.2 |
Llama 3.1 8B | 69.7 | 67.1 | 49.3 |
Gemma 2 9B | 68.5 | 67.7 | 47.4 |
Ministral 8B | 70.0 | 76.8 | 54.5 |
About
Ministral 3B is a state-of-the-art Small Language Model (SLM) optimized for edge computing and on-device applications. As it is designed for low-latency and compute-efficient inference, it it also the perfect model for standard GenAI applications that have
Context
131k input · 4k output
Training date
Undisclosed
Rate limit tier
Provider support
Languages
(5)French, German, Spanish, Italian, and English