Releases: ollama/ollama
v0.1.45
What's Changed
- Setting `seed` in the `/v1/chat/completions` OpenAI compatibility endpoint no longer changes `temperature`
- Enhanced GPU discovery and multi-gpu support with concurrency
- Skip searching for network devices in Linux install script
- Report GPU configuration variables in server log
- Update linux ROCm to v6.1.1
- Workaround AMD Vega RX 56 SDMA support on linux
- Fix memory prediction for deepseek v2 models
- Speed up model loading on Windows with CUDA GPUs
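As a sketch of the fixed behavior, a request to the OpenAI-compatible endpoint can now carry both parameters independently. The model name and values below are illustrative, and the payload is only constructed, not sent:

```python
import json

# Request body for POST /v1/chat/completions on a local Ollama server
# (default base URL http://localhost:11434/v1). "llama3" is illustrative.
payload = {
    "model": "llama3",
    "messages": [{"role": "user", "content": "Say hello."}],
    "seed": 42,          # fixes sampling for reproducible output
    "temperature": 0.7,  # no longer silently overridden when seed is set
}

body = json.dumps(payload)
print(body)
```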
New Contributors
- @jayson-cloude made their first contribution in #4972
Full Changelog: v0.1.44...v0.1.45
v0.1.44
What's Changed
- Fixed issue where unicode characters such as emojis would not be loaded correctly when running `ollama create`
- Fixed certain cases where Nvidia GPUs would not be detected and reported as compute capability 1.0 devices
Full Changelog: v0.1.43...v0.1.44
v0.1.43
What's Changed
- New import.md guide for converting and importing models to Ollama
- Fixed issue where embedding vectors resulting from `/api/embeddings` would not be accurate
- JSON mode responses will no longer include invalid escape characters
- Removing a model will no longer show incorrect `File not found` errors
- Fixed issue where running `ollama create` would result in an error on Windows with certain file formatting
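A common way to sanity-check embedding vectors returned by `/api/embeddings` is cosine similarity. A minimal pure-Python sketch, using made-up vectors in place of real API output:

```python
import math

def cosine_similarity(a, b):
    """Cosine similarity between two embedding vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

# Parallel vectors score 1.0; orthogonal vectors score 0.0.
print(cosine_similarity([1.0, 2.0], [2.0, 4.0]))
```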
New Contributors
- @erhant made their first contribution in #4854
- @nischalj10 made their first contribution in #4612
- @dcasota made their first contribution in #4852
- @Napuh made their first contribution in #4084
- @hughescr made their first contribution in #3782
- @jimscard made their first contribution in #3382
Full Changelog: v0.1.42...v0.1.43
v0.1.42
New models
- Qwen 2: a new series of large language models from Alibaba group
What's Changed
- Fixed issue where `qwen2` would output erroneous text such as `GGG` on Nvidia and AMD GPUs
- `ollama pull` is now faster if it detects a model is already downloaded
- `ollama create` will now automatically detect prompt templates for popular model architectures such as Llama, Gemma, Phi and more
- Ollama can now be accessed from local apps built with Electron and Tauri, as well as from apps under development in local HTML files
- Updated the welcome prompt on Windows to `llama3`
- Fixed issues where `/api/ps` and `/api/tags` would show invalid timestamps in responses
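The timestamps in those responses are RFC 3339 strings. Assuming a field such as `expires_at` (the value below is made up), they parse directly with the standard library:

```python
from datetime import datetime

# Example "expires_at" value as returned by /api/ps (illustrative).
ts = "2024-06-10T12:34:56.789012-07:00"
expires = datetime.fromisoformat(ts)
print(expires.tzinfo is not None)
```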
New Contributors
- @shoebham made their first contribution in #4766
- @kartikm7 made their first contribution in #4719
- @royjhan made their first contribution in #4822
Full Changelog: v0.1.41...v0.1.42
v0.1.41
What's Changed
- Fixed issue on Windows 10 and 11 where Ollama would encounter an error on Intel CPUs with integrated GPUs
Full Changelog: v0.1.40...v0.1.41
v0.1.40
New models
- Codestral: Mistral AI’s first-ever code model, designed for code generation tasks.
- IBM Granite Code: now in 3B and 8B parameter sizes.
- Deepseek V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model
What's Changed
- Fixed out of memory and incorrect token issues when running Codestral on 16GB Macs
- Fixed issue where full-width characters (e.g. Japanese, Chinese, Russian) were deleted at the end of the line when using `ollama run`
New Contributors
- @zhewang1-intc made their first contribution in #3278
Full Changelog: v0.1.39...v0.1.40
v0.1.39
New models
- Cohere Aya 23: A new state-of-the-art, multilingual LLM covering 23 different languages.
- Mistral 7B 0.3: A new version of Mistral 7B with initial support for function calling.
- Phi-3 Medium: a 14B parameters, lightweight, state-of-the-art open model by Microsoft.
- Phi-3 Mini 128K and Phi-3 Medium 128K: versions of the Phi-3 models that support a context window size of 128K
- Granite code: A family of open foundation models by IBM for Code Intelligence
Llama 3 import
It is now possible to import and quantize Llama 3 and its finetunes from Safetensors format to Ollama.
First, clone a Hugging Face repo with a Safetensors model:
```
git clone https://huggingface.co/meta-llama/Meta-Llama-3-8B-Instruct
cd Meta-Llama-3-8B-Instruct
```
Next, create a `Modelfile`:
```
FROM .
TEMPLATE """{{ if .System }}<|start_header_id|>system<|end_header_id|>

{{ .System }}<|eot_id|>{{ end }}{{ if .Prompt }}<|start_header_id|>user<|end_header_id|>

{{ .Prompt }}<|eot_id|>{{ end }}<|start_header_id|>assistant<|end_header_id|>

{{ .Response }}<|eot_id|>"""
PARAMETER stop "<|start_header_id|>"
PARAMETER stop "<|end_header_id|>"
PARAMETER stop "<|eot_id|>"
```
Then, create and quantize a model:
```
ollama create --quantize q4_0 -f Modelfile my-llama3
ollama run my-llama3
```
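For reference, the Go template above can be mirrored in Python to see what the rendered prompt looks like. This is a rough sketch of the template's output, not Ollama's actual template engine:

```python
def render_llama3(system, prompt):
    """Rough Python mirror of the Modelfile TEMPLATE above, up to the
    point where the assistant's response would begin."""
    out = ""
    if system:
        out += f"<|start_header_id|>system<|end_header_id|>\n\n{system}<|eot_id|>"
    if prompt:
        out += f"<|start_header_id|>user<|end_header_id|>\n\n{prompt}<|eot_id|>"
    out += "<|start_header_id|>assistant<|end_header_id|>\n\n"
    return out

print(render_llama3("You are terse.", "Hi"))
```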
What's Changed
- Fixed rendering issues with wide characters in languages such as Chinese, Korean, Japanese and Russian
- Added new `OLLAMA_NOHISTORY=1` environment variable that can be set to disable history when using `ollama run`
- New experimental `OLLAMA_FLASH_ATTENTION=1` flag for `ollama serve` that improves token generation speed on Apple Silicon Macs and NVIDIA graphics cards
- Fixed error that would occur on Windows when running `ollama create -f Modelfile`
- `ollama create` can now create models from I-Quant GGUF files
- Fixed `EOF` errors when resuming downloads via `ollama pull`
- Added a `Ctrl+W` shortcut to `ollama run`
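Both settings are plain environment variables. A sketch of launching the server with flash attention enabled from Python; the subprocess call is commented out since it assumes an `ollama` binary on the PATH:

```python
import os

# Build an environment for "ollama serve" with flash attention enabled.
env = dict(os.environ, OLLAMA_FLASH_ATTENTION="1")
# import subprocess
# subprocess.Popen(["ollama", "serve"], env=env)
print(env["OLLAMA_FLASH_ATTENTION"])
```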
New Contributors
- @rapmd73 made their first contribution in #4467
- @sammcj made their first contribution in #4120
- @likejazz made their first contribution in #4535
Full Changelog: v0.1.38...v0.1.39
v0.1.38
New Models
- Falcon 2: A new 11B parameters causal decoder-only model built by TII and trained over 5T tokens.
- Yi 1.5: A new high-performing version of Yi, now licensed as Apache 2.0. Available in 6B, 9B and 34B sizes.
What's Changed
ollama ps
A new command is now available: `ollama ps`. This command displays currently loaded models, their memory footprint, and the processors used (GPU or CPU):
```
% ollama ps
NAME               ID            SIZE    PROCESSOR        UNTIL
mixtral:latest     7708c059a8bb  28 GB   47%/53% CPU/GPU  Forever
llama3:latest      a6990ed6be41  5.5 GB  100% GPU         4 minutes from now
all-minilm:latest  1b226e2802db  585 MB  100% GPU         4 minutes from now
```
/clear
To clear the chat history for a session when running `ollama run`, use `/clear`:
```
>>> /clear
Cleared session context
```
- Fixed issue where switching loaded models on Windows would take several seconds
- Running `/save` will no longer abort the chat session if an incorrect name is provided
- The `/api/tags` API endpoint will now correctly return an empty list `[]` instead of `null` if no models are installed
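The difference matters to clients: iterating over `models` now works without a null guard. A small sketch with hard-coded response bodies standing in for real API output:

```python
import json

fixed = json.loads('{"models": []}')  # response after the fix
old = json.loads('{"models": null}')  # response before the fix

names = [m["name"] for m in fixed["models"]]  # safe: empty list
print(names)
# [m["name"] for m in old["models"]] would raise TypeError (NoneType)
```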
New Contributors
- @fangtaosong made their first contribution in #4387
- @machimachida made their first contribution in #4424
Full Changelog: v0.1.37...v0.1.38
v0.1.37
What's Changed
- Fixed issue where models with uppercase characters in the name would not show with `ollama list`
- Fixed usage string for `ollama create`
- Fixed `finish_reason` being `""` instead of `null` in the OpenAI-compatible chat API
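Clients of the streaming API typically treat a non-null `finish_reason` as end-of-stream, which the empty string broke. A sketch with hand-written chunks in place of a live stream:

```python
import json

# Intermediate vs final streaming chunks (hand-written examples).
chunk_mid = json.loads('{"choices": [{"delta": {"content": "Hi"}, "finish_reason": null}]}')
chunk_end = json.loads('{"choices": [{"delta": {}, "finish_reason": "stop"}]}')

def is_done(chunk):
    # Correct now that intermediate chunks carry null, not "".
    return chunk["choices"][0]["finish_reason"] is not None

print(is_done(chunk_mid), is_done(chunk_end))
```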
New Contributors
- @todashuta made their first contribution in #4362
Full Changelog: v0.1.36...v0.1.37
v0.1.36
What's Changed
- Fixed `exit status 0xc0000005` error with AMD graphics cards on Windows
- Fixed rare out of memory errors when loading a model to run with CPU
Full Changelog: v0.1.35...v0.1.36