Default Keep Alive environment variable #3094

pdevine · 2024-03-13T05:11:04Z

This change adds a new environment variable called OLLAMA_KEEP_ALIVE which sets how long a model will be loaded into memory. It uses the same semantics as the keep_alive parameter in the generate, chat, and embeddings API calls, namely:

if set to a positive value, it will default to whatever time was set
if set to zero it will unload immediately after generation
if set to a negative value it will remain in memory

You can either use a value in seconds (e.g. OLLAMA_KEEP_ALIVE=60 for 60 seconds), or as a duration string (e.g. OLLAMA_KEEP_ALIVE=10m).

This change works with both the API, and with the REPL.

Fixes #2508

uxfion · 2024-03-14T13:03:24Z

Is this effective on /v1/chat/completions endpoint?

--------- Co-authored-by: Chris-AS1 <8493773+Chris-AS1@users.noreply.github.com>

pdevine · 2024-03-14T21:13:03Z

@uxfion This is an environment variable for the server, so will work independently of the OpenAI endpoints (i.e. it will unload the model at whatever time you give it, regardless of how the client is access it).

uxfion · 2024-03-15T01:21:58Z

@uxfion This is an environment variable for the server, so will work independently of the OpenAI endpoints (i.e. it will unload the model at whatever time you give it, regardless of how the client is access it).

~~I mean, when I set OLLAMA_KEEP_ALIVE=-1 and then request through /v1/chat/completions endpoint, it should always save the model in memory, right?~~

I tested it and found that it works very well. Thank you so much for your amazing work👍👍! We really love ollama😻

oxaronick · 2024-03-19T14:50:20Z

This is nice to have, thanks. I'm trying it out now.

However, it looks like the client can still override the server. One feature of @Chris-AS1's solution that I liked was that, as a server admin, I could set a value that overrode client-provided keepalives, preventing one person on a team from unloading the model for everyone else.

Do you think the server value should override the client value?

saffatbokul · 2024-03-25T07:22:49Z

Thanks so much. This will be very useful.

I agree with @oxaronick that the client should not be able to override the server setting for keep_alive.

--------- Co-authored-by: Chris-AS1 <8493773+Chris-AS1@users.noreply.github.com>

BananaAcid · 2024-04-05T02:19:51Z

The pushed solution should probably be for a var named like OLLAMA_KEEP_ALIVE_DEFAULT … since it will only be looked into, if the JSON req.keep_alive is null. Which does not allow the server to overwrite the var currently. Would be great to have one for overwriting as well.

--------- Co-authored-by: Chris-AS1 <8493773+Chris-AS1@users.noreply.github.com>

pdevine added 2 commits March 12, 2024 19:02

add OLLAMA_KEEP_ALIVE env variable to set the default keep alive

7e5e973

handle non-duration values

500d584

jmorganca approved these changes Mar 13, 2024

View reviewed changes

BruceMacD approved these changes Mar 13, 2024

View reviewed changes

Chris-AS1 and others added 2 commits March 13, 2024 11:28

added parsing tests for the session duration

a055e60

remove struct duration test

d8b6b4b

pdevine merged commit 47cfe58 into main Mar 13, 2024
12 checks passed

pdevine deleted the pdevine/defaultkeepalive branch March 13, 2024 20:29

pdevine mentioned this pull request Mar 13, 2024

feat: Support ollama's keep_alive request parameter by overwriting with ENV on ollama serve #2267

Closed

samyfodil pushed a commit to ollama-cloud/ollama-as-wasm-plugin that referenced this pull request Mar 14, 2024

Default Keep Alive environment variable (ollama#3094)

ff4f756

--------- Co-authored-by: Chris-AS1 <8493773+Chris-AS1@users.noreply.github.com>

byebyebruce pushed a commit to byebyebruce/ollama that referenced this pull request Mar 26, 2024

Default Keep Alive environment variable (ollama#3094)

8179c23

--------- Co-authored-by: Chris-AS1 <8493773+Chris-AS1@users.noreply.github.com>

Adphi pushed a commit to Adphi/ollama that referenced this pull request Mar 30, 2024

Default Keep Alive environment variable (ollama#3094)

bc5e3b2

--------- Co-authored-by: Chris-AS1 <8493773+Chris-AS1@users.noreply.github.com>

zhewang1-intc pushed a commit to zhewang1-intc/ollama that referenced this pull request May 13, 2024

Default Keep Alive environment variable (ollama#3094)

d2e9274

--------- Co-authored-by: Chris-AS1 <8493773+Chris-AS1@users.noreply.github.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Default Keep Alive environment variable #3094

Default Keep Alive environment variable #3094

pdevine commented Mar 13, 2024

uxfion commented Mar 14, 2024

pdevine commented Mar 14, 2024

uxfion commented Mar 15, 2024 •

edited

oxaronick commented Mar 19, 2024 •

edited

saffatbokul commented Mar 25, 2024

BananaAcid commented Apr 5, 2024 •

edited

Default Keep Alive environment variable #3094

Default Keep Alive environment variable #3094

Conversation

pdevine commented Mar 13, 2024

uxfion commented Mar 14, 2024

pdevine commented Mar 14, 2024

uxfion commented Mar 15, 2024 • edited

oxaronick commented Mar 19, 2024 • edited

saffatbokul commented Mar 25, 2024

BananaAcid commented Apr 5, 2024 • edited

uxfion commented Mar 15, 2024 •

edited

oxaronick commented Mar 19, 2024 •

edited

BananaAcid commented Apr 5, 2024 •

edited