
API Endpoint for Listing Loaded Running Models #4013

Closed
strikeoncmputrz opened this issue Apr 29, 2024 · 3 comments · Fixed by #4327
Labels: feature request

Comments

strikeoncmputrz commented on Apr 29, 2024

It would be excellent to be able to interrogate the API to determine which models are running at any given time, rather than just seeing which checkpoints were pulled.

I use a variety of clients to interact with Ollama's API. I sometimes run models with a long keep_alive and assume others have similar use cases.

The only way I know of to identify a running model is through processes: `ps aux | grep -- '--model' | grep -v grep | grep -Po '(?<=--model\s).*' | cut -d ' ' -f1`. This will give you the full path to the model's blob. From there, you can compare that with the output of `ollama show --modelfile` (or the `/api/show` endpoint).

I checked the open issues and reddit and didn't see any similar RFIs or requests.

I wrote a bash script (depends on jq) that implements this as a POC.
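
For reference, here is a minimal sketch of that approach (not the original script): it assumes jq is installed, a single model is running, and the server is at the default address, and it maps the runner's blob path back to a model name via `/api/tags` and `/api/show`.

```bash
#!/usr/bin/env bash
# POC sketch: resolve the blob path of the running model back to a model name.
# Assumptions: jq installed, Ollama server on the default port, one runner process.

# Blob path taken from the runner process's --model flag.
blob="$(ps aux | grep -- '--model' | grep -v grep \
  | grep -Po '(?<=--model\s)\S+' | head -n1)"

if [ -z "$blob" ]; then
  echo "no model appears to be running" >&2
  exit 1
fi

# Check each pulled model's Modelfile for a FROM line pointing at that blob.
curl -s http://localhost:11434/api/tags | jq -r '.models[].name' |
while read -r name; do
  from="$(curl -s http://localhost:11434/api/show -d "{\"name\": \"$name\"}" \
    | jq -r '.modelfile' | awk '/^FROM /{print $2; exit}')"
  if [ "$from" = "$blob" ]; then
    echo "running: $name"
  fi
done
```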

strikeoncmputrz added the feature request label on Apr 29, 2024
pdevine (Contributor) commented on Apr 29, 2024

I think this would be great, along with an `ollama ps` command that shows which models are currently loaded in memory. It should also include when each model's TTL is going to expire.
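
For anyone finding this later: the fix in #4327 shipped this as `ollama ps`, backed by a `GET /api/ps` endpoint. A minimal usage sketch, assuming the default server address and that each entry in the response carries an `expires_at` timestamp:

```bash
# List loaded models and when their keep-alive expires.
curl -s http://localhost:11434/api/ps \
  | jq -r '.models[] | "\(.name)\texpires_at: \(.expires_at)"'
```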

strikeoncmputrz (Author) commented

TTL is a great idea!

unmotivatedgene commented

Yes, please add this, especially with the new concurrency options; I want to know which models are sticking around and taking up all my VRAM.
