Ollama: Pull models automatically at startup #1554
Conversation
ThomasVitale commented on Oct 17, 2024:
- Introduce support for Ollama model auto-pull at startup time
- Enhance support for Ollama model auto-pull at run time
- Update documentation about integrating with Ollama and managing models
- Adopt Builder pattern in Ollama Model classes for better code readability
- Unify Ollama model auto-pull functionality in production and test code
- Improve integration tests for Ollama with Testcontainers
```java
public OllamaChatModel(OllamaApi ollamaApi, OllamaOptions defaultOptions,
        FunctionCallbackContext functionCallbackContext) {
    this(ollamaApi, defaultOptions, functionCallbackContext, List.of());
}

private ChatModelObservationConvention observationConvention = DEFAULT_OBSERVATION_CONVENTION;
```
The argument list grew so much that I didn't want to add even more overloaded constructors. Instead, I introduced a Builder to help make this whole initialisation code more readable.
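To illustrate the design choice, here is a minimal, hypothetical sketch of the Builder idea: the collaborator types are simplified stand-ins, and the field and method names are assumptions for illustration, not the actual Spring AI API surface.

```java
// Hypothetical, simplified sketch of a Builder for OllamaChatModel.
// The record types below are stand-ins; the real builder has more fields.
import java.util.List;

record OllamaApi(String baseUrl) {}
record OllamaOptions(String model) {}

final class OllamaChatModel {
    final OllamaApi api;
    final OllamaOptions options;
    final List<String> toolNames;

    private OllamaChatModel(Builder b) {
        this.api = b.api;
        this.options = b.options;
        this.toolNames = b.toolNames;
    }

    static Builder builder() { return new Builder(); }

    // Each setter returns the builder itself, so construction reads as a
    // fluent chain instead of a long positional constructor call.
    static final class Builder {
        private OllamaApi api;
        private OllamaOptions options = new OllamaOptions("llama3.1");
        private List<String> toolNames = List.of();

        Builder ollamaApi(OllamaApi api) { this.api = api; return this; }
        Builder defaultOptions(OllamaOptions o) { this.options = o; return this; }
        Builder toolNames(List<String> names) { this.toolNames = names; return this; }
        OllamaChatModel build() { return new OllamaChatModel(this); }
    }
}

class BuilderDemo {
    public static void main(String[] args) {
        OllamaChatModel model = OllamaChatModel.builder()
            .ollamaApi(new OllamaApi("http://localhost:11434"))
            .defaultOptions(new OllamaOptions("qwen2.5:3b"))
            .build();
        System.out.println(model.options.model());
    }
}
```

Compared with overloaded constructors, every optional collaborator gets a named setter with a sensible default, so call sites only mention what they customise.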
```java
public OllamaEmbeddingModel(OllamaApi ollamaApi, OllamaOptions defaultOptions) {
    this(ollamaApi, defaultOptions, ObservationRegistry.NOOP);
}

private EmbeddingModelObservationConvention observationConvention = DEFAULT_OBSERVATION_CONVENTION;

public OllamaEmbeddingModel(OllamaApi ollamaApi, OllamaOptions defaultOptions,
```
Here too, I introduced a Builder.
```diff
@@ -954,13 +957,15 @@ public record ProgressResponse(
  * Download a model from the Ollama library. Cancelled pulls are resumed from where they left off,
  * and multiple calls will share the same download progress.
  */
-public ProgressResponse pullModel(PullModelRequest pullModelRequest) {
-    return this.restClient.post()
+public Flux<ProgressResponse> pullModel(PullModelRequest pullModelRequest) {
```
Using the streaming option is preferable because it gives us continuous status updates on the download, which we can log to keep the user up to date. It also makes it easy to define timeouts and retries.
```java
 */
public enum PullModelStrategy {

    /**
```
When pulling models, there are two options here: always (guarantees the latest model version) and when_missing (faster, but the local model could be stale).
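A self-contained sketch of what those strategies could look like as an enum; the two strategy names mirror the PR description, while the NEVER no-op fallback and the `shouldPull` helper are assumptions added for illustration.

```java
// Illustrative sketch of the pull strategies described above.
// ALWAYS and WHEN_MISSING come from the PR text; NEVER and the
// shouldPull helper are hypothetical additions for this example.
import java.util.Set;

enum PullStrategySketch {
    ALWAYS,       // pull on every startup: guarantees the latest model version
    WHEN_MISSING, // pull only if absent locally: faster, but may be stale
    NEVER;        // never pull automatically

    boolean shouldPull(String model, Set<String> locallyAvailable) {
        return switch (this) {
            case ALWAYS -> true;
            case WHEN_MISSING -> !locallyAvailable.contains(model);
            case NEVER -> false;
        };
    }
}

class StrategyDemo {
    public static void main(String[] args) {
        Set<String> local = Set.of("qwen2.5:3b");
        System.out.println(PullStrategySketch.WHEN_MISSING.shouldPull("qwen2.5:3b", local)); // false
        System.out.println(PullStrategySketch.ALWAYS.shouldPull("qwen2.5:3b", local));       // true
    }
}
```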
```diff
-ollamaApi.pullModel(new OllamaApi.PullModelRequest(model));
-logger.info("Completed pulling the '{}' model", model);
+var ollamaModelManager = new OllamaModelManager(ollamaApi);
+ollamaModelManager.pullModel(model, PullModelStrategy.WHEN_MISSING);
```
The integration test setup now uses the same auto-pull functionality as the production code.
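A hypothetical, simplified model manager showing the shared auto-pull path in isolation; the class and method names echo the diff above, but the in-memory pull logic here is a stand-in for the real Ollama API calls.

```java
// Hypothetical, simplified sketch of a shared model manager so that tests
// and production code take the same auto-pull path. The in-memory set is a
// stand-in for querying the local Ollama instance.
import java.util.HashSet;
import java.util.Set;

class ModelManagerSketch {
    private final Set<String> localModels = new HashSet<>();
    private int pullCount = 0;

    enum PullStrategy { ALWAYS, WHEN_MISSING }

    // Stand-in for calling the real Ollama pull endpoint.
    private void doPull(String model) {
        pullCount++;
        localModels.add(model);
    }

    void pullModel(String model, PullStrategy strategy) {
        boolean missing = !localModels.contains(model);
        if (strategy == PullStrategy.ALWAYS || (strategy == PullStrategy.WHEN_MISSING && missing)) {
            doPull(model);
        }
    }

    int pullCount() { return pullCount; }
}

class ManagerDemo {
    public static void main(String[] args) {
        var manager = new ModelManagerSketch();
        manager.pullModel("qwen2.5:3b", ModelManagerSketch.PullStrategy.WHEN_MISSING);
        manager.pullModel("qwen2.5:3b", ModelManagerSketch.PullStrategy.WHEN_MISSING); // no-op: already local
        System.out.println(manager.pullCount()); // 1
    }
}
```

With WHEN_MISSING, repeated test runs against a warm Testcontainers instance skip the download entirely, which is exactly the benefit of unifying the two code paths.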
```diff
@@ -52,23 +52,24 @@ class OllamaChatModelFunctionCallingIT extends BaseOllamaIT {

 private static final Logger logger = LoggerFactory.getLogger(OllamaChatModelFunctionCallingIT.class);

-private static final String MODEL = OllamaModel.LLAMA3_1.getName();
+private static final String MODEL = "qwen2.5:3b";
```
I got many failures with llama3.1, even after refining the prompt. With qwen2.5:3b I got all green results, and it's even a smaller model (so better for integration testing).
```java
            logger.info("Pulling the '{}' model - Status: {}", modelName,
                    progressResponses.get(progressResponses.size() - 1).status());
        }
    })
    .takeUntil(progressResponses -> progressResponses.get(0) != null
            && progressResponses.get(0).status().equals("success"))
```
Here we continuously print out the current status of the download, and the whole operation is configured with timeout and retry.
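An illustrative end-to-end sketch of that streaming loop, written against Project Reactor (which the diff's `Flux` return type implies). The `ProgressResponse` record and the stubbed `pullModel` source are stand-ins; the specific timeout and retry values are assumptions, not the PR's actual configuration.

```java
// Illustrative Reactor sketch of the streaming pull loop: log each progress
// event, stop at "success", and guard the pull with a timeout and retries.
// Requires reactor-core; ProgressResponse and pullModel are stand-ins.
import java.time.Duration;
import reactor.core.publisher.Flux;

public class PullProgressDemo {

    record ProgressResponse(String status) {}

    // Stand-in for OllamaApi#pullModel returning a stream of progress events.
    static Flux<ProgressResponse> pullModel(String model) {
        return Flux.just(
                new ProgressResponse("pulling manifest"),
                new ProgressResponse("downloading"),
                new ProgressResponse("success"));
    }

    public static void main(String[] args) {
        String model = "qwen2.5:3b";
        ProgressResponse last = pullModel(model)
            // Keep the user up to date while the download runs.
            .doOnNext(p -> System.out.println(
                    "Pulling the '" + model + "' model - Status: " + p.status()))
            // Stop the stream as soon as the server reports success.
            .takeUntil(p -> "success".equals(p.status()))
            // Fail the pull if no progress event arrives within the window...
            .timeout(Duration.ofMinutes(5))
            // ...and retry the whole pull a couple of times on failure.
            .retry(2)
            .blockLast();
        System.out.println("Final status: " + last.status());
    }
}
```

Because `timeout` watches the gap between progress events rather than the total duration, a slow but steadily progressing download is not cut off, while a stalled one fails fast and is retried.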
```java
@ConfigurationProperties(OllamaInitializationProperties.CONFIG_PREFIX)
public class OllamaInitializationProperties {

    public static final String CONFIG_PREFIX = "spring.ai.ollama.init";
```
The naming tries to be consistent with other similar Spring Boot features, like spring.sql.init to initialise database schemas.
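For context, a sketch of how such a prefix is typically used in application configuration; only the `spring.ai.ollama.init` prefix comes from the snippet above, so the key name and value here are assumptions for illustration.

```properties
# Hypothetical example: the spring.ai.ollama.init prefix is from the PR;
# the exact key and value below are illustrative assumptions.
spring.ai.ollama.init.pull-model-strategy=when_missing
```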
Great stuff @ThomasVitale, thanks.
rebased and merged at 8eef6e6
@tzolov thanks! I like the idea of the list of models. I'll work on a follow-up PR, including also a couple of improvements to add a bit more flexibility.