Feature/ollama service#59
Reviewer's Guide

This PR integrates an Ollama service into the development workflow by adding it to Docker Compose with GPU support and persistent storage, exposing Makefile commands for Ollama operations, and configuring VSCode to start Ollama automatically before debugging.

Flow diagram for Makefile Ollama commands

```mermaid
flowchart TD
    A["Developer runs Makefile command"] --> B{"Which command?"}
    B -->|ollama-up| C["Start Ollama container"]
    B -->|ollama-stop| D["Stop Ollama container"]
    B -->|ollama-restart| E["Restart Ollama container"]
    B -->|ollama-pull| F["Pull model in Ollama container"]
    B -->|ollama-run| G["Run model in Ollama container"]
    B -->|ollama-list| H["List models in Ollama container"]
    B -->|ollama-rm| I["Remove model from Ollama container"]
```
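The Makefile itself is not reproduced in full in this thread. As a rough sketch of what the seven targets above might look like, assuming an `ollama` service defined in `docker-compose.yml` (target names follow the diagram; the exact recipes and flags are illustrative, not copied from the PR):

```makefile
# Illustrative sketch only; the PR's actual Makefile may differ.
# Recipe lines must be indented with tabs.
.PHONY: ollama-up ollama-stop ollama-restart ollama-pull ollama-run ollama-list ollama-rm

ollama-up:        ## Start the Ollama container (reuses an existing one)
	docker compose up -d --no-recreate ollama

ollama-stop:      ## Stop the Ollama container
	docker compose stop ollama

ollama-restart:   ## Restart the Ollama container
	docker compose restart ollama

ollama-pull:      ## Pull a model, e.g. make ollama-pull MODEL=llama3
ifndef MODEL
	$(error MODEL variable is required, e.g. make ollama-pull MODEL=llama3)
endif
	docker compose exec ollama ollama pull $(MODEL)

ollama-run:       ## Run a model interactively inside the container
ifndef MODEL
	$(error MODEL variable is required, e.g. make ollama-run MODEL=llama3)
endif
	docker compose up -d --no-recreate ollama
	docker compose exec -it ollama ollama run $(MODEL)

ollama-list:      ## List models available in the container
	docker compose exec ollama ollama list

ollama-rm:        ## Remove a model from the container
ifndef MODEL
	$(error MODEL variable is required, e.g. make ollama-rm MODEL=llama3)
endif
	docker compose exec ollama ollama rm $(MODEL)
```

Usage would then be, for example, `make ollama-pull MODEL=llama3`.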
Pull Request Overview
This PR integrates Ollama as a containerized service into the development and deployment workflow. The changes add Docker Compose configuration for running Ollama with GPU support, provide developer-friendly Makefile commands for managing the service and models, and configure VSCode to automatically start Ollama before debugging.
- Added Ollama service to docker-compose.yml with GPU support and persistent storage
- Created Makefile targets for common Ollama operations (start, stop, pull, list models, etc.)
- Integrated Ollama startup into VSCode debugging workflow via preLaunchTask (see the launch configuration sketch below)
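For reference, the wiring between the debugger and the task typically looks like the excerpt below. This is a sketch, not the PR's file: the task label "Start Ollama service" and the `envFile`/`python` paths come from this review, while the configuration name, `"type": "debugpy"`, and the `app.main` module are placeholders.

```jsonc
// .vscode/launch.json (illustrative excerpt)
{
    "version": "0.2.0",
    "configurations": [
        {
            "name": "Python: API",
            "type": "debugpy",
            "request": "launch",
            "module": "app.main",                  // placeholder entry point
            "envFile": "${workspaceFolder}/.env",
            "python": "${workspaceFolder}/.venv/bin/python",
            // Must match the "label" of the task defined in .vscode/tasks.json
            "preLaunchTask": "Start Ollama service"
        }
    ]
}
```

The key point is that `preLaunchTask` refers to the task by its label, so VSCode runs `make ollama-up` through that task before attaching the debugger.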
Reviewed Changes
Copilot reviewed 4 out of 4 changed files in this pull request and generated 6 comments.
| File | Description |
|---|---|
| docker-compose.yml | Adds the Ollama service with GPU configuration and a persistent volume, and makes the API services depend on it |
| Makefile | Introduces 7 new targets for managing Ollama service lifecycle and model operations |
| .vscode/tasks.json | Defines "Start Ollama service" task that executes the Makefile ollama-up command |
| .vscode/launch.json | Configures Python debugger to automatically start Ollama service before launching |
docker-compose.yml (quoted diff):

```yaml
    deploy:
      resources:
        reservations:
          devices:
            - driver: nvidia
              count: all
              capabilities: [gpu]
```
The Ollama service configuration assumes GPU availability via NVIDIA drivers, but there's no fallback configuration for CPU-only environments. This will cause the service to fail in environments without NVIDIA GPUs or proper drivers.
Consider adding a CPU-only profile or making GPU configuration optional:
```yaml
  ollama:
    image: ollama/ollama
    container_name: ollama
    profiles:
      - gpu
    deploy:
      resources:
        reservations:
          devices:
            - driver: nvidia
              count: all
              capabilities: [gpu]
    # ... rest of config

  ollama-cpu:
    image: ollama/ollama
    container_name: ollama
    profiles:
      - cpu
    # ... config without GPU requirements
```

.vscode/launch.json (quoted diff):

```diff
             "subProcess": false,
             "envFile": "${workspaceFolder}/.env",
-            "python": "${workspaceFolder}/.venv/bin/python"
+            "python": "${workspaceFolder}/.venv/bin/python",
```
[nitpick] The preLaunchTask "Start Ollama service" will run every time you start debugging in VSCode, but the task doesn't check if Ollama is already running. The --no-recreate flag in the Makefile will prevent recreating existing containers, but the task will still execute and potentially cause delays or confusion if it fails.
Consider adding task properties to handle this better:
```jsonc
{
    "label": "Start Ollama service",
    "type": "shell",
    "command": "make",
    "args": ["ollama-up"],
    "options": {
        "cwd": "${workspaceFolder}"
    },
    "problemMatcher": [],
    "isBackground": false,
    "presentation": {
        "reveal": "silent",
        "panel": "shared"
    }
}
```

This makes the task output less intrusive when Ollama is already running.
| "python": "${workspaceFolder}/.venv/bin/python", | |
| "python": "${workspaceFolder}/.venv/bin/python", | |
| // NOTE: To avoid intrusive output and redundant execution, ensure the "Start Ollama service" task in .vscode/tasks.json uses: | |
| // "presentation": { "reveal": "silent", "panel": "shared" } |
docker-compose.yml (quoted diff):

```yaml
  ollama:
    image: ollama/ollama
    container_name: ollama
    deploy:
      resources:
        reservations:
          devices:
            - driver: nvidia
              count: all
              capabilities: [gpu]
    ports:
      - "11434:11434"
    volumes:
      - ollama:/root/.ollama
    restart: always

volumes:
  ollama:
```
The Ollama integration is missing documentation. The README should be updated to explain:
- That Ollama is now a dependency and what it's used for
- How to set up Ollama for development (including GPU requirements)
- Available Makefile commands for Ollama management
- How to configure which models to use
- How to run the API without Ollama (if applicable)
This is especially important since the Quick Start section shows docker run commands that won't include the Ollama service, potentially causing confusion for users.
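One way to reduce that confusion, offered here as a hedged suggestion rather than something present in this PR, is for the Quick Start to use Compose so both services come up together. The `aymurai-api` service name is taken from the compose snippet quoted later in this review:

```sh
# Illustrative alternative to a bare `docker run`: start Ollama and the API together,
# letting Compose handle the startup order between the two services.
docker compose up -d ollama aymurai-api
```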
docker-compose.yml (quoted diff):

```yaml
    depends_on:
      - ollama
```
The depends_on directive only ensures startup order, not readiness. The API services will start as soon as the Ollama container starts, but Ollama may not be ready to accept requests yet. Consider using a health check or wait-for script to ensure Ollama is actually ready before the API services attempt to connect to it.
Example with healthcheck:
```yaml
  ollama:
    image: ollama/ollama
    healthcheck:
      test: ["CMD", "ollama", "list"]
      interval: 10s
      timeout: 5s
      retries: 5
    # ... rest of config

  aymurai-api:
    depends_on:
      ollama:
        condition: service_healthy
```

Makefile (quoted diff):

```makefile
ifndef MODEL
	$(error MODEL variable is required, e.g. make ollama-run MODEL=llama3)
endif
	docker compose up -d --no-recreate ollama
```
The -it flag in docker compose exec -it requires an interactive terminal, which will cause this Makefile target to fail when run in non-interactive environments (CI/CD pipelines, automated scripts, VSCode tasks, etc.). Consider removing the -it flag or providing separate targets for interactive and non-interactive use.
Recommended change:
```makefile
ollama-run:
ifndef MODEL
	$(error MODEL variable is required, e.g. make ollama-run MODEL=llama3)
endif
	docker compose up -d --no-recreate ollama
	docker compose exec ollama ollama run $(MODEL)
```

Or add a separate interactive target:

```makefile
ollama-run-interactive:
ifndef MODEL
	$(error MODEL variable is required, e.g. make ollama-run-interactive MODEL=llama3)
endif
	docker compose up -d --no-recreate ollama
	docker compose exec -it ollama ollama run $(MODEL)
```

Suggested change (Makefile):

```diff
 	docker compose up -d --no-recreate ollama
+	docker compose exec ollama ollama run $(MODEL)
+
+ollama-run-interactive:
+ifndef MODEL
+	$(error MODEL variable is required, e.g. make ollama-run-interactive MODEL=llama3)
+endif
+	docker compose up -d --no-recreate ollama
```
docker-compose.yml (quoted diff):

```yaml
    depends_on:
      - ollama
```
The API services have a hard dependency on Ollama being available, but there's no evidence in the codebase that the API actually uses Ollama (no imports found, no Ollama-related configuration). This creates an unnecessary runtime dependency that will prevent the API from starting if Ollama fails.
If Ollama is optional or only used for specific features, consider:
- Making the depends_on conditional or removing it
- Implementing graceful degradation in the API code when Ollama is unavailable
- Adding connection retry logic with appropriate timeouts (see the sketch below)
If Ollama is truly required, ensure the API code properly handles connection failures and documents this dependency.
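As a rough sketch of the retry and graceful-degradation idea from the list above, assuming the API reaches Ollama at the Compose service name `ollama` on port 11434; none of the module or function names below come from this repository:

```python
"""Hypothetical readiness check for the Ollama dependency.

Names and structure are illustrative; nothing here is taken from the repository.
"""
import time
import urllib.error
import urllib.request

# Compose service name and port as defined in this PR's docker-compose.yml.
OLLAMA_URL = "http://ollama:11434"


def wait_for_ollama(timeout: float = 60.0, interval: float = 2.0) -> bool:
    """Poll Ollama's HTTP endpoint until it responds or the timeout expires."""
    deadline = time.monotonic() + timeout
    while time.monotonic() < deadline:
        try:
            # The server's root endpoint responds once Ollama is up and listening.
            with urllib.request.urlopen(OLLAMA_URL, timeout=5) as response:
                if response.status == 200:
                    return True
        except (urllib.error.URLError, OSError):
            pass  # not reachable yet; retry after a short pause
        time.sleep(interval)
    return False


if __name__ == "__main__":
    if wait_for_ollama():
        print("Ollama reachable: enabling LLM-backed features.")
    else:
        # Graceful degradation: the API can still start, with LLM features disabled.
        print("Ollama unreachable: continuing without LLM-backed features.")
```

A check like this could run once at startup (or per request with a cached result), letting the API come up even when the Ollama container is absent or still warming up.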
This pull request adds support for managing an Ollama service as part of the development and deployment workflow. The changes introduce a new Ollama container to the Docker Compose setup, provide Makefile commands for common Ollama operations, and integrate Ollama startup into the VSCode development environment.
Docker and service integration:
- Added an `ollama` service to `docker-compose.yml` with GPU support, persistent storage, and automatic restart, and made API services depend on it to ensure proper startup order. [1] [2] [3]

Developer tooling and automation:

- Added a VSCode task (`tasks.json`) to start the Ollama service using the new Makefile command, and configured the Python launch configuration to run this task before debugging. [1] [2]

Summary by Sourcery
Integrate an Ollama service into the development workflow by adding it to Docker Compose, providing Makefile commands for model management, and automating its startup in VSCode.
New Features: