Add Ollama provider option for LLMs and embeddings#475
Merged
Conversation
Co-authored-by: sudoleg <54912398+sudoleg@users.noreply.github.com>
Copilot (AI) changed the title from [WIP] Add Ollama as LLM provider for local execution to Add Ollama provider option for LLMs and embeddings on Dec 13, 2025
sudoleg (Contributor) approved these changes on Dec 14, 2025
Pull request overview
This PR adds Ollama as an alternative LLM provider to OpenAI, enabling fully local execution without requiring an OpenAI API key. The implementation includes model discovery, pulling capabilities, and embedding support with appropriate provider-specific handling throughout the application.
Key changes:
- Added Ollama helper functions for availability checks, model listing/filtering, and model pulling
- Updated UI to support provider selection with appropriate warnings and model availability checks
- Modified summary and chat pages to instantiate provider-specific LLM and embedding models
- Fixed typo in function name: get_preffered_languages → get_preferred_languages
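The helper layer summarized above (availability check, model listing with embedding detection) could look roughly like the sketch below. The function names, the name-based embedding heuristic, and the default base URL are illustrative assumptions, not the PR's exact code:

```python
def ollama_is_available(base_url: str = "http://localhost:11434") -> bool:
    """Return True if an Ollama server answers at base_url.

    Hypothetical helper: the import is deferred so the app still runs
    when the ollama package is not installed, and any failure (missing
    package, unreachable server) is reported as "not available".
    """
    try:
        import ollama  # lazy import; optional dependency

        ollama.Client(host=base_url).list()
        return True
    except Exception:
        return False


# Name substrings that usually indicate an embedding-only model.
# This heuristic is an assumption for illustration, not the PR's logic.
EMBEDDING_HINTS = ("embed", "bge", "minilm")


def split_models(model_names: list[str]) -> tuple[list[str], list[str]]:
    """Split installed model names into (chat_models, embedding_models)."""
    chat, embed = [], []
    for name in model_names:
        target = embed if any(h in name.lower() for h in EMBEDDING_HINTS) else chat
        target.append(name)
    return chat, embed
```

Keeping the availability check failure-tolerant means the UI can simply hide or warn about the Ollama option when no local server is running.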
Reviewed changes
Copilot reviewed 10 out of 11 changed files in this pull request and generated 10 comments.
Show a summary per file
| File | Description |
|---|---|
| modules/helpers.py | Added Ollama integration functions (availability check, model listing with embedding detection, model pulling), fixed typo in language preference function name, updated token counting with fallback |
| modules/ui.py | Added provider selection dropdown, model availability checks with warnings for both OpenAI and Ollama |
| modules/summary.py | Changed to use BaseChatModel interface, added Ollama context window detection via ollama.show(), instantiate provider-specific chat models |
| pages/summary.py | Added provider readiness checks and conditional LLM instantiation for OpenAI/Ollama with appropriate warnings |
| pages/chat.py | Added embedding provider metadata storage, provider-specific embedding model selection with pull button for Ollama, validation of embedding availability during retrieval |
| modules/youtube.py | Fixed typo: get_preffered_languages → get_preferred_languages |
| tests/test_ollama_helpers.py | Added unit tests for Ollama helper functions (availability check, model filtering, pull failure handling) |
| tests/test_summary.py | Commented out context window test that needs updating for multi-provider support |
| requirements.txt | Added langchain-ollama==1.0.1 and ollama==0.6.1 dependencies |
| docker-compose.yml | Commented out Ollama service configuration |
| README.md | Updated documentation to mention Ollama support and configuration requirements |
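A plausible shape for the provider-specific instantiation described for pages/summary.py and pages/chat.py is a small factory with deferred imports, so only the selected provider's package must be installed. This is a sketch under assumed names, not the PR's exact code:

```python
def make_chat_model(provider: str, model: str, temperature: float = 0.0):
    """Return a LangChain chat model for the chosen provider.

    Hypothetical factory: imports happen inside each branch so that
    installing langchain-openai or langchain-ollama alone is enough.
    """
    if provider == "openai":
        from langchain_openai import ChatOpenAI

        return ChatOpenAI(model=model, temperature=temperature)
    if provider == "ollama":
        from langchain_ollama import ChatOllama

        return ChatOllama(model=model, temperature=temperature)
    raise ValueError(f"Unknown provider: {provider!r}")
```

Both classes implement LangChain's BaseChatModel interface, which is what lets modules/summary.py accept either provider through a single code path.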
Comments suppressed due to low confidence (1)
tests/test_summary.py:75
- This comment appears to contain commented-out code.
```python
# def test_transcript_too_long_exception(mock_llm):
#     # Create a transcript that exceeds the context window
#     transcript = "word " * OPENAI_CONTEXT_WINDOWS["gpt-3.5-turbo"]["total"]
#
#     with pytest.raises(
#         expected_exception=TranscriptTooLongForModelException,
#         match="Your transcript exceeds the context window of the chosen model",
#     ) as exc_info:
#         get_transcript_summary(transcript_text=transcript, llm=mock_llm)
```
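If this test is revived for multi-provider support, the check it exercises could be factored into a provider-agnostic guard along these lines (the guard's name is an assumption; the exception name and message come from the original test):

```python
class TranscriptTooLongForModelException(Exception):
    """Raised when a transcript exceeds the model's context window."""


def ensure_fits_context(token_count: int, context_window: int) -> None:
    """Hypothetical guard shared by OpenAI and Ollama code paths.

    The caller would obtain context_window from OPENAI_CONTEXT_WINDOWS
    for OpenAI models, or from ollama.show() metadata for Ollama models.
    """
    if token_count > context_window:
        raise TranscriptTooLongForModelException(
            "Your transcript exceeds the context window of the chosen model"
        )
```

With the window lookup moved behind this guard, the same test body can be parametrized over providers instead of being hard-coded to gpt-3.5-turbo.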
…gth exception test
…h local ollama server
…ken counting logic in transcript summary
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
Co-authored-by: sudoleg <54912398+sudoleg@users.noreply.github.com>
Contributor
Author
Just as a heads up, I was blocked by some firewall rules while working on your feedback. I tried to connect to the following addresses, but was blocked by firewall rules:
If you need me to access, download, or install something from one of these locations, you can either:
This reverts commit 37b673e.
…model for non-English videos
sudoleg approved these changes on Dec 21, 2025
Enables choosing Ollama as an alternative to OpenAI for chat/summarization and embeddings, including model discovery and pull flows, so the app can run fully locally without an OpenAI key.
Adds langchain-ollama and ollama deps; new unit tests for Ollama helpers.