Feature/llm providers #60
Reviewer's Guide

This PR introduces a standardized `LLMProvider` abstraction with built-in token counting and chunking utilities, implements a concrete `OllamaLLMProvider` adapter for synchronous and streaming calls (including message construction and metadata parsing), and adds an example notebook alongside unit tests to validate both the base and Ollama provider behaviors.

Sequence diagram for OllamaLLMProvider.generate() interaction with Ollama API:

sequenceDiagram
participant User
participant OllamaLLMProvider
participant OllamaAPI
User->>OllamaLLMProvider: generate(prompt)
OllamaLLMProvider->>OllamaLLMProvider: _build_messages(prompt)
OllamaLLMProvider->>OllamaAPI: chat(model, messages, keep_alive, ...)
OllamaAPI-->>OllamaLLMProvider: response
OllamaLLMProvider->>User: LLMResponse(text, metadata, raw)
Sequence diagram for OllamaLLMProvider.stream() interaction with Ollama API:

sequenceDiagram
participant User
participant OllamaLLMProvider
participant OllamaAPI
User->>OllamaLLMProvider: stream(prompt)
OllamaLLMProvider->>OllamaLLMProvider: _build_messages(prompt)
OllamaLLMProvider->>OllamaAPI: chat(model, messages, keep_alive, stream=True)
loop For each chunk
OllamaAPI-->>OllamaLLMProvider: chunk
OllamaLLMProvider->>User: LLMResponse(text, metadata, raw)
end
Class diagram for the new LLMProvider abstraction and OllamaLLMProvider implementation:

classDiagram
class LLMProvider {
<<abstract>>
+model_name: str
+_tokenizer: TokenizerType | None
+max_context_tokens: int | None
+chunk_overlap: int
+__init__(model, tokenizer, max_context_tokens, chunk_overlap)
+generate(prompt, **kwargs) LLMResponse
+stream(prompt, **kwargs)
+count_tokens(text) int
+chunk_text(text, max_tokens, overlap) list[DocumentChunk]
+_tokenize(text) Sequence[Any]
+_overlap_tail(words, overlap_tokens) list[str]
}
class OllamaLLMProvider {
+system_prompt: str | None
+keep_alive: int | str | None
+__init__(model, system_prompt, keep_alive, **kwargs)
+generate(prompt, messages, **kwargs) LLMResponse
+stream(prompt, messages, **kwargs) Iterator[LLMResponse]
+_build_messages(prompt, messages) list[dict[str, str]]
}
class LLMResponse {
+text: str
+metadata: dict[str, Any]
+raw: Any | None
}
class DocumentChunk {
+text: str
+token_count: int
+index: int
}
LLMProvider <|-- OllamaLLMProvider
LLMProvider o-- LLMResponse
LLMProvider o-- DocumentChunk
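For orientation, here is a rough usage sketch of the interface described above; the import path and constructor arguments are inferred from this PR's tests and diagrams, so treat them as assumptions rather than the exact API:

```python
from aymurai.llm_providers.ollama_provider import OllamaLLMProvider

# Hypothetical usage based on the class diagram; exact argument names may differ.
provider = OllamaLLMProvider(model="llama3", system_prompt="You are a helpful assistant.")

# Split a long document into chunks that fit the model's context window.
chunks = provider.chunk_text("some long text ...")

# Synchronous generation returns an LLMResponse with text, metadata, and the raw payload.
response = provider.generate("Summarize the document in one sentence.")
print(response.text, response.metadata)

# Streaming yields partial LLMResponse objects as they arrive.
for piece in provider.stream("Summarize the document in one sentence."):
    print(piece.text, end="", flush=True)
```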
Hey there - I've reviewed your changes - here's some feedback:
- Consider moving the Jupyter notebook out of the core package (e.g., into an examples or docs folder) so it isn’t shipped with the production library.
- It may be better to warn or require an explicit tokenizer instead of silently falling back to whitespace splitting, as that can lead to inaccurate token counts for many models (see the sketch after this list).
- Add a unit test to verify that `_build_messages` raises a ValueError when both `prompt` and `messages` are None to cover the error path.
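A minimal sketch of the "warn on fallback" option, assuming `count_tokens` simply measures the output of `_tokenize` (that body is not shown in this review, so treat the snippet as illustrative only):

```python
import warnings


def count_tokens(self, text: str) -> int:
    # Method body sketch for LLMProvider.count_tokens: warn when no tokenizer
    # is configured instead of silently falling back to whitespace splitting.
    if self._tokenizer is None:
        warnings.warn(
            f"No tokenizer configured for model {self.model_name!r}; "
            "falling back to whitespace splitting, so token counts may be inaccurate.",
            UserWarning,
            stacklevel=2,
        )
    return len(self._tokenize(text))
```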
## Individual Comments
### Comment 1
<location> `test/llm_providers/test_providers.py:39-46` </location>
<code_context>
+        return self._tokenizer
+
+
+class BaseProviderTests(unittest.TestCase):
+    def test_chunk_text_respects_token_limit(self):
+        provider = DummyProvider(max_context_tokens=4, chunk_overlap=1)
+        text = "uno dos tres cuatro cinco seis"
</code_context>
<issue_to_address>
**suggestion (testing):** Missing tests for edge cases in chunk_text (empty string, no words, overlap > token limit).
Please add tests for chunk_text covering empty input, no words, overlap exceeding token limit, and max_context_tokens set to None to ensure all edge cases are handled.
```suggestion
class BaseProviderTests(unittest.TestCase):
    def test_chunk_text_respects_token_limit(self):
        provider = DummyProvider(max_context_tokens=4, chunk_overlap=1)
        text = "uno dos tres cuatro cinco seis"
        chunks = provider.chunk_text(text)
        self.assertGreater(len(chunks), 1)
        for chunk in chunks:
            self.assertLessEqual(chunk.token_count, 4)

    def test_chunk_text_empty_string(self):
        provider = DummyProvider(max_context_tokens=4, chunk_overlap=1)
        text = ""
        chunks = provider.chunk_text(text)
        self.assertEqual(chunks, [])

    def test_chunk_text_whitespace_only(self):
        provider = DummyProvider(max_context_tokens=4, chunk_overlap=1)
        text = " "
        chunks = provider.chunk_text(text)
        self.assertEqual(chunks, [])

    def test_chunk_text_overlap_exceeds_token_limit(self):
        provider = DummyProvider(max_context_tokens=2, chunk_overlap=3)
        text = "one two three"
        chunks = provider.chunk_text(text)
        # Should not fail, and should chunk correctly
        for chunk in chunks:
            self.assertLessEqual(chunk.token_count, 2)

    def test_chunk_text_max_context_tokens_none(self):
        provider = DummyProvider(max_context_tokens=None, chunk_overlap=1)
        text = "one two three four"
        chunks = provider.chunk_text(text)
        # Should return a single chunk with all tokens
        self.assertEqual(len(chunks), 1)
        self.assertEqual(chunks[0].token_count, 4)
```
</issue_to_address>
### Comment 2
<location> `test/llm_providers/test_providers.py:49-50` </location>
<code_context>
+            self.assertLessEqual(chunk.token_count, 4)
+
+
+class OllamaProviderTests(unittest.TestCase):
+    def test_generate_builds_messages(self):
+        provider = OllamaLLMProvider(model="llama3", system_prompt="Sistema")
+        fake_response = {"message": {"content": "respuesta"}, "eval_count": 10}
</code_context>
<issue_to_address>
**suggestion (testing):** No test for error handling in OllamaLLMProvider._build_messages.
Add a test to confirm that OllamaLLMProvider._build_messages raises ValueError when both prompt and messages are missing.
</issue_to_address>
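A minimal sketch of such a test (it assumes `_build_messages` accepts `prompt` and `messages` keyword arguments, as suggested by the class diagram above):

```python
import unittest

from aymurai.llm_providers.ollama_provider import OllamaLLMProvider


class BuildMessagesErrorTests(unittest.TestCase):
    def test_build_messages_requires_prompt_or_messages(self):
        provider = OllamaLLMProvider(model="llama3")
        # Both prompt and messages omitted: the error path this comment asks to cover.
        with self.assertRaises(ValueError):
            provider._build_messages(prompt=None, messages=None)


if __name__ == "__main__":
    unittest.main()
```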
### Comment 3
<location> `test/llm_providers/test_providers.py:62-76` </location>
<code_context>
+        self.assertEqual(kwargs["model"], "llama3")
+        self.assertEqual(kwargs["messages"][0]["role"], "system")
+
+    def test_stream_yields_chunks(self):
+        provider = OllamaLLMProvider(model="llama3")
+        fake_chunks = iter(
+            [
+                {"message": {"content": "Hola"}, "done": False},
+                {"message": {"content": " Mundo"}, "done": True},
+            ]
+        )
+        with patch("aymurai.llm_providers.ollama_provider.ollama.chat") as mock_chat:
+            mock_chat.return_value = fake_chunks
+            pieces = list(provider.stream("Hola"))
+
+        self.assertEqual([piece.text for piece in pieces], ["Hola", " Mundo"])
+        _, kwargs = mock_chat.call_args
+        self.assertTrue(kwargs["stream"])
+
+
</code_context>
<issue_to_address>
**suggestion (testing):** No test for OllamaLLMProvider with system_prompt in stream method.
Please add a test to verify that when system_prompt is set, the stream method includes the system message in the payload, similar to the generate method.
```suggestion
    def test_stream_yields_chunks(self):
        provider = OllamaLLMProvider(model="llama3")
        fake_chunks = iter(
            [
                {"message": {"content": "Hola"}, "done": False},
                {"message": {"content": " Mundo"}, "done": True},
            ]
        )
        with patch("aymurai.llm_providers.ollama_provider.ollama.chat") as mock_chat:
            mock_chat.return_value = fake_chunks
            pieces = list(provider.stream("Hola"))

        self.assertEqual([piece.text for piece in pieces], ["Hola", " Mundo"])
        _, kwargs = mock_chat.call_args
        self.assertTrue(kwargs["stream"])

    def test_stream_includes_system_prompt(self):
        system_prompt = "You are a helpful assistant."
        provider = OllamaLLMProvider(model="llama3", system_prompt=system_prompt)
        fake_chunks = iter(
            [
                {"message": {"content": "Hola"}, "done": False},
                {"message": {"content": " Mundo"}, "done": True},
            ]
        )
        with patch("aymurai.llm_providers.ollama_provider.ollama.chat") as mock_chat:
            mock_chat.return_value = fake_chunks
            pieces = list(provider.stream("Hola"))

        # Check that the system message is included in the payload
        _, kwargs = mock_chat.call_args
        self.assertEqual(kwargs["messages"][0]["role"], "system")
        self.assertEqual(kwargs["messages"][0]["content"], system_prompt)
```
</issue_to_address>
### Comment 4
<location> `test/llm_providers/test_providers.py:45-46` </location>
<code_context>
</code_context>
<issue_to_address>
**issue (code-quality):** Avoid loops in tests. ([`no-loop-in-tests`](https://docs.sourcery.ai/Reference/Rules-and-In-Line-Suggestions/Python/Default-Rules/no-loop-in-tests))
<details><summary>Explanation</summary>Avoid complex code, like loops, in test functions.
Google's software engineering guidelines says:
"Clear tests are trivially correct upon inspection"
To reach that avoid complex code in tests:
* loops
* conditionals
Some ways to fix this:
* Use parametrized tests to get rid of the loop.
* Move the complex logic into helpers.
* Move the complex part into pytest fixtures.
> Complexity is most often introduced in the form of logic. Logic is defined via the imperative parts of programming languages such as operators, loops, and conditionals. When a piece of code contains logic, you need to do a bit of mental computation to determine its result instead of just reading it off of the screen. It doesn't take much logic to make a test more difficult to reason about.
Software Engineering at Google / [Don't Put Logic in Tests](https://abseil.io/resources/swe-book/html/ch12.html#donapostrophet_put_logic_in_tests)
</details>
</issue_to_address>
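As a concrete illustration, the flagged loop could be collapsed into a single assertion while staying within `unittest` (a sketch of the test method only; `DummyProvider` is the helper defined in the test module under review):

```python
    def test_chunk_text_respects_token_limit(self):
        provider = DummyProvider(max_context_tokens=4, chunk_overlap=1)
        chunks = provider.chunk_text("uno dos tres cuatro cinco seis")
        self.assertGreater(len(chunks), 1)
        # One assertion over all chunks instead of a loop of per-chunk assertions.
        self.assertLessEqual(max(chunk.token_count for chunk in chunks), 4)
```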
### Comment 5
<location> `aymurai/llm_providers/provider.py:198-201` </location>
<code_context>
    def _tokenize(self, text: str) -> Sequence[Any]:
        """
        Tokenize text using the configured tokenizer or fallback to whitespace.

        Args:
            text (str): The input text to tokenize.

        Returns:
            Sequence[Any]: The sequence of tokens.
        """
        if self._tokenizer is None:
            return text.split()
        tokenizer = self._tokenizer
        if hasattr(tokenizer, "encode"):
            return tokenizer.encode(text, add_special_tokens=False)
        if callable(tokenizer):
            tokens = tokenizer(text)
            if isinstance(tokens, dict):
                return tokens.get("input_ids", [])
            return tokens
        return text.split()
</code_context>
<issue_to_address>
**suggestion (code-quality):** We've found these issues:
- Lift code into else after jump in control flow ([`reintroduce-else`](https://docs.sourcery.ai/Reference/Default-Rules/refactorings/reintroduce-else/))
- Replace if statement with if expression ([`assign-if-exp`](https://docs.sourcery.ai/Reference/Default-Rules/refactorings/assign-if-exp/))
```suggestion
            return tokens.get("input_ids", []) if isinstance(tokens, dict) else tokens
```
</issue_to_address>
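For reference, `_tokenize` with the suggested change applied would read roughly as follows (the code_context above with the one-line replacement folded in; docstring elided):

```python
    def _tokenize(self, text: str) -> Sequence[Any]:
        if self._tokenizer is None:
            return text.split()
        tokenizer = self._tokenizer
        if hasattr(tokenizer, "encode"):
            return tokenizer.encode(text, add_special_tokens=False)
        if callable(tokenizer):
            tokens = tokenizer(text)
            # Suggested simplification: single expression instead of the if/return pair.
            return tokens.get("input_ids", []) if isinstance(tokens, dict) else tokens
        return text.split()
```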
Pull Request Overview
This PR introduces a flexible abstraction layer for integrating multiple Large Language Model (LLM) providers into the aymurai codebase. The implementation provides a standardized interface through a base LLMProvider class with concrete support for Ollama, along with utilities for text chunking, token counting, and streaming responses.
Key changes:
- New `LLMProvider` abstract base class defining a standard interface for LLM interactions
- `OllamaLLMProvider` implementation supporting both standard and streaming generation
- Comprehensive unit tests with mocking for both base and Ollama functionality
Reviewed Changes
Copilot reviewed 5 out of 6 changed files in this pull request and generated 3 comments.
| File | Description |
|---|---|
| `aymurai/llm_providers/provider.py` | Base provider abstraction with token counting, chunking logic, and response models |
| `aymurai/llm_providers/ollama_provider.py` | Ollama-specific implementation with message building and streaming support |
| `aymurai/llm_providers/__init__.py` | Module exports for public API |
| `test/llm_providers/test_providers.py` | Unit tests for base provider and Ollama implementation |
| `test/llm_providers/__init__.py` | Empty init file for test module |
| `notebooks/experiments/llm-providers/01-ollama-provider.ipynb` | Usage examples demonstrating chunking, generation, and streaming |
| """ | ||
| pass | ||
|
|
||
| @abc.abstractmethod |
There was a problem hiding this comment.
The `stream` method is decorated with `@abc.abstractmethod` but provides a default implementation that raises `NotImplementedError`. This is inconsistent: abstract methods should not have implementations. Either remove the decorator to make it a concrete method with optional override, or remove the default implementation entirely and require subclasses to implement it.
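A minimal sketch of the second option (keep the decorator, drop the default body); the signature follows the class diagram above and is an assumption, not the file's exact code:

```python
import abc
from collections.abc import Iterator
from typing import Any


class LLMProvider(abc.ABC):
    @abc.abstractmethod
    def stream(self, prompt: str, **kwargs: Any) -> Iterator["LLMResponse"]:
        """Yield partial LLMResponse objects for the given prompt."""
        ...
```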
Missing return statement in the `tokenizer` property. The method should return `self._tokenizer` but currently returns `None` implicitly. This would cause any tests using `FakePipeline.tokenizer` to fail.
… for consistency and improved readability
Pull Request Overview
Copilot reviewed 5 out of 6 changed files in this pull request and generated 1 comment.
…amline client instantiation
This pull request introduces a new abstraction layer for Large Language Model (LLM) providers, adds an implementation for the Ollama provider, and provides both documentation and tests for the new functionality. The changes establish a standardized interface for integrating different LLM backends, making it easier to extend and maintain the codebase.
LLM Provider Abstraction
- New abstract base class `LLMProvider` in `provider.py` that defines a standard interface for LLM providers, including methods for text generation, streaming, token counting, and chunking text. Also defines the `LLMResponse` and `DocumentChunk` data models.
- Updated `__init__.py` to expose the new provider classes and types in the public API.

Ollama Provider Implementation

- New `OllamaLLMProvider` class in `ollama_provider.py`, which implements the `LLMProvider` interface, allowing interaction with Ollama's chat API for both standard and streaming responses. Handles system prompts, message formatting, and response parsing.

Documentation and Usage Example

- Example notebook (`01-ollama-provider.ipynb`) demonstrating how to use the new `OllamaLLMProvider`, including chunking, generation, and streaming features.

Testing

- Unit tests in `test_providers.py` covering both the base provider functionality and the Ollama provider, using dummy/fake classes and mocks to ensure correct behavior.

Summary by Sourcery
Introduce a standardized LLMProvider interface, implement an Ollama-based provider, and include accompanying documentation and tests
New Features:
Enhancements:
Documentation:
Tests: