
Feat/ollama integration #55

Merged
JeanKaddour merged 10 commits into main from feat/ollama-integration
Dec 19, 2024

Conversation

@srijanpatel (Collaborator) commented Dec 19, 2024

Add support for models hosted by ollama

This PR adds support for connecting self-hosted models served by Ollama to PySpur. The Ollama endpoint must be specified in the .env file before the PySpur service is launched.
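As an illustration, the .env entry might look like the following (OLLAMA_BASE_URL is an assumed variable name; consult the updated .env.example in this PR for the actual key):

```shell
# Assumed variable name; check .env.example for the real key.
# 11434 is Ollama's default port.
OLLAMA_BASE_URL=http://localhost:11434
```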

Summary of changes:


Important

Integrates Ollama models, centralizes environment management, updates key management APIs, and adjusts Docker and frontend for these changes.

  • Ollama Integration:
    • Added support for Ollama models using the ollama Python SDK in llm_utils.py.
    • Introduced OllamaOptions class for API call options.
    • Added ollama_with_backoff() function for API calls with retry logic.
  • Environment Management:
    • Centralized environment variable management in key_management.py.
    • Updated .env.example with Ollama configuration.
  • Key Management API:
    • Refactored key management in key_management.py to use MODEL_PROVIDER_KEYS.
    • Updated API endpoints to handle Ollama keys.
  • Docker and Entrypoint:
    • Modified docker-compose.yml to include .env file and extra hosts.
    • Added test_ollama.sh script to verify Ollama connection in entrypoint.sh.
  • Frontend Changes:
    • Updated SettingsModal.tsx and api.ts to handle API keys more effectively.
    • Adjusted API calls to reflect backend changes.
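The body of ollama_with_backoff() is not shown on this page; as a rough sketch of the retry pattern it describes, a generic exponential-backoff wrapper could look like the following (the names with_backoff, initial_wait, and max_wait are illustrative, echoing the docstring parameters mentioned later in the review):

```python
import random
import time
from typing import Callable, TypeVar

T = TypeVar("T")

def with_backoff(
    fn: Callable[[], T],
    max_retries: int = 3,      # the PR later bumps retries from 1 to 3
    initial_wait: float = 1.0,
    max_wait: float = 10.0,
) -> T:
    """Call fn(), retrying on failure with exponential backoff and jitter."""
    for attempt in range(max_retries):
        try:
            return fn()
        except Exception:
            if attempt == max_retries - 1:
                raise  # out of retries: surface the last error
            wait = min(max_wait, initial_wait * (2 ** attempt))
            time.sleep(wait + random.uniform(0, 0.1 * wait))
    raise AssertionError("unreachable")
```

An actual call would then be wrapped as `with_backoff(lambda: client.chat(model=..., messages=...))`, where `client` is an `ollama.Client` pointed at the configured endpoint.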

This description was created by Ellipsis for 8a5f886. It will automatically update as commits are pushed.
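The docker-compose.yml changes described above (env file plus extra hosts) typically take this shape; the service name is illustrative and not the PR's exact diff:

```yaml
services:
  backend:
    env_file:
      - .env            # makes the Ollama settings visible inside the container
    extra_hosts:
      # lets the container reach an Ollama server running on the Docker host
      - "host.docker.internal:host-gateway"
```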

@ellipsis-dev (bot, Contributor) left a comment


❌ Changes requested. Reviewed everything up to 7e44fa5 in 2 minutes and 6 seconds

More details
  • Looked at 448 lines of code in 10 files
  • Skipped 0 files when reviewing.
  • Skipped posting 5 draft comments based on config settings.
1. backend/app/nodes/llm/llm_utils.py:19
  • Draft comment:
    Ensure litellm.set_verbose=True is a valid usage. If set_verbose is not a valid attribute or method, this line should be removed or corrected.
  • Reason this comment was not posted:
    Comment did not seem useful.
2. backend/app/nodes/llm/llm_utils.py:20
  • Draft comment:
    load_dotenv() is called multiple times across different files. Consider centralizing this call to avoid redundancy.
  • Reason this comment was not posted:
    Confidence changes required: 50%
    The load_dotenv() function is called multiple times across different files. This is unnecessary and can be centralized to avoid redundancy.
3. backend/app/nodes/llm/llm_utils.py:193
  • Draft comment:
    Ensure consistent handling of api_base. Check for non-empty values before using it to avoid unexpected behavior.
  • Reason this comment was not posted:
    Comment did not seem useful.
4. backend/app/nodes/llm/single_llm_call.py:17
  • Draft comment:
    load_dotenv() is called multiple times across different files. Consider centralizing this call to avoid redundancy.
  • Reason this comment was not posted:
    Confidence changes required: 50%
    The load_dotenv() function is called multiple times across different files. This is unnecessary and can be centralized to avoid redundancy.
5. frontend/src/utils/api.ts:258
  • Draft comment:
    Update listApiKeys to return response.data instead of response.data.keys to match the backend response format.
  • Reason this comment was not posted:
    Comment looked like it was already resolved.

Workflow ID: wflow_bC91faIyjfRF7mCK


Want Ellipsis to fix these issues? Tag @ellipsis-dev in a comment. You can customize Ellipsis with 👍 / 👎 feedback, review rules, user-specific overrides, quiet mode, and more.

Comment thread: backend/app/nodes/llm/llm_utils.py
@ellipsis-dev (bot, Contributor) left a comment


👍 Looks good to me! Incremental review on ac6d9c5 in 31 seconds

More details
  • Looked at 13 lines of code in 1 file
  • Skipped 0 files when reviewing.
  • Skipped posting 1 draft comment based on config settings.
1. backend/app/nodes/llm/llm_utils.py:285
  • Draft comment:
    The docstring mentions parameters (response_model, max_retries, initial_wait, max_wait) that are not in the function signature. Please update the docstring to reflect the actual parameters.
  • Reason this comment was not posted:
    Comment was on unchanged code.

Workflow ID: wflow_0zfuDA4MNdNL9smj


You can customize Ellipsis with 👍 / 👎 feedback, review rules, user-specific overrides, quiet mode, and more.

@srijanpatel added the enhancement (New feature or request) label Dec 19, 2024
@ellipsis-dev (bot, Contributor) left a comment


👍 Looks good to me! Incremental review on 162192c in 23 seconds

More details
  • Looked at 13 lines of code in 1 file
  • Skipped 0 files when reviewing.
  • Skipped posting 1 draft comment based on config settings.
1. backend/app/nodes/llm/llm_utils.py:284
  • Draft comment:
    The retry attempts have been increased from 1 to 3, which aligns with the retry logic used in other functions like completion_with_backoff. This change should improve reliability.
  • Reason this comment was not posted:
    Confidence changes required: 0%
    The change in retry attempts from 1 to 3 in the ollama_with_backoff function is consistent with the retry logic used in other functions like completion_with_backoff. This change is likely intended to improve reliability.

Workflow ID: wflow_UdbByDj9hSNBQ7bb


You can customize Ellipsis with 👍 / 👎 feedback, review rules, user-specific overrides, quiet mode, and more.

@srijanpatel linked an issue Dec 19, 2024 that may be closed by this pull request
@ellipsis-dev (bot, Contributor) left a comment


👍 Looks good to me! Incremental review on 8a5f886 in 12 seconds

More details
  • Looked at 55 lines of code in 1 file
  • Skipped 0 files when reviewing.
  • Skipped posting 0 draft comments based on config settings.

Workflow ID: wflow_IRKFF4AsKwrmZ3sC


You can customize Ellipsis with 👍 / 👎 feedback, review rules, user-specific overrides, quiet mode, and more.

@JeanKaddour merged commit b98a70e into main Dec 19, 2024
@srijanpatel deleted the feat/ollama-integration branch on February 7, 2025

Labels

enhancement New feature or request

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Ollama Support for Local LLM Inferencing

2 participants