Add Local Ollama Model Support to Langchain Handler #8978

tmichaeldb · 2024-03-22T01:20:09Z

Description

Fixes RAG-54

Will follow this up with another PR for Langchain Refactor (link)

This allows users to use their own local Ollama models with Langchain. See here for a list of all supported models.

It's important to note that many local Ollama models (i.e. mistral) are substantially slower than using other LLM APIs. Depending on the specs of the machine, you can expect anywhere from 4-10+ output tokens/second. For higher tokens/second output, users can experiment with smaller models

Type of change

(Please delete options that are not relevant)

⚡ New feature (non-breaking change which adds functionality)
📄 This change requires a documentation update

Verification Process

To ensure the changes are working as expected:

Test Location: ./tests/unit/ml_handlers/test_langchain.py
Verification Steps:
1. pip install ollama to install Ollama
2. ollama pull mistral to download the mistral model locally.
3. Create & Query Langchain model

CREATE MODEL local_mistral_model
    PREDICT answer
    USING
        engine = 'langchain',
        model_name = 'mistral',
        user_column = 'question',
        assistant_column = 'answer',
        mode = 'conversational',
        prompt_template = 'You are a helpful assistant. Make sure to NEVER capitalize your letters. Always reply in all lowercase no matter what. Here is the user input: {{question}}';

SELECT answer FROM local_mistral_model WHERE question = 'What is your name?';

Additional Media:

I have attached a brief loom video or screenshots showcasing the new functionality or change.

Checklist:

My code follows the style guidelines(PEP 8) of MindsDB.
I have appropriately commented on my code, especially in complex areas.
Necessary documentation updates are either made or tracked in issues.
Relevant unit and integration tests are updated or added.

tmichaeldb · 2024-03-22T01:22:48Z

cc @QuantumPlumber

dusvyat

LGTM. Not a blocker for this merge but it would be good to use pydantic for user input parameters in future

tmichaeldb added 3 commits March 21, 2024 18:02

Don't include mindsdb_read tool & wikipedia by default

50900d2

Add support for local Ollama models

8be3ce9

Updated langchain requirements

25d6b6f

tmichaeldb self-assigned this Mar 22, 2024

tmichaeldb requested a review from dusvyat March 22, 2024 01:22

Updated requirements for testing

7b6a530

tmichaeldb requested a review from dylanketterer March 22, 2024 01:57

dusvyat approved these changes Mar 22, 2024

View reviewed changes

tmichaeldb merged commit 7fc6008 into staging Mar 22, 2024
12 checks passed

StpMax mentioned this pull request Mar 26, 2024

Release v24.3.5.0 #8994

Merged

hamishfagg deleted the langchain-refactor branch June 10, 2024 21:31

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add Local Ollama Model Support to Langchain Handler #8978

Add Local Ollama Model Support to Langchain Handler #8978

tmichaeldb commented Mar 22, 2024 •

edited

Loading

tmichaeldb commented Mar 22, 2024 •

edited

Loading

dusvyat left a comment

Add Local Ollama Model Support to Langchain Handler #8978

Add Local Ollama Model Support to Langchain Handler #8978

Conversation

tmichaeldb commented Mar 22, 2024 • edited Loading

Description

Type of change

Verification Process

Additional Media:

Checklist:

tmichaeldb commented Mar 22, 2024 • edited Loading

dusvyat left a comment

Choose a reason for hiding this comment

tmichaeldb commented Mar 22, 2024 •

edited

Loading

tmichaeldb commented Mar 22, 2024 •

edited

Loading