Allow users to govern the ollama token context size #121803

Closed
wants to merge 5 commits

Conversation


@Rudd-O Rudd-O commented Jul 11, 2024

Proposed change

Allows the user of the Ollama integration to specify the context token size (num_ctx), so that models with a small default num_ctx can still be made to work with Home Assistant Assist.
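For illustration, a minimal sketch of how a configured num_ctx could be forwarded to Ollama, assuming the ollama Python client's AsyncClient.chat call and its Options type; the helper function and its arguments are illustrative, not the integration's exact code:

import ollama

async def chat_with_context(
    client: ollama.AsyncClient,
    model: str,
    messages: list[dict],
    num_ctx: int | None,
):
    # Only build an Options payload when a context size was configured;
    # otherwise Ollama keeps the model's own default (typically 2048).
    options = ollama.Options(num_ctx=num_ctx) if num_ctx is not None else None
    return await client.chat(model=model, messages=messages, options=options)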

Type of change

  • Dependency upgrade
  • Bugfix (non-breaking change which fixes an issue)
  • New integration (thank you!)
  • New feature (which adds functionality to an existing integration)
  • Deprecation (breaking change to happen in the future)
  • Breaking change (fix/feature causing existing functionality to break)
  • Code quality improvements to existing code or addition of tests

Additional information

Checklist

  • The code change is tested and works locally.
  • Local tests pass. Your PR cannot be merged unless tests pass
  • There is no commented out code in this PR.
  • I have followed the development checklist
  • I have followed the perfect PR recommendations
  • The code has been formatted using Ruff (ruff format homeassistant tests)
  • Tests have been added to verify that the new code works.

If user exposed functionality or configuration variables are added/changed:

If the code communicates with devices, web services, or third-party tools:

  • The manifest file has all fields filled out correctly.
    Updated and included derived files by running: python3 -m script.hassfest.
  • New or updated dependencies have been added to requirements_all.txt.
    Updated by running python3 -m script.gen_requirements_all.
  • For the updated dependencies - a link to the changelog, or at minimum a diff between library versions is added to the PR description.

To help with the load of incoming pull requests:

@home-assistant

Hey there @synesthesiam, mind taking a look at this pull request as it has been labeled with an integration (ollama) you are listed as a code owner for? Thanks!

Code owner commands

Code owners of ollama can trigger bot actions by commenting:

  • @home-assistant close Closes the pull request.
  • @home-assistant rename Awesome new title Renames the pull request.
  • @home-assistant reopen Reopen the pull request.
  • @home-assistant unassign ollama Removes the current integration label and assignees on the pull request, add the integration domain after the command.
  • @home-assistant add-label needs-more-information Add a label (needs-more-information, problem in dependency, problem in custom component) to the pull request.
  • @home-assistant remove-label needs-more-information Remove a label (needs-more-information, problem in dependency, problem in custom component) on the pull request.

Rudd-O added a commit to Rudd-O/home-assistant.io that referenced this pull request Jul 11, 2024
@MartinHjelmare MartinHjelmare changed the title Allow users to govern the token context size. Allow users to govern the ollama token context size Jul 12, 2024
The default 2048 is useless with even a modest smart home — it causes the system prompt to be completely ignored.
@@ -81,6 +81,11 @@
 CONF_MAX_HISTORY = "max_history"
 DEFAULT_MAX_HISTORY = 20

+CONF_NUM_CTX = "num_ctx"
+DEFAULT_NUM_CTX = 2048
+MAX_NUM_CTX = 65536


Newer models seem to support up to 128K; it might be a good idea to raise the max to an even higher number (unless I'm misunderstanding something).

Author


Perhaps. Mind suggesting a diff / patch in this PR? I would happily increase it.


Suggested change
-MAX_NUM_CTX = 65536
+MAX_NUM_CTX = 131072
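As an aside, a minimal sketch of where such an upper bound would typically be enforced, assuming the option is exposed through a Home Assistant options-flow schema with a NumberSelector; the schema below is illustrative, not this PR's actual config-flow code:

import voluptuous as vol
from homeassistant.helpers.selector import (
    NumberSelector,
    NumberSelectorConfig,
    NumberSelectorMode,
)

# Hypothetical options schema: the selector clamps user input between the
# Ollama default (2048) and the MAX_NUM_CTX constant discussed above.
OPTIONS_SCHEMA = vol.Schema(
    {
        vol.Optional("num_ctx", default=2048): NumberSelector(
            NumberSelectorConfig(
                min=2048, max=131072, step=1, mode=NumberSelectorMode.BOX
            )
        ),
    }
)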

@@ -251,6 +253,11 @@ async def async_process(
             {"messages": message_history.messages},
         )

+        options: ollama.Options | None = None
+        num_ctx = settings.get(CONF_NUM_CTX, DEFAULT_NUM_CTX)
+        if num_ctx != DEFAULT_NUM_CTX:
Contributor

The default context size in Ollama is 2048, so when DEFAULT_NUM_CTX is set to 4096 here, it doesn't actually set a num_ctx; the 4096 is ignored and Ollama still uses 2048.

One option:

  • Don't set DEFAULT_NUM_CTX at all, so that the default is None, and only pass the context size when it's not None (see the sketch after this list).
  • If you're also trying to increase the default context size, then this needs a different approach.
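A minimal sketch of that first option, assuming the configured value is read from the entry settings and only an explicitly set value is forwarded; the helper name and the settings mapping are illustrative, not the integration's exact code:

import ollama

def build_options(settings: dict) -> ollama.Options | None:
    # No default here: num_ctx stays None unless the user explicitly set it,
    # so Ollama's own model default applies when nothing is configured.
    num_ctx = settings.get("num_ctx")
    if num_ctx is None:
        return None
    return ollama.Options(num_ctx=int(num_ctx))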

 },
 "data_description": {
   "prompt": "Instruct how the LLM should respond. This can be a template.",
-  "keep_alive": "Duration in seconds for Ollama to keep model in memory. -1 = indefinite, 0 = never."
+  "keep_alive": "Duration in seconds for Ollama to keep model in memory. -1 = indefinite, 0 = never.",
+  "num_ctx": "Increase this if you have a complex smart home, or the LLM seems to ignore knowingly exposed devices."


Suggested change
-"num_ctx": "Increase this if you have a complex smart home, or the LLM seems to ignore knowingly exposed devices."
+"num_ctx": "Number of tokens a model can process. Higher values allow to handle a larger number of devices."

@frenck
Member

frenck commented Aug 8, 2024

I've marked this PR as draft, since changes are requested that need to be processed.
Please un-draft it once it is ready for review again by clicking the "Ready for review" button.

Thanks! 👍

../Frenck

Learn more about our pull request process.

@frenck frenck marked this pull request as draft August 8, 2024 16:56
@tannisroot

@Rudd-O will you be able to implement the suggested changes to the PR? Thanks

@allenporter
Contributor

Sent #124555, given this seems to have gone unresponsive and I think it needs a prompt fix, since there is somewhat of a regression here. Respectfully closing to handle this in the other PR.

@github-actions github-actions bot locked and limited conversation to collaborators Aug 26, 2024
Development

Successfully merging this pull request may close these issues.

Ollama context token size must be configurable
5 participants