
Fix the maximum context length issue by chunking #3222

Merged
merged 20 commits into from
May 1, 2023

Conversation

kinance
Contributor

@kinance kinance commented Apr 25, 2023

Background

Multiple issues have been opened about the same problem, e.g. #2801, #2871, #2906 and more. Several commands call memory.add(), which in turn calls create_embedding_with_ada; when the input text exceeds the model's 8191-token limit, we get an InvalidRequestError saying "This model's maximum context length is 8191 tokens...".
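To illustrate the failure mode, a guard like the following could check the token count before calling the embedding API. This is a minimal sketch, not the PR's code: the whitespace split is a stand-in for the real tiktoken-based token count, and the function name is hypothetical.

```python
TOKEN_LIMIT = 8191  # context limit quoted in the InvalidRequestError message


def within_limit(text: str, limit: int = TOKEN_LIMIT) -> bool:
    """Return True if `text` fits in the model's context window.

    Whitespace words approximate tokens here; Auto-GPT's real code
    counts tokens with tiktoken, which gives different numbers.
    """
    approx_tokens = len(text.split())  # stand-in token count
    return approx_tokens <= limit
```

Without a check (or the chunking this PR adds), any text over the limit is sent straight to the API and the request is rejected.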

Resolves #2801, resolves #2871, resolves #2906, resolves #3244

Changes

The issue is fixed by chunking the input text, running the embedding on each chunk individually, and then combining the results by weighted averaging. This approach is suggested by OpenAI, and the change is modeled after the OpenAI Cookbook. This PR should fix a number of open issues, including the ones mentioned above and more.
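The approach described above can be sketched as follows. This is a simplified illustration of the Cookbook recipe, not the PR's implementation: `embed` stands in for create_embedding_with_ada, and whitespace words approximate tokens (the real code uses tiktoken).

```python
import math

MAX_CHUNK_TOKENS = 8191  # ada-002 context limit


def chunked(tokens, size):
    """Yield consecutive chunks of at most `size` tokens."""
    for i in range(0, len(tokens), size):
        yield tokens[i:i + size]


def embed_long_text(text, embed, chunk_size=MAX_CHUNK_TOKENS):
    """Embed text of any length by chunking and weighted averaging."""
    tokens = text.split()  # stand-in tokenizer
    chunk_embeddings, weights = [], []
    for chunk in chunked(tokens, chunk_size):
        chunk_embeddings.append(embed(" ".join(chunk)))
        weights.append(len(chunk))  # weight each chunk by its length

    # Length-weighted average across chunks, dimension by dimension
    dim = len(chunk_embeddings[0])
    total = sum(weights)
    avg = [
        sum(e[d] * w for e, w in zip(chunk_embeddings, weights)) / total
        for d in range(dim)
    ]
    # Normalize to unit length, as the Cookbook recipe does
    norm = math.sqrt(sum(x * x for x in avg)) or 1.0
    return [x / norm for x in avg]
```

Weighting by chunk length means a short trailing chunk does not pull the combined vector as strongly as the full-size chunks before it.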

PR Quality Checklist

  • My pull request is atomic and focuses on a single change.
  • I have thoroughly tested my changes with multiple different prompts.
  • I have considered potential risks and mitigations for my changes.
  • I have documented my changes clearly and comprehensively.
  • I have not snuck in any "extra" small tweaks or changes

@codecov

codecov bot commented Apr 25, 2023

Codecov Report

Patch coverage: 86.48% and project coverage change: +0.24 🎉

Comparison is base (0ef6f06) 60.31% compared to head (572cac9) 60.55%.

Additional details and impacted files
@@            Coverage Diff             @@
##           master    #3222      +/-   ##
==========================================
+ Coverage   60.31%   60.55%   +0.24%     
==========================================
  Files          69       69              
  Lines        3152     3184      +32     
  Branches      525      528       +3     
==========================================
+ Hits         1901     1928      +27     
- Misses       1118     1122       +4     
- Partials      133      134       +1     
Impacted Files              Coverage Δ
autogpt/llm/__init__.py     100.00% <ø> (ø)
autogpt/llm/modelsinfo.py   100.00% <ø> (ø)
autogpt/config/config.py     76.25% <66.66%> (-0.58%) ⬇️
autogpt/llm/llm_utils.py     66.66% <92.85%> (+5.34%) ⬆️

☔ View full report in Codecov by Sentry.

@kinance kinance requested a review from Pwuts April 25, 2023 15:28
@kinance
Contributor Author

kinance commented Apr 25, 2023

@Pwuts I think this change can fix and close multiple open issues. Could you please review, approve and merge?

@Pwuts
Member

Pwuts commented Apr 25, 2023

Please link issues if this PR resolves them

@Pwuts
Member

Pwuts commented Apr 25, 2023

Also, this is missing test coverage. Can you fix that (using pytest, not unittest)?

Member

@Pwuts Pwuts left a comment


Please add unit tests using pytest
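The requested pytest coverage could look something like this. These are illustrative tests against a hypothetical `chunked` helper that mirrors the PR's token-splitting logic; the real function name and signature in llm_utils.py may differ.

```python
def chunked(tokens, size):
    """Split a token list into consecutive chunks of at most `size`."""
    return [tokens[i:i + size] for i in range(0, len(tokens), size)]


def test_chunked_splits_evenly():
    assert chunked([1, 2, 3, 4], 2) == [[1, 2], [3, 4]]


def test_chunked_keeps_remainder():
    # The last chunk may be shorter than `size`
    assert chunked([1, 2, 3, 4, 5], 2) == [[1, 2], [3, 4], [5]]


def test_chunked_handles_empty_input():
    assert chunked([], 3) == []
```

Plain functions with `test_` prefixes are all pytest needs, so no unittest.TestCase boilerplate is required.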

@github-actions github-actions bot added the conflicts Automatically applied to PRs with merge conflicts label Apr 25, 2023
@github-actions
Contributor

This pull request has conflicts with the base branch, please resolve those so we can evaluate the pull request.

@GoMightyAlgorythmGo

GoMightyAlgorythmGo commented Apr 25, 2023

Endless crashes — a lot over the last 4 days, though happening less often than 2-3 weeks ago. It crashed 4 times in a row, and constantly for 3 hours on every restart. Here is some code to cap the max length for GPT-3.5-turbo, because the max is about 8191 tokens, so to be safe, staying under 24000 seems to be fine most of the time. Here is the code:

#3239 (comment)
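The capping workaround described above amounts to truncating the input to a fixed budget before sending it to the model. A rough sketch, with whitespace words standing in for real token counting (the linked code and Auto-GPT itself use tiktoken):

```python
MAX_TOKENS = 8191  # approximate context limit mentioned above


def cap_text(text: str, max_tokens: int = MAX_TOKENS) -> str:
    """Truncate `text` so it stays within a fixed token budget.

    Truncation silently drops the tail of the input, which is why
    this PR's chunk-and-average approach is preferable for embeddings.
    """
    words = text.split()
    if len(words) <= max_tokens:
        return text
    return " ".join(words[:max_tokens])
```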


Add basic unit test for the new chunked func

@github-actions
Contributor

Conflicts have been resolved! 🎉 A maintainer will review the pull request shortly.

@github-actions github-actions bot removed the conflicts Automatically applied to PRs with merge conflicts label Apr 26, 2023
@kinance
Contributor Author

kinance commented Apr 26, 2023

Linked the issues that this PR fixes and added a unit test for the new chunk-token function

@github-actions github-actions bot added the conflicts Automatically applied to PRs with merge conflicts label Apr 26, 2023
@github-actions
Contributor

This pull request has conflicts with the base branch, please resolve those so we can evaluate the pull request.

sidewaysthought added a commit to sidewaysthought/Auto-GPT that referenced this pull request Apr 26, 2023
sidewaysthought added a commit to sidewaysthought/Auto-GPT that referenced this pull request Apr 27, 2023
@github-actions github-actions bot removed the conflicts Automatically applied to PRs with merge conflicts label Apr 27, 2023
Pwuts
Pwuts previously approved these changes May 1, 2023
Member

@Pwuts Pwuts left a comment


Best we can do for now; we'll have to iterate on this when reworking the memory system

waynehamadi
waynehamadi previously approved these changes May 1, 2023
tests/integration/conftest.py Outdated Show resolved Hide resolved
@Pwuts Pwuts dismissed stale reviews from waynehamadi and themself via 82c5ae0 May 1, 2023 17:45
@github-actions
Contributor

github-actions bot commented May 1, 2023

This PR exceeds the recommended size of 200 lines. Please make sure you are NOT addressing multiple issues with one PR. Note this PR might be rejected due to its size

richbeales
richbeales previously approved these changes May 1, 2023

@Pwuts Pwuts merged commit 4767fe6 into Significant-Gravitas:master May 1, 2023
11 checks passed
@arrfonseca

I've committed the changes to the master branch locally.
I still get the token length error when reading a long .txt file. Nevertheless, I noticed it performed an extra step before crashing and displayed all the text in the terminal window... Maybe it's related to the limit of 200 lines?

/lib/python3.10/site-packages/openai/api_requestor.py", line 682, in _interpret_response_line
raise self.handle_error_response(
openai.error.InvalidRequestError: This model's maximum context length is 4097 tokens. However, your messages resulted in 308617 tokens. Please reduce the length of the messages.

@kinance
Contributor Author

kinance commented May 1, 2023

@arrfonseca could you paste your call stack and the steps to reproduce? This PR only fixed the max-length issue for the embeddings around memory.add(), which multiple commands use.

@arrfonseca

arrfonseca commented May 1, 2023

I'm asking the AI to make a screenplay in 5 parts of 15 minutes each, based on a textbook. I have the text in .txt format.
The traceback is as follows.

Traceback (most recent call last):
File "/Users/alessandro/anaconda3/envs/ale-gpt/lib/python3.10/runpy.py", line 196, in _run_module_as_main
return _run_code(code, main_globals, None,
File "/Users/alessandro/anaconda3/envs/ale-gpt/lib/python3.10/runpy.py", line 86, in _run_code
exec(code, run_globals)
File "/Users/alessandro/Auto-GPT/autogpt/main.py", line 5, in <module>
autogpt.cli.main()
File "/Users/alessandro/anaconda3/envs/ale-gpt/lib/python3.10/site-packages/click/core.py", line 1130, in __call__
return self.main(*args, **kwargs)
File "/Users/alessandro/anaconda3/envs/ale-gpt/lib/python3.10/site-packages/click/core.py", line 1055, in main
rv = self.invoke(ctx)
File "/Users/alessandro/anaconda3/envs/ale-gpt/lib/python3.10/site-packages/click/core.py", line 1635, in invoke
rv = super().invoke(ctx)
File "/Users/alessandro/anaconda3/envs/ale-gpt/lib/python3.10/site-packages/click/core.py", line 1404, in invoke
return ctx.invoke(self.callback, **ctx.params)
File "/Users/alessandro/anaconda3/envs/ale-gpt/lib/python3.10/site-packages/click/core.py", line 760, in invoke
return __callback(*args, **kwargs)
File "/Users/alessandro/anaconda3/envs/ale-gpt/lib/python3.10/site-packages/click/decorators.py", line 26, in new_func
return f(get_current_context(), *args, **kwargs)
File "/Users/alessandro/Auto-GPT/autogpt/cli.py", line 90, in main
run_auto_gpt(
File "/Users/alessandro/Auto-GPT/autogpt/main.py", line 157, in run_auto_gpt
agent.start_interaction_loop()
File "/Users/alessandro/Auto-GPT/autogpt/agent/agent.py", line 93, in start_interaction_loop
assistant_reply = chat_with_ai(
File "/Users/alessandro/Auto-GPT/autogpt/llm/chat.py", line 166, in chat_with_ai
agent.summary_memory = update_running_summary(
File "/Users/alessandro/Auto-GPT/autogpt/memory_management/summary_memory.py", line 114, in update_running_summary
current_memory = create_chat_completion(messages, cfg.fast_llm_model)
File "/Users/alessandro/Auto-GPT/autogpt/llm/llm_utils.py", line 166, in create_chat_completion
response = api_manager.create_chat_completion(
File "/Users/alessandro/Auto-GPT/autogpt/llm/api_manager.py", line 55, in create_chat_completion
response = openai.ChatCompletion.create(
File "/Users/alessandro/anaconda3/envs/ale-gpt/lib/python3.10/site-packages/openai/api_resources/chat_completion.py", line 25, in create
return super().create(*args, **kwargs)
File "/Users/alessandro/anaconda3/envs/ale-gpt/lib/python3.10/site-packages/openai/api_resources/abstract/engine_api_resource.py", line 153, in create
response, _, api_key = requestor.request(
File "/Users/alessandro/anaconda3/envs/ale-gpt/lib/python3.10/site-packages/openai/api_requestor.py", line 226, in request
resp, got_stream = self._interpret_response(result, stream)
File "/Users/alessandro/anaconda3/envs/ale-gpt/lib/python3.10/site-packages/openai/api_requestor.py", line 619, in _interpret_response
self._interpret_response_line(
File "/Users/alessandro/anaconda3/envs/ale-gpt/lib/python3.10/site-packages/openai/api_requestor.py", line 682, in _interpret_response_line
raise self.handle_error_response(
openai.error.InvalidRequestError: This model's maximum context length is 4097 tokens. However, your messages resulted in 308617 tokens. Please reduce the length of the messages.

Before that the call, if I get this right, was:
THOUGHTS: I need to start by analyzing the text in /Users/alessandro/Auto-GPT/auto_gpt_workspace/text-noam.txt to get a better understanding of the book 'Manufacturing Consent' by Noam Chomsky. I should also do some research on the internet to gather more information about the book and its themes. Once I have a good understanding of the book, I can start planning the structure of the documentary and writing the scripts for each episode.
REASONING: Analyzing the text and doing research will give me the necessary information to create a well-informed documentary.
PLAN:

  • Analyze the text in /Users/alessandro/Auto-GPT/auto_gpt_workspace/text-noam.txt
  • Do research on the internet to gather more information
  • Plan the structure of the documentary
  • Write the scripts for each episode
CRITICISM: I need to make sure that I am thorough in my research and that I am accurately representing the themes of the book in the documentary.
Error:
Attempted to access absolute path '/Users/alessandro/Auto-GPT/auto_gpt_workspace/text-noam.txt' in workspace '/Users/alessandro/Auto-GPT/autogpt/auto_gpt_workspace'.
NEXT ACTION: COMMAND = read_file ARGUMENTS = {'filename': '/Users/alessandro/Auto-GPT/auto_gpt_workspace/text-noam.txt'}
Enter 'y' to authorise command, 'y -N' to run N continuous commands, 's' to run self-feedback commands'n' to exit program, or enter feedback for ...
Asking user via keyboard...
Input:y

After the authorization, the process displayed all the book text in the terminal window, without line breaks or anything. Then Auto-GPT crashed. I'm working on the master branch after a git pull.

@arrfonseca

So sorry about that. I realized I had to switch to the fix/crash-on-context-overflow branch, and now it gives me:
-=-=-=-=-=-=-= COMMAND AUTHORISED BY USER -=-=-=-=-=-=-=
SYSTEM: Failure: command read_file returned too much output. Do not execute this command again with the same arguments.
It summarized the .txt file to circumvent the "too much output" error, so the work is done on a very small portion of the text.
I'm trying to tell it to break the .txt file into smaller files so it can read the complete book.

@kinance kinance deleted the fix-bug-2801-2871-2906 branch May 2, 2023 15:50
@kinance
Contributor Author

kinance commented May 2, 2023


File "/Users/alessandro/Auto-GPT/autogpt/memory_management/summary_memory.py", line 114, in update_running_summary current_memory = create_chat_completion(messages, cfg.fast_llm_model) File "/Users/alessandro/Auto-GPT/autogpt/llm/llm_utils.py", line 166, in create_chat_completion response = api_manager.create_chat_completion(

It's a new bug introduced from a change in memory management.

@Pwuts
Member

Pwuts commented May 2, 2023

That (new) issue should be mitigated by #3646 while we work on a better and more permanent fix

@meanostrich

Can someone please point me to a newbie's guide for implementing the patch changes you're releasing? I get this error all the time. I see that this was corrected by modifying 9 files. Is there an easy way to apply the changes without going through each file and making the changes manually? I'm worried I may miss something, and I'm not as astute with modifying Auto-GPT. Thank you in advance, and sorry for the newb question.

@Pwuts
Member

Pwuts commented May 30, 2023

@meanostrich I would advise against applying individual patches. Instead, keep an eye on our GitHub and other channels for new releases which contain the newest patches.
