Fix the incompatibility bug of azure deployment id configuration with gpt4/3-only #4098

IANTHEREAL · 2023-05-11T02:07:39Z

Background

When I use the azure openai api and set the set gpt4 only, auto gpt will fail because the deployment id is incorrect.

azure.yaml

azure_api_type: azure
azure_api_base: https://test.openai.azure.com/
azure_api_version: 2023-03-15-preview
azure_model_map:
    fast_llm_model_deployment_id: gpt-3-5_playground
    smart_llm_model_deployment_id: gpt-4_playground
    embedding_model_deployment_id: gpt-embedding-ada

min test script

from autogpt.config.config import Config

CFG = Config()
print(CFG.fast_llm_model, CFG.get_azure_deployment_id_for_model(CFG.fast_llm_model))
print(CFG.smart_llm_model, CFG.get_azure_deployment_id_for_model(CFG.smart_llm_model))

CFG.set_smart_llm_model(CFG.fast_llm_model)
print(CFG.fast_llm_model, CFG.get_azure_deployment_id_for_model(CFG.fast_llm_model))
print(CFG.smart_llm_model, CFG.get_azure_deployment_id_for_model(CFG.smart_llm_model))

output

gpt-3.5-turbo -> gpt-3-5_playground
gpt-4 -> gpt-4_playground
gpt-4 -> gpt-3-5_playground
gpt-4 -> gpt-3-5_playground

But it should be

gpt-3.5-turbo -> gpt-3-5_playground
gpt-4 -> gpt-4_playground
gpt-4 -> gpt-4_playground
gpt-4 -> gpt-4_playground

Changes

When the user sets set gpt4 only, modify the corresponding fast llm model related configuration

Documentation

Test Plan

PR Quality Checklist

My pull request is atomic and focuses on a single change.
I have thoroughly tested my changes with multiple different prompts.
I have considered potential risks and mitigations for my changes.
I have documented my changes clearly and comprehensively.
I have not snuck in any "extra" small tweaks changes

vercel · 2023-05-11T02:07:44Z

The latest updates on your projects. Learn more about Vercel for Git ↗︎

1 Ignored Deployment

Name	Status	Preview	Comments	Updated (UTC)
docs	⬜️ Ignored (Inspect)	Visit Preview		Jun 7, 2023 3:16am

lc0rp · 2023-05-19T13:25:24Z

The underlying issue is addressed in #3144.

@IANTHEREAL Shall we close this and collaborate there?

IANTHEREAL · 2023-05-22T01:04:25Z

@lc0rp Hi,but I think these two PRs don't solve the same problem

codecov · 2023-05-22T20:04:35Z

Codecov Report

Patch coverage: 85.71% and project coverage change: -0.06 ⚠️

Comparison is base (9e5492b) 50.04% compared to head (6828d11) 49.98%.

Additional details and impacted files

@@            Coverage Diff             @@
##           master    #4098      +/-   ##
==========================================
- Coverage   50.04%   49.98%   -0.06%     
==========================================
  Files         116      116              
  Lines        4826     4825       -1     
  Branches      650      650              
==========================================
- Hits         2415     2412       -3     
- Misses       2227     2228       +1     
- Partials      184      185       +1

Impacted Files	Coverage Δ
autogpt/config/config.py	`82.55% <83.33%> (-1.35%)`	⬇️
autogpt/configurator.py	`29.48% <100.00%> (-0.90%)`	⬇️

☔ View full report in Codecov by Sentry.
📢 Do you have feedback about the report comment? Let us know in this issue.

lc0rp · 2023-05-23T18:44:21Z

@lc0rp Hi, but I think these two PRs don't solve the same problem

Well, not exactly the same, but parts of this PR overlap with #3144 and this PR may be fixed by #3144. So, I should have said they're related. In this PR, you suggest that if we're in gpt4only mode, we should pull settings from the CFG.smart_llm_model and vice-versa.

In PR #3144, we ignore the config when we select an exclusive mode at the command line. It shouldn't matter whether the smart_llm_model is set to 3.5 or 4 - in gpt4only mode, we should use GPT-4.

lc0rp

Added a couple of comments. Happy to hop on a discord chat/call.

autogpt/configurator.py

IANTHEREAL · 2023-05-24T01:04:34Z

@lc0rp Hi
The crux of the issue we're discussing is that, although the model only consists of GPT3.5 and GPT4, there are multiple parameters that can be set independently. Hence, we are forced to make some assumptions based on expectations.

My background is that I'm develop an online service, and neither errors nor performance degradation are acceptable to me.

For example, when setting 'gpt4only':
1.1. Incorrectly selecting the deployment id due to 'gpt4 only' setting shouldn't occur.
1.2. I'd prefer not to see a trade-off where setting the fast model token limit to 4000 results in smaller memory and a subsequent loss of performance.

The same applies when setting 'gpt3only'. Encountering an error due to the smart model token limit being 8000 is something I'd rather not see.

At present, my modifications have only mitigated these issues, but this isn't the best solution. Incorporating these related parameters into two separate GPT configuration objects will be more nature, thus avoiding these discussions that I see as somewhat compromised.

I apologize if my response is a bit rushed. If you have thoughts, I'd be happy to discuss further on the community discord after I finish a project in the next couple of days.

To add to that, I've just tested your code with gpt4only=True, and it seems that the function get_azure_deployment_id_for_model() isn't working as expected.

IANTHEREAL · 2023-05-25T09:47:50Z

Proposal for a Structured Configuration Object for GPT Models

In the current implementation of Auto-GPT, there are multiple parameters related to GPT-3 and GPT-4 models that can be set independently. This can lead to unexpected behavior when the deployment ID for the model is fetched using the get_azure_deployment_id_for_model() function. A solution to this problem is to encapsulate all model-related parameters into a structured configuration object, which will both simplify the code and reduce the likelihood of errors.

Proposal

The proposal is to introduce a GPTConfig data class that contains all relevant parameters for a GPT model. Here is a potential implementation:

@dataclass
class GPTConfig:
    model: str
    api_key: str
    token_limit: int = 4000
    temperature: float = 0
    api_deployment_id: Optional[str] = None

With this data class, we can create separate configuration objects for GPT-3 and GPT-4 models from ENV:

gpt3 = GPTConfig(model='gpt-3', api_key='gpt3_api_key', api_deployment_id='gpt-3_deployment_id')
gpt4 = GPTConfig(model='gpt-4', api_key='gpt4_api_key', api_deployment_id='gpt-4_deployment_id')

To manage these configurations, a Config class can be created:

class Config:
    def __init__(self):
        self.model_config = {
            'gpt-3': gpt3,
            'gpt-4': gpt4,
        }
        self.fast_llm_model = 'gpt-3'
        self.smart_llm_model = 'gpt-3'

    def set_fast_llm_model(self, model: str):
        # check whether model is in self.model_config
        if model in self.model_config:
            self.fast_llm_model = model
        else:
            raise ValueError(f"Model {model} is not in the configuration")

Benefits

Code Simplification: Encapsulating all model-related parameters into a single data structure simplifies the codebase and makes it easier to understand and maintain. This approach brings clarity to the configuration process by making the relationships between the different parameters explicit. This can make the system easier to debug and extend.
Error Reduction: By ensuring that all model-related parameters are set consistently, this approach reduces the likelihood of errors.
Enhanced Flexibility: This approach allows for easy configuration of multiple heterogeneous models in the future. If a new model variant is introduced, a new configuration object can be easily created for it.

IANTHEREAL · 2023-05-25T09:51:20Z

Proposal for a Structured Configuration Object for GPT Models

@lc0rp How do you feel about it? Do you think you could take on the implementation of this proposal? I would really appreciate your assistance with this.

github-actions · 2023-05-26T17:14:11Z

This pull request has conflicts with the base branch, please resolve those so we can evaluate the pull request.

lc0rp · 2023-05-27T00:49:04Z

Proposal for a Structured Configuration Object for GPT Models

@lc0rp How do you feel about it? Do you think you could take on the implementation of this proposal? I would really appreciate your assistance with this.

Oops, I was absent these past few days. However, I agree with you regarding the matter of multiple independent variables that need to be linked, and I must say, your proposal looks fantastic!

I am pleased to note that the re-architectural design has also reached similar conclusions, and your proposal resembles the re-architectural code. The re-architectural code is situated in a separate branch as all the loose ends have been tied. It is almost ready for release.

Take a look at the LLM base models through this link: https://github.com/Significant-Gravitas/Auto-GPT/blob/agent-state-encapsulation/autogpt/llm/base.py and let me know your thoughts. If you'd like to continue the proposal afterward, I'd be happy to collaborate. We'll move it to a new DRAFT issue and invite feedback before PR submission.

IANTHEREAL · 2023-05-29T01:51:01Z

@lc0rp Similar to how the static attributes of GPT are preserved in the LLM base models: https://github.com/Significant-Gravitas/Auto-GPT/blob/agent-state-encapsulation/autogpt/llm/base.py, I think we can implement a similar GPT dynamic attributes configuration. I'd like to convert this PR into a draft and try to modify it. If you have any other plans or thoughts, please feel free to share with me at any time.

… set-gpt3/4-only

lc0rp · 2023-06-23T15:11:03Z

@IANTHEREAL Is this still valid? If so, we're prepping for release v0.4.3. Please resolve conflicts and stand by as we merge.

… set-gpt3/4-only

github-actions · 2023-06-26T01:39:13Z

Conflicts have been resolved! 🎉 A maintainer will review the pull request shortly.

netlify · 2023-06-26T01:39:57Z

✅ Deploy Preview for auto-gpt-docs canceled.

Name	Link
🔨 Latest commit	`f56ff98`
🔍 Latest deploy log	https://app.netlify.com/sites/auto-gpt-docs/deploys/64a78c7e12b0f100081759f4

IANTHEREAL · 2023-06-26T02:42:08Z

@lc0rp Yes, I have resolved it

github-actions · 2023-06-27T07:41:25Z

This pull request has conflicts with the base branch, please resolve those so we can evaluate the pull request.

Pwuts

The config module was overhauled in #4803, causing conflicts with this PR (again). I'm not sure why this PR wasn't included in an earlier release yet, it looks like it was mergeable.

Anyhow, it seems that #4803 also broke support for Azure by removing Config.get_azure_deployment_id_for_model(). Instead of resolving the merge conflicts on this PR, maybe it's easier to make a new PR fixing Azure support altogether in a new PR.

Update: @collijk will look into patching basic Azure support first, and then we can check if the issue addressed by this PR still exists.
Update 2: looks like this PR fixes Azure support: #4875

tests/unit/test_config.py

github-actions · 2023-07-07T02:53:29Z

Conflicts have been resolved! 🎉 A maintainer will review the pull request shortly.

* Fix --gpt3only and --gpt4only * Fix and consolidate test_config.py::test_azure_config (x2) --------- Co-authored-by: Luke K (pr-0f3t) <2609441+lc0rp@users.noreply.github.com> Co-authored-by: Ryan <eimu.gray@gmail.com> Co-authored-by: Reinier van der Leer <github@pwuts.nl>

set only gpt3/4 configuration correctlly

abc37a4

github-actions bot added the size/m label May 11, 2023

IANTHEREAL and others added 2 commits May 11, 2023 10:27

address to make configuration changed correctly

c9a8273

Merge branch 'master' into set-gpt3/4-only

93f74cc

k-boikov added the Azure label May 14, 2023

k-boikov added this to the v0.3.2-release milestone May 14, 2023

Merge branch 'master' into set-gpt3/4-only

d258397

vercel bot temporarily deployed to Preview May 19, 2023 10:55 Inactive

lc0rp self-assigned this May 19, 2023

k-boikov added 2 commits May 22, 2023 22:58

black dot

b10f4fa

Merge branch 'master' into set-gpt3/4-only

dc4f28d

vercel bot temporarily deployed to Preview May 22, 2023 19:59 Inactive

lc0rp reviewed May 23, 2023

View reviewed changes

autogpt/configurator.py Outdated Show resolved Hide resolved

autogpt/configurator.py Outdated Show resolved Hide resolved

autogpt/configurator.py Outdated Show resolved Hide resolved

autogpt/configurator.py Outdated Show resolved Hide resolved

lc0rp modified the milestones: v0.4.0 Release, v0.4.1 Release May 23, 2023

github-actions bot added the conflicts Automatically applied to PRs with merge conflicts label May 26, 2023

IANTHEREAL marked this pull request as draft May 29, 2023 01:51

IANTHEREAL added 2 commits May 29, 2023 15:25

revert modifition

d3b99de

Merge branch 'master' of https://github.com/Torantulino/Auto-GPT into…

9aebaf3

… set-gpt3/4-only

lc0rp added this to the v0.4.2 Release milestone Jun 14, 2023

Merge branch 'master' of https://github.com/Torantulino/Auto-GPT into…

f0b572b

… set-gpt3/4-only

github-actions bot removed the conflicts Automatically applied to PRs with merge conflicts label Jun 26, 2023

eimugray added 2 commits June 26, 2023 10:25

fix test

36f25a6

format code

1dfbc7b

lc0rp modified the milestones: v0.4.3 Release, v0.4.4 Release Jun 26, 2023

github-actions bot added the conflicts Automatically applied to PRs with merge conflicts label Jun 27, 2023

Pwuts reviewed Jul 6, 2023

View reviewed changes

tests/unit/test_config.py Outdated Show resolved Hide resolved

Pwuts assigned Pwuts and unassigned Pwuts Jul 6, 2023

Merge branch 'master' into set-gpt3/4-only

5a59891

github-actions bot removed the conflicts Automatically applied to PRs with merge conflicts label Jul 7, 2023

Pwuts added 5 commits July 7, 2023 05:19

Fix and consolidate test_config.py::test_azure_config (x2)

752f401

Fix circular import

fe46a72

Fix Config.get_azure_kwargs() for model derivatives

6828d11

Fix linter

6090cc6

Merge branch 'master' into set-gpt3/4-only

f56ff98

Pwuts self-assigned this Jul 7, 2023

Pwuts approved these changes Jul 7, 2023

View reviewed changes

Pwuts merged commit 3b7e101 into Significant-Gravitas:master Jul 7, 2023
14 checks passed

collijk mentioned this pull request Jul 7, 2023

Bugfix/broken azure config #4912

Merged

6 tasks

IANTHEREAL deleted the set-gpt3/4-only branch July 10, 2023 06:50

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix the incompatibility bug of azure deployment id configuration with gpt4/3-only #4098

Fix the incompatibility bug of azure deployment id configuration with gpt4/3-only #4098

IANTHEREAL commented May 11, 2023 •

edited

vercel bot commented May 11, 2023 •

edited

lc0rp commented May 19, 2023

IANTHEREAL commented May 22, 2023 •

edited

codecov bot commented May 22, 2023 •

edited

lc0rp commented May 23, 2023 •

edited

lc0rp left a comment

IANTHEREAL commented May 24, 2023

IANTHEREAL commented May 25, 2023 •

edited

IANTHEREAL commented May 25, 2023

Proposal for a Structured Configuration Object for GPT Models

github-actions bot commented May 26, 2023

lc0rp commented May 27, 2023

Proposal for a Structured Configuration Object for GPT Models

IANTHEREAL commented May 29, 2023

lc0rp commented Jun 23, 2023

github-actions bot commented Jun 26, 2023

netlify bot commented Jun 26, 2023 •

edited

IANTHEREAL commented Jun 26, 2023

github-actions bot commented Jun 27, 2023

Pwuts left a comment •

edited

github-actions bot commented Jul 7, 2023

Fix the incompatibility bug of azure deployment id configuration with gpt4/3-only #4098

Fix the incompatibility bug of azure deployment id configuration with gpt4/3-only #4098

Conversation

IANTHEREAL commented May 11, 2023 • edited

Background

Changes

Documentation

Test Plan

PR Quality Checklist

vercel bot commented May 11, 2023 • edited

lc0rp commented May 19, 2023

IANTHEREAL commented May 22, 2023 • edited

codecov bot commented May 22, 2023 • edited

Codecov Report

lc0rp commented May 23, 2023 • edited

lc0rp left a comment

Choose a reason for hiding this comment

IANTHEREAL commented May 24, 2023

IANTHEREAL commented May 25, 2023 • edited

Proposal for a Structured Configuration Object for GPT Models

Proposal

Benefits

IANTHEREAL commented May 25, 2023

Proposal for a Structured Configuration Object for GPT Models

github-actions bot commented May 26, 2023

lc0rp commented May 27, 2023

Proposal for a Structured Configuration Object for GPT Models

IANTHEREAL commented May 29, 2023

lc0rp commented Jun 23, 2023

github-actions bot commented Jun 26, 2023

netlify bot commented Jun 26, 2023 • edited

✅ Deploy Preview for auto-gpt-docs canceled.

IANTHEREAL commented Jun 26, 2023

github-actions bot commented Jun 27, 2023

Pwuts left a comment • edited

Choose a reason for hiding this comment

github-actions bot commented Jul 7, 2023

IANTHEREAL commented May 11, 2023 •

edited

vercel bot commented May 11, 2023 •

edited

IANTHEREAL commented May 22, 2023 •

edited

codecov bot commented May 22, 2023 •

edited

lc0rp commented May 23, 2023 •

edited

IANTHEREAL commented May 25, 2023 •

edited

netlify bot commented Jun 26, 2023 •

edited

Pwuts left a comment •

edited