
Conversation

@mikeusru

When the model's kwargs are altered to build the request body, the new kwargs should not be saved back onto the original llm object. This fixes a bug caused by the mutability of kwargs.

In my case, the bug caused max_tokens to be cut in half repeatedly every time I re-ran the model, due to the following logic in predict.py:

    # From predict.py: max_tokens is halved (floored at 75) for the retried request.
    # If the halved value is ever written back into dsp.settings.lm.kwargs, each
    # re-run starts from the already-halved value and the reduction compounds.
    max_tokens = min(max(75, max_tokens // 2), max_tokens)
    keys = list(kwargs.keys()) + list(dsp.settings.lm.kwargs.keys())
    max_tokens_key = "max_tokens" if "max_tokens" in keys else "max_output_tokens"
    new_kwargs = {
        **kwargs,
        max_tokens_key: max_tokens,
        "n": 1,
        "temperature": 0.0,
    }
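
A minimal sketch of the fix described above, assuming the LM wrapper previously merged per-call kwargs straight into its own kwargs dict (the SketchLM class and basic_request method are hypothetical stand-ins, not the actual aws_models.py code):

    import copy

    class SketchLM:
        def __init__(self, **kwargs):
            # kwargs stored on the llm object; these should stay stable across calls
            self.kwargs = {"max_tokens": 300, **kwargs}

        def basic_request(self, prompt, **call_kwargs):
            # Buggy pattern: self.kwargs.update(call_kwargs) would save the altered
            # values (e.g. a halved max_tokens) back onto the llm object, so the
            # halving in predict.py compounds on every re-run.
            #
            # Fix: build the request body from a copy, leaving self.kwargs untouched.
            body = copy.deepcopy(self.kwargs)
            body.update(call_kwargs)
            return {"prompt": prompt, **body}

With this, passing a reduced max_tokens for one request no longer changes the value stored on the llm object for later runs.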

@drawal1
Contributor

drawal1 commented Apr 26, 2024

Reviewed. I fixed this comprehensively in #843. You caught the dict copy issue, but there's another check that's also needed. You can review aws_models.py in my PR.

@arnavsinghvi11
Collaborator

@drawal1 - could we potentially isolate that patch and push it within this PR? It seems like #843 is more involved and needs some conflicts resolved before merging.

@drawal1
Contributor

drawal1 commented Jun 3, 2024

@arnavsinghvi11 - I reverted all the complicated changes in PR #843 so that it can be merged and this PR can be closed.

arnavsinghvi11 added a commit that referenced this pull request Jun 19, 2024
Fixed issue #894 and #858 (aws_models issues)
@arnavsinghvi11
Collaborator

Resolved by merging #843.
