enh(dspy): Move max_tokens explicitally into `GPT3` constructor #734

harrysalmon · 2024-03-28T19:38:49Z

Makes the reason for output truncation clearer as per #730

Also adds a comment to the predict file where the max tokens are modified before the call is made.

Makes the reason for output truncation clearer as per stanfordnlp#730 Also adds a comment to the predict file where the max tokens are modified before the call is made.

harrysalmon · 2024-03-28T19:43:34Z

dsp/primitives/predict.py

            keys = list(kwargs.keys()) + list(dsp.settings.lm.kwargs.keys()) 
            max_tokens_key = "max_tokens" if "max_tokens" in keys else "max_output_tokens"
            new_kwargs = {
                **kwargs,


How the value set by the user effects the maximum tokens passed and returned from the model is quite unclear here, would anyone be able to clarify how 101:118 relate to the eventual values passed to different models, I'll have a crack at clearing this up. Might it be possible to do the 75 checking upstream of here?

okhat · 2024-03-30T21:55:10Z

This is mainly during retries etc. It's not very straightforward to change.

okhat · 2024-03-30T21:56:11Z

Basically, the user's max_tokens is used at first, then max_tokens / 2 if we need more fields, but 75 is sort of like a min_max_tokens. Setting a large enough max_tokens could be all that's required here?

harrysalmon · 2024-03-31T19:22:34Z

Thanks for taking a look @okhat

Yeah the main thing for me is putting it into the init so it's clear to the user that they need to set it. What do you think of this part?

In terms of the retries, do you mean if max_tokens doesn't leave the LLM enough tokens to pass the full input tokens, it'll retry with max_tokens / 2 and then with 75?

arnavsinghvi11 · 2024-04-13T01:55:42Z

@harrysalmon just following up on this. I think we can merge the change but leave the existing retry behavior. lmk if it's ready to merge.

arnavsinghvi11 · 2024-05-06T01:36:15Z

Closing for now to avoid breaking cache errors.

enh(dspy): Move max_tokens explicitally into GPT3 constructor

da89922

Makes the reason for output truncation clearer as per stanfordnlp#730 Also adds a comment to the predict file where the max tokens are modified before the call is made.

harrysalmon commented Mar 28, 2024

View reviewed changes

arnavsinghvi11 mentioned this pull request Apr 28, 2024

fix/Extend generation for all candidate completions #920

Merged

arnavsinghvi11 closed this May 6, 2024

arnavsinghvi11 mentioned this pull request Sep 20, 2024

Extra generations cause max_tokens of AWSModels to halve permanently each time #1465

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

enh(dspy): Move max_tokens explicitally into `GPT3` constructor #734

enh(dspy): Move max_tokens explicitally into `GPT3` constructor #734

Uh oh!

harrysalmon commented Mar 28, 2024

Uh oh!

harrysalmon Mar 28, 2024

Uh oh!

okhat commented Mar 30, 2024

Uh oh!

okhat commented Mar 30, 2024

Uh oh!

harrysalmon commented Mar 31, 2024

Uh oh!

arnavsinghvi11 commented Apr 13, 2024

Uh oh!

arnavsinghvi11 commented May 6, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

enh(dspy): Move max_tokens explicitally into GPT3 constructor #734

enh(dspy): Move max_tokens explicitally into GPT3 constructor #734

Uh oh!

Conversation

harrysalmon commented Mar 28, 2024

Uh oh!

harrysalmon Mar 28, 2024

Choose a reason for hiding this comment

Uh oh!

okhat commented Mar 30, 2024

Uh oh!

okhat commented Mar 30, 2024

Uh oh!

harrysalmon commented Mar 31, 2024

Uh oh!

arnavsinghvi11 commented Apr 13, 2024

Uh oh!

arnavsinghvi11 commented May 6, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

enh(dspy): Move max_tokens explicitally into `GPT3` constructor #734

enh(dspy): Move max_tokens explicitally into `GPT3` constructor #734