
Conversation

@mikeusru commented Apr 15, 2024

PR linked to issue described here:
#795 (comment)

Using Bedrock and Claude 3 Haiku:

import dspy

bedrock = dspy.Bedrock(...)  # constructor arguments elided in the original report

haiku = dspy.AWSAnthropic(
    aws_provider=bedrock,
    model="anthropic.claude-3-haiku-20240307-v1:0",
)

class BasicQA(dspy.Signature):
    """Answer questions with short factoid answers."""

    question = dspy.InputField()
    answer = dspy.OutputField(desc="often between 1 and 5 words")
    
# Define the predictor. Notice we're just changing the class. The signature BasicQA is unchanged.
generate_answer_with_chain_of_thought = dspy.ChainOfThought(BasicQA)

# Call the predictor on the same input.
pred = generate_answer_with_chain_of_thought(question=dev_example.question)

Error: TypeError: '>' not supported between instances of 'NoneType' and 'int'

The error is fixed once the change in this PR is applied.
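The failure mode is easy to see in isolation. A minimal sketch (simplified for illustration, not the actual dspy source): when neither "n" nor "num_generations" is present in the LM kwargs, the nested lookup falls through to None, and any later integer comparison raises exactly the TypeError above.

```python
# Minimal sketch of the failure mode (not the actual dspy source).
kwargs = {"max_tokens": 100}  # neither "n" nor "num_generations" is configured

# The nested lookup falls through to the final default, None.
num_generations = kwargs.get("n", kwargs.get("num_generations", None))
print(num_generations)  # None

try:
    # Downstream code compares the value against an int, which fails for None.
    if num_generations > 1:
        pass
except TypeError as e:
    print(e)  # '>' not supported between instances of 'NoneType' and 'int'
```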

@mikeusru mikeusru marked this pull request as ready for review April 15, 2024 17:48
@arnavsinghvi11 (Collaborator)

Thanks for raising this @mikeusru! This looks like a global change, not one specific to AWS models, but I'm curious whether it is fixed when you specify a default n of 1 in AWSModels, like other models do. I see that num_generations is not supported, but if n is, that should fix this issue? Tagging @drawal1 for AWSModels.

@drawal1 (Contributor) commented Apr 15, 2024

@arnavsinghvi11 - I have an alternative (and hopefully more robust) suggestion:

Instead of:
num_generations = lm.kwargs.get("n", lm.kwargs.get("num_generations", None))

How about?
num_generations = lm.kwargs.get("n", lm.kwargs.get("num_generations", 1))
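The effect of the changed default can be checked in isolation; this is just the nested dict.get fallback, independent of the surrounding dspy code:

```python
kwargs = {}  # neither "n" nor "num_generations" configured

# Original: falls through to None, which breaks later integer comparisons.
before = kwargs.get("n", kwargs.get("num_generations", None))

# Suggested: falls through to 1, a sensible default of one generation.
after = kwargs.get("n", kwargs.get("num_generations", 1))
print(before, after)  # None 1

# An explicitly configured value still takes precedence over the default.
configured = {"n": 3}
print(configured.get("n", configured.get("num_generations", 1)))  # 3
```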

@mikeusru (Author)

> @arnavsinghvi11 - I have an alternative (and hopefully more robust) suggestion:
>
> Instead of: num_generations = lm.kwargs.get("n", lm.kwargs.get("num_generations", None))
>
> How about? num_generations = lm.kwargs.get("n", lm.kwargs.get("num_generations", 1))

I believe "n" causes errors as well and needs to be removed from the kwargs for Claude 3 models. I do see Bedrock documentation about batch inference in preview, but I haven't gotten it to work:
https://docs.aws.amazon.com/bedrock/latest/userguide/batch-inference.html

@arnavsinghvi11 (Collaborator)

Hmm, yeah. Ideally we'd want n to be set in either the config or the configured LM, and merging this would bypass that if either is not set. Do none of the AWSModels support n or any alternatives?

@drawal1 (Contributor) commented Apr 15, 2024

@arnavsinghvi11 @mikeusru - 'n' is a non-issue. I just verified that 'n' is NOT inserted by default in query_args for Anthropic.

If somebody deliberately introduces it and Bedrock/Anthropic complains, then it's a valid complaint. We don't want to patch for dev mistakes. Or am I missing something?

@arnavsinghvi11 (Collaborator)

Makes sense. I just wanted to confirm batching behavior for AWS models. Adding in the default behavior for num_generations and merging. Thanks @mikeusru @drawal1!

@arnavsinghvi11 arnavsinghvi11 merged commit 46df3a0 into stanfordnlp:main Apr 16, 2024