Run evals with gpt-4? #575

@garyrob

The evals I am creating won't get good results on gpt-3.5. They need gpt-4. I have tried them manually with ChatGPT with the two models, and 3.5 can't do it while 4 can.

But I don't see how to run evals with gpt-4. `oaieval gpt-3.5-turbo test-match` works fine on my machine, but `oaieval gpt-4 test-match` fails with the error "ValueError: Couldn't find model: gpt-4".

I assume the issue is that "GPT-4 is currently in a limited beta and only accessible to those who have been granted access", and I'm still on the waitlist for that.
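One way to check that assumption is to ask the API which models the key can see. The following is a minimal sketch, not anything taken from the evals code: it assumes the public `GET /v1/models` endpoint returns a JSON object whose `data` list holds entries with an `id` field, and the helper names `fetch_models` and `has_model` are made up for illustration. Whether `oaieval` performs the same lookup internally is not something I have confirmed.

```python
import json
import urllib.request

# Assumption: the public model-listing endpoint of the OpenAI API.
MODELS_URL = "https://api.openai.com/v1/models"


def fetch_models(api_key):
    """Return the parsed JSON body of the model listing visible to api_key."""
    request = urllib.request.Request(
        MODELS_URL, headers={"Authorization": f"Bearer {api_key}"}
    )
    with urllib.request.urlopen(request) as response:
        return json.load(response)


def has_model(payload, model_id):
    """Return True if model_id appears among the "id" fields in payload["data"]."""
    return any(entry.get("id") == model_id for entry in payload.get("data", []))


# Hypothetical usage (needs network access and a real key in OPENAI_API_KEY):
#   import os
#   payload = fetch_models(os.environ["OPENAI_API_KEY"])
#   print("gpt-4 available:", has_model(payload, "gpt-4"))
```

If `gpt-4` is missing from that listing while `gpt-3.5-turbo` is present, that would be consistent with the key simply not having gpt-4 access yet.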

I further assume that, even though the evals will only get good results on gpt-4 and I can't run them on gpt-4, you will still find my evals useful. (Or, you could give me gpt-4 access... :) :) )

Let me know if I'm wrong.
