Run evals with gpt-4?

The evals I am creating won't get good results on gpt-3.5. They need gpt-4. I have tried them manually with ChatGPT with the two models, and 3.5 can't do it while 4 can. 

But I don't see how to run evals with gpt-4. `oaieval gpt-3.5-turbo test-match` works fine on my machine, but `oaieval gpt-4 test-match` fails with the error, "ValueError: Couldn't find model: gpt-4".

I assume the issue is that "GPT-4 is currently in a limited beta and only accessible to those who have been granted access", and I'm still on the waitlist for that.

I further assume that, even though the evals will only get good results on gpt-4 and I can't run them on gpt-4, you will still find my evals useful. (Or, you could give me gpt-4 access... :)  :)    )

Let me know if I'm wrong.



Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Run evals with gpt-4? #575

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Run evals with gpt-4? #575

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions