-
Notifications
You must be signed in to change notification settings - Fork 2.7k
Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.
License
openai/evals
ErrorLooks like something went wrong!
About
Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.