Popular repositories Loading
-
evals
evals PublicForked from openai/evals
Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.
Python
Repositories
- uptrain Public
UpTrain is an open-source unified platform to evaluate and improve Generative AI applications. We provide grades for 20+ preconfigured checks (covering language, code, embedding use-cases), perform root cause analysis on failure cases and give insights on how to resolve them.
- evals Public Forked from openai/evals
Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.
People
This organization has no public members. You must be a member to see who’s a part of this organization.
Top languages
Loading…
Most used topics
Loading…