Skip to content
@arena-ai

Arena

Arena Logo

Sarus Arena Framework

If you use public AI services such as OpenAI, Anthropic or Mistral, Sarus Arena is an agent you can easily deploy in your infrastructure to do:

  • LLM evaluation: AB-testing, user-feedback evaluation, formula-based evaluation and LLM as a Judge
  • LLM compliance: Request and response filtering and redacting (PII removal, guardrailing), evaluation-based routing
  • LLM distillation: Train your own model based on the best evaluated responses

Pinned

  1. arena arena Public

    A place to evaluate public models

    Jupyter Notebook 1

Repositories

Showing 4 of 4 repositories

People

This organization has no public members. You must be a member to see who’s a part of this organization.

Top languages

Loading…

Most used topics

Loading…