README + website updates by tara-servicenow · Pull Request #10 · ServiceNow/eva

tara-servicenow · 2026-03-24T16:57:18Z

No description provided.

gabegma · 2026-03-24T17:04:13Z

@@ -90,22 +79,22 @@
 - `GOOGLE_APPLICATION_CREDENTIALS`: Gemini via Vertex AI (audio judge metrics)
 - `AWS_ACCESS_KEY_ID` + `AWS_SECRET_ACCESS_KEY`: Claude via Bedrock (faithfulness metric)


You didn't like this?

No i think this complicates things unnecessarily. also i think if people want to they can easily just do this on their own. I think lots of people (including us) just auto replace the judges whenever they clone new benchmarks to match what they have

Ok! Do we still want to mention that JUDGE_MODEL exists and can be overwritten?

i dont think it does exist anymore though

gabegma · 2026-03-24T17:05:01Z

@@ -69,19 +67,10 @@
 ```



Should we keep the header? Good job on the rest though, it's true it was duplicated

…a into pr/tara/website_and_readme

Co-authored-by: Joseph Marinier <Joseph.Marinier@gmail.com>

…a into pr/tara/website_and_readme

gabegma · 2026-03-24T17:40:30Z

+│   │   └── validation/        # Quality control metrics
+│   └── utils/                 # Utilities (LLM client, log processing)
+├── scripts/                   # Utility scripts
+│   ├── run_benchmark.py       # Benchmark runner


This was moved since then.

gabegma

Tiny comment and then this is good to go!

tara-servicenow and others added 2 commits March 24, 2026 09:57

README updates

5160d1a

Apply pre-commit

6242246

gabegma reviewed Mar 24, 2026

View reviewed changes

tara-servicenow added 6 commits March 24, 2026 10:06

Add dataset link

405595c

Replace chatterbox with chatterbox turbo

bfdccd5

Merge branch 'pr/tara/website_and_readme' of github.com:ServiceNow/ev…

368a36c

…a into pr/tara/website_and_readme

Add heading back

3e92365

Fix

a79a99e

Add index html

d924a5a

tara-servicenow changed the title ~~README updates~~ README + website updates Mar 24, 2026

Finish build

6332cc4

JosephMarinier reviewed Mar 24, 2026

View reviewed changes

Comment thread README.md Outdated

Update README.md

613ccc4

Co-authored-by: Joseph Marinier <Joseph.Marinier@gmail.com>

JosephMarinier approved these changes Mar 24, 2026

View reviewed changes

tara-servicenow added 2 commits March 24, 2026 10:39

Delete

128a211

Merge branch 'pr/tara/website_and_readme' of github.com:ServiceNow/ev…

2e021d9

…a into pr/tara/website_and_readme

gabegma reviewed Mar 24, 2026

View reviewed changes

gabegma approved these changes Mar 24, 2026

View reviewed changes

tara-servicenow added this pull request to the merge queue Mar 24, 2026

Merged via the queue into main with commit 5ca0606 Mar 24, 2026
1 check passed

JosephMarinier deleted the pr/tara/website_and_readme branch March 24, 2026 18:18

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README + website updates#10

README + website updates#10
tara-servicenow merged 12 commits intomainfrom
pr/tara/website_and_readme

tara-servicenow commented Mar 24, 2026

Uh oh!

gabegma Mar 24, 2026

Uh oh!

tara-servicenow Mar 24, 2026

Uh oh!

gabegma Mar 24, 2026

Uh oh!

tara-servicenow Mar 24, 2026

Uh oh!

gabegma Mar 24, 2026 •

edited

Loading

Uh oh!

Uh oh!

gabegma Mar 24, 2026

Uh oh!

gabegma left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

		@@ -90,22 +79,22 @@
		- `GOOGLE_APPLICATION_CREDENTIALS`: Gemini via Vertex AI (audio judge metrics)
		- `AWS_ACCESS_KEY_ID` + `AWS_SECRET_ACCESS_KEY`: Claude via Bedrock (faithfulness metric)

Conversation

tara-servicenow commented Mar 24, 2026

Uh oh!

gabegma Mar 24, 2026

Choose a reason for hiding this comment

Uh oh!

tara-servicenow Mar 24, 2026

Choose a reason for hiding this comment

Uh oh!

gabegma Mar 24, 2026

Choose a reason for hiding this comment

Uh oh!

tara-servicenow Mar 24, 2026

Choose a reason for hiding this comment

Uh oh!

gabegma Mar 24, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

gabegma Mar 24, 2026

Choose a reason for hiding this comment

Uh oh!

gabegma left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

gabegma Mar 24, 2026 •

edited

Loading