README + website updates #10

Merged
tara-servicenow merged 12 commits into main from pr/tara/website_and_readme
Mar 24, 2026

Conversation

@tara-servicenow
Collaborator

No description provided.

Comment thread README.md
Comment on lines 89 to 91
@@ -90,22 +79,22 @@
- `GOOGLE_APPLICATION_CREDENTIALS`: Gemini via Vertex AI (audio judge metrics)
- `AWS_ACCESS_KEY_ID` + `AWS_SECRET_ACCESS_KEY`: Claude via Bedrock (faithfulness metric)
Collaborator

You didn't like this?

Collaborator Author

No, I think this complicates things unnecessarily. Also, if people want to, they can easily do this on their own. I think lots of people (including us) just auto-replace the judges to match what they have whenever they clone new benchmarks.

Collaborator

OK! Do we still want to mention that `JUDGE_MODEL` exists and can be overridden?

Collaborator Author

I don't think it exists anymore, though.
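
For anyone following this thread, here is a minimal sketch of how the credential variables quoted in the diff above could be checked before a run. The variable names come from the README lines under review; the `REQUIRED_CREDENTIALS` mapping and the `missing_credentials` helper are hypothetical names used only for illustration and are not part of the repository.

```python
import os

# Variable names are taken from the README lines quoted in this thread.
# The dictionary keys are descriptive labels only (an assumption, not repo identifiers).
REQUIRED_CREDENTIALS = {
    "audio judge metrics (Gemini via Vertex AI)": ["GOOGLE_APPLICATION_CREDENTIALS"],
    "faithfulness metric (Claude via Bedrock)": [
        "AWS_ACCESS_KEY_ID",
        "AWS_SECRET_ACCESS_KEY",
    ],
}


def missing_credentials(env=None):
    """Return {metric label: [unset variable names]} for metrics lacking credentials."""
    env = os.environ if env is None else env
    missing = {}
    for metric, names in REQUIRED_CREDENTIALS.items():
        # Treat empty strings the same as unset variables.
        unset = [name for name in names if not env.get(name)]
        if unset:
            missing[metric] = unset
    return missing


if __name__ == "__main__":
    for metric, names in missing_credentials().items():
        print(f"{metric}: missing {', '.join(names)}")
```

Passing a plain dictionary instead of reading `os.environ` keeps the check easy to try out without touching the real environment.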

Comment thread README.md
@@ -69,19 +67,10 @@
```

Collaborator

@gabegma, Mar 24, 2026

Should we keep the header? Good job on the rest, though; it's true it was duplicated.

@tara-servicenow changed the title from "README updates" to "README + website updates" on Mar 24, 2026
Comment thread README.md Outdated
Co-authored-by: Joseph Marinier <Joseph.Marinier@gmail.com>
Comment thread README.md
│ │ └── validation/ # Quality control metrics
│ └── utils/ # Utilities (LLM client, log processing)
├── scripts/ # Utility scripts
│ ├── run_benchmark.py # Benchmark runner
Collaborator

This was moved since then.

Collaborator

@gabegma left a comment

Tiny comment and then this is good to go!

@tara-servicenow added this pull request to the merge queue on Mar 24, 2026
Merged via the queue into main with commit 5ca0606 on Mar 24, 2026
1 check passed
@JosephMarinier deleted the pr/tara/website_and_readme branch on March 24, 2026, 18:18
3 participants