Braintrust Eval Course — Course Materials

This repo contains datasets, code, and other assets for each module of the Braintrust Eval Course.

Modules

03 — Build a Simple Eval in the Braintrust UI — Dataset CSV, prompts, and scorer
06 — Build a Simple Eval in Code — Python eval script
07 — Nondeterminism — Eval with trial runs
10 — Building a Multi-Turn Chat App — Chat app with Braintrust logging
11 — Analyzing Multi-Turn Traces — Batch script to score full conversations at the trace level
12 — Online Scoring — Conversation generator script + online scoring configuration in the Braintrust UI
13 — Analyzing Production Logs — Script to generate production logs + Topics setup walkthrough
14 — The Improvement Loop — Baseline and fixed eval scripts to verify a prompt change resolves a regression

export BRAINTRUST_API_KEY="your-api-key"
export OPENAI_API_KEY="your-openai-api-key"

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
module-03		module-03
module-06		module-06
module-07		module-07
module-10		module-10
module-11		module-11
module-12		module-12
module-13		module-13
module-14		module-14
.gitignore		.gitignore
README.md		README.md