refactor: rename eval_set to dataset across codebase by christso · Pull Request #814 · EntityProcess/agentv

christso · 2026-03-28T12:53:12Z

Summary

Closes #812

Rename eval_set field to dataset in core types (EvalTest, EvaluationResult), Zod schema, YAML/JSONL parsers, orchestrator, and OTel exporter
Update all CLI commands: artifact-writer, junit-writer, manifest, serve endpoint (/categories → /datasets), pipeline (input/run/bench/grade), trace (show/stats/utils)
Rename Studio UI: CategorySummary → DatasetSummary, CategoriesResponse → DatasetsResponse, route /category/ → /dataset/, heading "Categories" → "Datasets", CategorySidebar → DatasetSidebar
Regenerate eval-schema.json and routeTree.gen.ts
Maintain backward compatibility: JSONL parser/manifest accept both eval_set and dataset, pipeline readers use dataset ?? eval_set, trace stats accepts --group-by eval-set as deprecated alias

Risk

High — breaking API change (renamed wire format fields, API endpoint, route URLs). Backward compat maintained for reading old data.

Test plan

bun --filter @agentv/core typecheck — clean
bun --filter @agentv/core test + bun --filter agentv test — 353 tests pass, 0 fail
bun --filter @agentv/studio build — builds clean
biome check . — lint clean
Pre-push hooks (build, typecheck, lint, test, validate) — all pass
Manual: load old JSONL result files with eval_set field in Studio

🤖 Generated with Claude Code

Rename every occurrence of eval_set to dataset — core types, wire format, API endpoints, Studio UI labels, routes, and component names — to align with industry conventions (Braintrust, LangSmith, DeepEval all use "dataset"). Backward compatibility maintained: - JSONL parser/manifest reader accept both eval_set and dataset fields - Zod schema accepts both field names (eval_set as deprecated alias) - Pipeline bench/grade read manifest.dataset ?? manifest.eval_set - Trace stats CLI accepts --group-by eval-set as deprecated alias Studio UI changes: - "Categories" → "Datasets" in headings and labels - Route /runs/:runId/category/:category → /runs/:runId/dataset/:dataset - CategorySidebar → DatasetSidebar component rename - API endpoint /api/runs/:filename/categories → /datasets Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

cloudflare-workers-and-pages · 2026-03-28T12:53:47Z

Deploying agentv with Cloudflare Pages

Latest commit:	`b622b3f`
Status:	⚡️ Build in progress...

View logs

Rename "eval_set" to "dataset" in all 42 example baseline JSONL files under examples/features/ and examples/showcase/ so new users see the current field name. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

chore: update example JSONL baselines to use dataset field

b622b3f

Rename "eval_set" to "dataset" in all 42 example baseline JSONL files under examples/features/ and examples/showcase/ so new users see the current field name. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

christso merged commit 5f8f61c into main Mar 28, 2026
1 of 2 checks passed

christso deleted the 812-rename-eval-set-to-dataset branch March 28, 2026 13:03

christso mentioned this pull request Mar 28, 2026

chore: update example baselines and remove eval_set compat shims #815

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

refactor: rename eval_set to dataset across codebase#814

refactor: rename eval_set to dataset across codebase#814
christso merged 2 commits intomainfrom
812-rename-eval-set-to-dataset

christso commented Mar 28, 2026

Uh oh!

cloudflare-workers-and-pages bot commented Mar 28, 2026 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

christso commented Mar 28, 2026

Summary

Risk

Test plan

Uh oh!

cloudflare-workers-and-pages bot commented Mar 28, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Deploying agentv with Cloudflare Pages

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

cloudflare-workers-and-pages bot commented Mar 28, 2026 •

edited

Loading