feat(studio): scaffold AgentV Studio SPA with dashboard feature parity#806
Merged
feat(studio): scaffold AgentV Studio SPA with dashboard feature parity#806
Conversation
Collaborator
Author
E2E Manual Verification — CompleteBoth remaining checklist items verified:
|
5 tasks
Add apps/studio/ as a new Vite + React 19 + TanStack Router workspace package with dark theme dashboard for reviewing evaluation results. - Scaffold studio with route tree (home, run detail, eval detail) - Add components: RunList, RunDetail, EvalDetail, ScoreBar, StatsCards, MonacoViewer, Sidebar, Layout - Add TanStack Query hooks for all API endpoints - Extend Hono API with /api/runs/:id/categories, /api/runs/:id/evals/:evalId, and /api/index endpoints - Serve studio SPA as static files from Hono when dist is available - Register `studio` as primary command, keep `serve` as hidden alias - Wire studio build into CLI tsup onSuccess to bundle dist/studio - Update tests to pass studioDir: false so inline HTML assertions pass Closes #805 Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
…re reasons Bugs fixed: - Invalid Date shown for runs with no tests (now shows "N/A") - Pass Rate vs Avg Score confusion (renamed to "Tests Passing" / "Mean Score" with tooltips) - No empty state for runs with 0 evals (now shows centered message) Features added: - Category breakdown section on run detail page (groups by eval_set, clickable filter) - Context-aware eval sidebar on eval detail pages (shows all evals with pass/fail indicators) - Failure reason section on Steps tab for failed evals (red-tinted, aggregates error sources) Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
…ailure reasons - Show category breakdown section even with 1 category (was hidden unless 2+ categories existed) - Serialize details objects properly in failure reason display instead of rendering [object Object] - Update ScoreEntry.details type to accept string | object
453eacd to
89b746c
Compare
40 tasks
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Summary
apps/studio/as a new Vite + React 19 + TanStack Router workspace package with dark theme dashboard for reviewing evaluation results/api/runs/:id/categories,/api/runs/:id/evals/:evalId,/api/indexagentv servetoagentv studio(keepingserveas a hidden alias for backward compatibility)agentv studioserves the SPAComponents
Test plan
studioDir: falseto preserve inline HTML assertions)bun run build)bun apps/cli/src/cli.ts studioserves the React SPA at localhost:3117bun apps/cli/src/cli.ts servestill works as aliasCloses #805
🤖 Generated with Claude Code