Skip to content

feat(cli): add results report subcommand#1105

Merged
christso merged 16 commits intomainfrom
feat/1079-static-html-report-skill
Apr 15, 2026
Merged

feat(cli): add results report subcommand#1105
christso merged 16 commits intomainfrom
feat/1079-static-html-report-skill

Conversation

@christso
Copy link
Copy Markdown
Collaborator

@christso christso commented Apr 15, 2026

Refs #1079

Summary

  • add agentv results report to generate a self-contained static HTML report from an existing run workspace or index.jsonl
  • move the Studio-themed report template into the CLI and remove the earlier skill-based exporter draft
  • add results-report coverage for default output paths, benchmark metadata fallback, grouped eval rendering, and inline script parseability

Validation

  • bun test apps/cli/test/commands/results/report.test.ts apps/cli/test/commands/results/export.test.ts
  • bun run build
  • visual browser E2E against a generated results-report.html artifact using agent-browser

christso and others added 2 commits April 15, 2026 01:22
Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
Move the static HTML report flow into a first-class results command that reads existing artifact workspaces, and remove the earlier skill-based implementation.

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
@cloudflare-workers-and-pages
Copy link
Copy Markdown

cloudflare-workers-and-pages bot commented Apr 15, 2026

Deploying agentv with  Cloudflare Pages  Cloudflare Pages

Latest commit: fb74192
Status: ✅  Deploy successful!
Preview URL: https://12ad2a2c.agentv.pages.dev
Branch Preview URL: https://feat-1079-static-html-report.agentv.pages.dev

View logs

@christso christso changed the title feat(skill): add static html report export skill feat(cli): add results report subcommand Apr 15, 2026
christso and others added 14 commits April 15, 2026 02:13
Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
- Merge separate "Passed/Failed Assertions" cards into a single
  unified list with inline checkmark/cross icons per assertion
- Add "Criteria" column to test cases table, extracted from input
  prompt field for human-readable test descriptions
- Remove redundant "Grader Results" table from detail panel (info
  already visible in per-grader score columns)
- Collapse Input/Output into a toggleable details element so
  assertions are the primary expanded content
- Add single-page mode for small result sets (<=20 tests) that
  renders summary + test cases together without tab navigation
- Add light/dark theme toggle with full CSS custom property support

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
- Sort imports alphabetically in report.ts
- Fix biome formatting (long line wrapping)
- Replace non-null assertion with type assertion in test

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Replace screenshots with realistic data showing the new single-page
layout, unified assertions, criteria column, and collapsible I/O.
Update feature descriptions to match the current report template.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Add align-items: center to .header-meta so chips don't stretch to
match the theme toggle button height. Retake details screenshot with
All status filter so pass rate (83.3%) is consistent with the overview.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Assertion badges now prefer the assertion's own type (e.g. "contains",
"regex") over the parent score name ("deterministic"). When input has
no prompt, criteria falls back to concatenated grader names + first
assertion text, truncated at 120 chars.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
@christso christso marked this pull request as ready for review April 15, 2026 05:32
@christso christso merged commit 770bd0f into main Apr 15, 2026
4 checks passed
@christso christso deleted the feat/1079-static-html-report-skill branch April 15, 2026 05:32
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant