feat(cli): add results report subcommand#1105
Merged
Conversation
Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
Move the static HTML report flow into a first-class results command that reads existing artifact workspaces, and remove the earlier skill-based implementation. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
Deploying agentv with
|
| Latest commit: |
fb74192
|
| Status: | ✅ Deploy successful! |
| Preview URL: | https://12ad2a2c.agentv.pages.dev |
| Branch Preview URL: | https://feat-1079-static-html-report.agentv.pages.dev |
Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
- Merge separate "Passed/Failed Assertions" cards into a single unified list with inline checkmark/cross icons per assertion - Add "Criteria" column to test cases table, extracted from input prompt field for human-readable test descriptions - Remove redundant "Grader Results" table from detail panel (info already visible in per-grader score columns) - Collapse Input/Output into a toggleable details element so assertions are the primary expanded content - Add single-page mode for small result sets (<=20 tests) that renders summary + test cases together without tab navigation - Add light/dark theme toggle with full CSS custom property support Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
- Sort imports alphabetically in report.ts - Fix biome formatting (long line wrapping) - Replace non-null assertion with type assertion in test Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Replace screenshots with realistic data showing the new single-page layout, unified assertions, criteria column, and collapsible I/O. Update feature descriptions to match the current report template. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Add align-items: center to .header-meta so chips don't stretch to match the theme toggle button height. Retake details screenshot with All status filter so pass rate (83.3%) is consistent with the overview. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Assertion badges now prefer the assertion's own type (e.g. "contains",
"regex") over the parent score name ("deterministic"). When input has
no prompt, criteria falls back to concatenated grader names + first
assertion text, truncated at 120 chars.
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Refs #1079
Summary
agentv results reportto generate a self-contained static HTML report from an existing run workspace orindex.jsonlValidation
bun test apps/cli/test/commands/results/report.test.ts apps/cli/test/commands/results/export.test.tsbun run buildresults-report.htmlartifact usingagent-browser