Simplify evals dash cont. #1613

Merged
prathmeshpatel merged 1 commit into main from evals-ux-improves-8 on Mar 16, 2026


prathmeshpatel (Collaborator) commented Mar 14, 2026

TL;DR

Redesigned the CI suite overview page with a new accordion-style run view and consolidated hero stats section.

What changed?

  • Replaced the dual-view runs/test-cases toggle with a single accordion-style interface that shows runs with expandable test case details
  • Added a new SuiteHeroStats component that displays overall suite metrics, trends, and model performance in a compact card
  • Created RunAccordionView component that lists runs chronologically with collapsible test case results
  • Removed separate chart sections and consolidated them into the hero stats
  • Updated the run detail view to show model performance inline and group iterations by result status (failing/passing/pending)
  • Added breadcrumb navigation to test case detail view
  • Changed default sort order in run details from "test" to "result"
  • Simplified suite header by removing accuracy donut chart and delete buttons
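The grouping by result status and the new default sort order could look roughly like the sketch below. This is an illustration only: the `Iteration` shape and the names `groupByResult`, `sortIterations`, and `RESULT_ORDER` are hypothetical and are not taken from the PR's code.

```typescript
// Hypothetical shapes: the PR does not show the real iteration type.
type IterationResult = "failing" | "passing" | "pending";

interface Iteration {
  id: string;
  testName: string;
  result: IterationResult;
}

// Assumed section order, mirroring "failing/passing/pending" in the description.
const RESULT_ORDER: IterationResult[] = ["failing", "passing", "pending"];

// Bucket iterations into one list per result status, preserving input order.
function groupByResult(
  iterations: Iteration[],
): Record<IterationResult, Iteration[]> {
  const groups: Record<IterationResult, Iteration[]> = {
    failing: [],
    passing: [],
    pending: [],
  };
  for (const iteration of iterations) {
    groups[iteration.result].push(iteration);
  }
  return groups;
}

type SortKey = "test" | "result";

// Sort a copy of the list; "result" is the default, as in the updated run details.
function sortIterations(
  iterations: Iteration[],
  sortBy: SortKey = "result",
): Iteration[] {
  const copy = [...iterations];
  if (sortBy === "test") {
    return copy.sort((a, b) => a.testName.localeCompare(b.testName));
  }
  // Order by status first, then by test name within the same status.
  return copy.sort(
    (a, b) =>
      RESULT_ORDER.indexOf(a.result) - RESULT_ORDER.indexOf(b.result) ||
      a.testName.localeCompare(b.testName),
  );
}
```

Sorting by result first keeps failures at the top of the list, which matches the stated goal of making failures easy to find.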

How to test?

  1. Navigate to any CI suite overview page
  2. Verify the new hero stats card shows suite-level metrics with trend sparkline and model comparison
  3. Test expanding/collapsing runs in the accordion view
  4. Click on individual test cases within expanded runs to navigate to test detail
  5. Check that run detail view groups iterations by result status
  6. Verify breadcrumb navigation works in test case detail view
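For step 3, the expand/collapse behavior can be thought of as toggling a run ID in a set of expanded runs. The helper below is a minimal sketch under that assumption; `toggleExpanded` is a hypothetical name, and the actual RunAccordionView may track its state differently.

```typescript
// Hypothetical helper: returns a new set with `runId` toggled.
// The input set is left untouched so it can be used as immutable UI state.
function toggleExpanded(
  expanded: ReadonlySet<string>,
  runId: string,
): Set<string> {
  const next = new Set(expanded);
  if (next.has(runId)) {
    next.delete(runId); // collapse an already expanded run
  } else {
    next.add(runId); // expand a collapsed run
  }
  return next;
}
```

Expanding a run and then collapsing it again should leave no runs expanded, which is a quick property to check by hand or in a unit test.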

Why make this change?

This redesign provides a more streamlined and information-dense interface that reduces visual clutter while making it easier to quickly scan run results and drill down into specific failures. The accordion format allows users to see both high-level run status and detailed test results in a single view without switching between tabs.

chelojimenez (Contributor) commented Mar 14, 2026

Snyk checks have passed. No issues have been found so far.

Status | Scan Engine          | Critical | High | Medium | Low | Total (0)
       | Open Source Security | 0        | 0    | 0      | 0   | 0 issues
       | Code Security        | 0        | 0    | 0      | 0   | 0 issues


@railway-app railway-app bot temporarily deployed to triumphant-alignment / staging-prathmesh March 14, 2026 17:12 Inactive
@railway-app railway-app bot temporarily deployed to triumphant-alignment / staging March 14, 2026 18:34 Inactive
@railway-app railway-app bot temporarily deployed to triumphant-alignment / staging-marcelo March 14, 2026 19:55 Inactive
@railway-app railway-app bot temporarily deployed to triumphant-alignment / staging-nacho March 14, 2026 19:55 Inactive
@railway-app railway-app bot temporarily deployed to triumphant-alignment / staging-ignacio March 14, 2026 20:33 Inactive
@railway-app railway-app bot temporarily deployed to triumphant-alignment / staging-prathmesh March 14, 2026 21:57 Inactive
@railway-app railway-app bot temporarily deployed to triumphant-alignment / staging-prathmesh March 14, 2026 22:24 Inactive
@prathmeshpatel prathmeshpatel marked this pull request as ready for review March 16, 2026 08:47
@dosubot dosubot bot added the size:XXL (This PR changes 1000+ lines, ignoring generated files.) and enhancement (New feature or request) labels Mar 16, 2026
@prathmeshpatel prathmeshpatel force-pushed the evals-ux-improves-8 branch 2 times, most recently from 17a133c to 5c2d3bd March 16, 2026 09:12
Base automatically changed from evals-ux-improves-7 to main March 16, 2026 09:13
@prathmeshpatel prathmeshpatel merged commit a7889b2 into main Mar 16, 2026
3 of 6 checks passed
@prathmeshpatel prathmeshpatel deleted the evals-ux-improves-8 branch March 16, 2026 09:14
coderabbitai bot (Contributor) commented Mar 16, 2026

Caution

Review failed

The pull request is closed.

ℹ️ Recent review info
⚙️ Run configuration

Configuration used: Organization UI

Review profile: CHILL

Plan: Pro

Run ID: d99e89af-b633-4b1c-b291-98dbba2df361

📥 Commits

Reviewing files that changed from the base of the PR and between 64f6dab and 5cf87d5.

📒 Files selected for processing (10)
  • mcpjam-inspector/client/src/components/evals/ci-suite-detail.tsx
  • mcpjam-inspector/client/src/components/evals/run-accordion-view.tsx
  • mcpjam-inspector/client/src/components/evals/run-detail-view.tsx
  • mcpjam-inspector/client/src/components/evals/run-overview.tsx
  • mcpjam-inspector/client/src/components/evals/suite-header.tsx
  • mcpjam-inspector/client/src/components/evals/suite-hero-stats.tsx
  • mcpjam-inspector/client/src/components/evals/suite-iterations-view.tsx
  • mcpjam-inspector/client/src/components/evals/test-case-detail-view.tsx
  • mcpjam-inspector/client/src/components/evals/test-cases-overview.tsx
  • mcpjam-inspector/client/src/lib/ci-evals-router.ts

Walkthrough

This pull request restructures the evaluation suite detail interface by introducing two new components—SuiteHeroStats and RunAccordionView—to replace existing UI layouts. Chart visualizations are removed from RunOverview, RunDetailView, SuiteHeader, and TestCasesOverview. TestCaseDetailView gains navigation props (suiteName, onNavigateToSuite) to support suite-level context. RunDetailView adopts a sectioned iteration layout, and TestCaseDetailView refactors its statistics display. The default view mode for suite overview routes shifts from "runs" to "test-cases" in the router configuration. Lines changed: 938 total across 10 files.
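The walkthrough names two navigation props on TestCaseDetailView, `suiteName` and `onNavigateToSuite`. One way such props could drive a breadcrumb trail is sketched below. Only the two prop names come from the walkthrough; their types and optionality, the `BreadcrumbItem` shape, and the `buildBreadcrumbs` helper are assumptions made for illustration.

```typescript
// Hypothetical breadcrumb entry: a label plus an optional click handler.
interface BreadcrumbItem {
  label: string;
  onClick: (() => void) | undefined;
}

// Prop names come from the walkthrough; treating them as optional is an assumption.
interface TestCaseNavProps {
  suiteName?: string;
  onNavigateToSuite?: () => void;
}

// Build "Suite > Test case" when a suite name is known, else just the test case.
function buildBreadcrumbs(
  testCaseTitle: string,
  props: TestCaseNavProps,
): BreadcrumbItem[] {
  const items: BreadcrumbItem[] = [];
  if (props.suiteName) {
    // The suite crumb is clickable only if a navigation callback was supplied.
    items.push({ label: props.suiteName, onClick: props.onNavigateToSuite });
  }
  // The current page is the last crumb and is never clickable.
  items.push({ label: testCaseTitle, onClick: undefined });
  return items;
}
```

Keeping the trail as plain data makes it easy to verify without rendering the component.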



Labels

enhancement (New feature or request)
size:XXL (This PR changes 1000+ lines, ignoring generated files.)
