fix(benchmark): scenario status in run view and some additional tweaks #98

Merged
ross-rl merged 2 commits into main from ross/b2 on Feb 5, 2026

ross-rl commented Feb 4, 2026

Description

Summary of Pending Changes (3 files modified)

  1. src/components/StatusBadge.tsx (Minor change)
  • Changed "COMPLETED" to "COMPLETE" for consistency
  • Updated label from "Completed" to "Complete"
  2. src/screens/BenchmarkJobDetailScreen.tsx (+59 lines)

Main feature: Fetch actual benchmark run names instead of showing IDs

  • Added import for getBenchmarkRun service
  • Added runNames state to cache fetched run names
  • Added React effect that:
    • Collects all run IDs from outcomes and in-progress runs
    • Fetches full run details in parallel using getBenchmarkRun()
    • Extracts names (falls back to ID if name is null)
    • Stores in a Map for fast lookup
  • Updated run display logic to use the fetched names:
    • Old: agent names or "Unknown Agent"
    • New: actual benchmark run names from the API
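
The name-fetching effect described above can be sketched roughly as follows. This is a minimal illustration, not the code from the PR: the `BenchmarkRun` shape, the `GetBenchmarkRun` signature, and the `fetchRunNames` helper are all hypothetical, and falling back to the ID when a fetch fails is an assumption (the description only mentions the fallback for a null name).

```typescript
// Hypothetical shape; the real API response type is not shown in this PR.
interface BenchmarkRun {
  id: string;
  name: string | null;
}

// Hypothetical signature for the getBenchmarkRun service mentioned above.
type GetBenchmarkRun = (id: string) => Promise<BenchmarkRun>;

// Hypothetical helper: resolves run IDs to display names.
async function fetchRunNames(
  runIds: string[],
  getBenchmarkRun: GetBenchmarkRun,
): Promise<Map<string, string>> {
  // De-duplicate so each run is fetched only once.
  const uniqueIds = Array.from(new Set(runIds));

  // Fetch all runs in parallel.
  const entries = await Promise.all(
    uniqueIds.map(async (id): Promise<[string, string]> => {
      try {
        const run = await getBenchmarkRun(id);
        // Fall back to the ID if the name is null.
        return [id, run.name ?? id];
      } catch {
        // Assumption: a failed fetch also falls back to the ID.
        return [id, id];
      }
    }),
  );

  // Store in a Map for fast lookup at render time.
  return new Map(entries);
}
```

In a React component, the resulting Map would typically be stored in state (the `runNames` state mentioned above) and read with `runNames.get(id) ?? id` when rendering each row.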
  3. src/screens/BenchmarkRunDetailScreen.tsx (+263 lines, -53 lines)

Major feature: Display scenario runs within benchmark run details

Added features:

  • Scenario runs state and loading
  • Overall status section that calculates aggregate status from scenario runs
    • Shows "Failed", "Complete", "In Progress", or "Not Started"
    • Displays scenario count
  • Scenario runs table with columns:
    • ID
    • Name
    • Status (with colors)
    • Score
  • Auto-refresh scenario runs every 5 seconds when run is active
  • Polling integration to refresh scenario runs alongside run details
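
As an illustration of the overall status section, the aggregate status could be derived from the scenario runs along these lines. This is a sketch under stated assumptions: the `ScenarioStatus` values, the `ScenarioRun` shape, and the precedence rule (any failure wins, then all complete, then any started) are hypothetical, since the description lists only the four resulting labels.

```typescript
// Hypothetical per-scenario status values; the real API values may differ.
type ScenarioStatus = "not_started" | "running" | "complete" | "failed";

// The four labels listed in the description above.
type OverallStatus = "Failed" | "Complete" | "In Progress" | "Not Started";

// Hypothetical shape mirroring the table columns (ID, Name, Status, Score).
interface ScenarioRun {
  id: string;
  name: string;
  status: ScenarioStatus;
  score: number | null;
}

// Assumed precedence: Failed > Complete > In Progress > Not Started.
function aggregateStatus(runs: ScenarioRun[]): OverallStatus {
  // No scenario runs yet means nothing has started.
  if (runs.length === 0) return "Not Started";
  // Any failed scenario marks the whole run as failed.
  if (runs.some((r) => r.status === "failed")) return "Failed";
  // Complete only when every scenario has completed.
  if (runs.every((r) => r.status === "complete")) return "Complete";
  // At least one scenario has started or finished, but not all are done.
  if (runs.some((r) => r.status !== "not_started")) return "In Progress";
  return "Not Started";
}
```

Keeping the calculation in a pure function means the same result can be recomputed on every polling refresh without extra state.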

Removed:

  • Environment Variables section (removed from both detail view and polling display)

Note: PR titles should follow Conventional Commits format (e.g., feat(devbox): add support for custom env vars or fix(snapshot): resolve pagination issue) as they are used for automatic release notes generation.

Type of Change

  • Bug fix (non-breaking change which fixes an issue)
  • New feature (non-breaking change which adds functionality)
  • Breaking change (fix or feature that would cause existing functionality to not work as expected)
  • Documentation update
  • Code refactoring
  • Performance improvement
  • Test updates

Related Issues

Closes #

Changes Made

Testing

  • I have tested locally
  • I have added/updated tests
  • All existing tests pass

Checklist

  • My code follows the code style of this project
  • I have performed a self-review of my own code
  • I have commented my code, particularly in hard-to-understand areas
  • I have updated the documentation accordingly
  • My changes generate no new warnings
  • I have added tests that prove my fix is effective or that my feature works
  • New and existing unit tests pass locally with my changes
  • Any dependent changes have been merged and published

Screenshots (if applicable)

Additional Notes

ross-rl changed the title from "fix(jobs): Scenario status in run view and some additional tweaks" to "fix(benchmark): Scenario status in run view and some additional tweaks" on Feb 4, 2026
ross-rl changed the title from "fix(benchmark): Scenario status in run view and some additional tweaks" to "fix(benchmark): scenario status in run view and some additional tweaks" on Feb 5, 2026
ross-rl merged commit ca77634 into main on Feb 5, 2026
15 of 18 checks passed
ross-rl deleted the ross/b2 branch on February 5, 2026 16:43
dines-rl pushed a commit that referenced this pull request Feb 10, 2026
🤖 I have created a release *beep* *boop*
---


## [1.9.0](v1.8.0...v1.9.0)
(2026-02-10)


### Features

* add benchmark job to cli [beta]
([#88](#88))
([f8759c2](f8759c2))
* add gateway support to rli
([#101](#101))
([441e888](441e888))
* adding links to detail pages IE: allowing you to view the source of a
devbox ([#112](#112))
([62fa6dc](62fa6dc))
* **devbox:** add tunnel to devbox create
([#99](#99))
([a3c1b7a](a3c1b7a))
* **snapshot:** snapshot prune command
([#104](#104))
([b3479fe](b3479fe))


### Bug Fixes

* **benchmark:** scenario status in run view and some additional tweaks
([#98](#98))
([ca77634](ca77634))
* **blueprint:** adding delete
([#111](#111))
([0658932](0658932))
* **blueprint:** handled blueprint queued state
([#100](#100))
([a77e558](a77e558))
* **devbox:** gateway config create bug
([#110](#110))
([6e7e8c4](6e7e8c4))
* **secret:** obscure secret value entry within tui
([#86](#86))
([8697e5c](8697e5c))
* upgrades a dependency with an override
([#94](#94))
([c7f9398](c7f9398))

---
This PR was generated with [Release
Please](https://github.com/googleapis/release-please). See
[documentation](https://github.com/googleapis/release-please#release-please).

Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>