feat(dashboard): per-model health comparison table by Sathvik-1007 · Pull Request #2823 · tinyhumansai/openhuman

Sathvik-1007 · 2026-05-28T07:19:41Z

Summary

Implements #1852 — per-model health comparison table in the dashboard.

Adds a live model health comparison panel that benchmarks every model currently in use across quality score, hallucination rate, cost per million tokens, and a recommended action — giving operators a single-pane view to identify underperforming models, spot cost savings, and make informed swap decisions.

Changes

Backend (Rust):

GET /models/health — authenticated endpoint returning model registry + telemetry aggregation + config thresholds
ModelHealthConfig — config schema for dashboard.model_health.{enabled, hallucination_threshold, min_tasks_for_rating, evaluation_window_tasks}
ModelRegistryEntry — config schema for model_registry[] array with id, provider, cost_per_1m_output, vision
Respects enabled: false (returns 404)

Frontend (React):

ModelHealthPanel.tsx — Settings > Developer Options panel
Sortable table (all columns: model, quality, halluc rate, cost, agents, status)
Status badge logic: Keep / Replace / Staging test / Vision only
Replacement candidate highlighting with Cheaper/Better tags
One-click swap modal with candidate list
Status filter dropdown
Config-driven thresholds from server response

i18n: 21 keys across en.ts, en-5.ts, and all 12 locale chunk files.

Acceptance criteria

Configuration

dashboard:
  model_health:
    enabled: true
    hallucination_threshold: 0.10
    min_tasks_for_rating: 10
    evaluation_window_tasks: 50

model_registry:
  - id: deepseek-v3.2
    provider: SiliconFlow
    cost_per_1m_output: 0.33
    vision: false
  - id: qwen-2.5-8b
    provider: OpenRouter
    cost_per_1m_output: 0.09
    vision: true

Closes #1852

Summary by CodeRabbit

New Features
- Added Model Health settings panel to monitor model performance metrics including quality scores, hallucination rates, operational costs, and agent usage counts
- Multi-column sorting and status-based filtering for easy model analysis
- Model swap recommendations showing performance comparisons between eligible replacement candidates
- Localization support for 12+ languages

GET /models/health SSE endpoint + ModelHealthPanel. Sortable table, status badges (Keep/Replace/Staging/Vision), replacement candidate highlighting, one-click swap modal. Config-driven thresholds from dashboard.model_health. i18n: 21 keys across 14 locale files. Closes tinyhumansai#1852

coderabbitai · 2026-05-28T07:20:00Z

Caution

Review failed

The pull request is closed.

Note

Reviews paused

It looks like this branch is under active development. To avoid overwhelming you with review comments due to an influx of new commits, CodeRabbit has automatically paused this review. You can configure this behavior by changing the reviews.auto_review.auto_pause_after_reviewed_commits setting.

Use the following commands to manage reviews:

@coderabbitai resume to resume automatic reviews.
@coderabbitai review to trigger a single review.

Use the checkboxes below for quick actions:

▶️ Resume reviews
🔍 Trigger review

ℹ️ Recent review info

⚙️ Run configuration

Configuration used: Organization UI

Review profile: CHILL

Plan: Pro

Run ID: 8d365930-ade8-4ef6-9698-a3a99fdec1ab

📥 Commits

Reviewing files that changed from the base of the PR and between 5a39c58 and 430f220.

⛔ Files ignored due to path filters (1)

app/src-tauri/Cargo.lock is excluded by !**/*.lock

📒 Files selected for processing (8)

app/src/components/settings/panels/ModelHealthPanel.tsx
app/src/components/settings/panels/__tests__/ModelHealthPanel.test.tsx
src/core/all.rs
src/openhuman/dashboard/mod.rs
src/openhuman/dashboard/ops.rs
src/openhuman/dashboard/schemas.rs
src/openhuman/dashboard/types.rs
src/openhuman/mod.rs

📝 Walkthrough

Walkthrough

This PR adds a complete model health monitoring feature to OpenHuman's dashboard. It introduces backend configuration schema, a new RPC operation for model health comparison, a React settings panel with sorting/filtering/status badges and candidate swap modal, menu routing, and localized UI strings across 12 languages.

Changes

Model Health Dashboard Feature

Layer / File(s)	Summary
Backend configuration schema and data types `src/openhuman/config/schema/dashboard.rs`, `src/openhuman/config/schema/mod.rs`, `src/openhuman/config/schema/types.rs`, `src/openhuman/dashboard/types.rs`, `src/openhuman/mod.rs`	Defines `DashboardConfig`, `EventStreamConfig`, `ModelHealthConfig` with Serde defaults, and `ModelRegistryEntry` for model metadata; extends `Config` with `dashboard` and `model_registry` fields; defines Rust and TypeScript data types for model health RPC responses.
Backend RPC operation and schema handler `src/openhuman/dashboard/mod.rs`, `src/openhuman/dashboard/ops.rs`, `src/openhuman/dashboard/schemas.rs`, `src/core/all.rs`	Implements `model_health` operation that maps model registry entries to health responses with placeholder telemetry; defines RPC handler, schema registration, and helper functions; integrates operation and schemas into core RPC dispatch system; includes tests for feature flag enforcement, empty registries, and config thresholds.
Frontend ModelHealthPanel component and tests `app/src/components/settings/panels/ModelHealthPanel.tsx`, `app/src/components/settings/panels/__tests__/ModelHealthPanel.test.tsx`	React component fetches model health via RPC, computes status badges (keep/replace/staging/vision) using config thresholds, renders sortable and filterable table with metrics, status badges, and conditional swap button for candidate replacement; includes modal overlay for candidate selection; comprehensive test suite covers table rendering, badges, filtering, sorting, swap modal workflow, and RPC envelope handling.
Frontend routing, menu integration, and localization `app/src/pages/Settings.tsx`, `app/src/components/settings/panels/DeveloperOptionsPanel.tsx`, `app/src/lib/i18n/en.ts`, `app/src/lib/i18n/chunks/{ar,bn,de,es,fr,hi,id,it,ko,pt,ru,zh-CN}-5.ts`	Adds `model-health` route in Settings with wider layout (max-w-4xl) and registers Model Health menu entry in DeveloperOptionsPanel with localized title/description keys and SVG icon; adds `settings.modelHealth` translation keys across 12 language chunks covering section copy, table column headers, status badge labels, modal text, and comparison tags.

Sequence Diagram(s)

sequenceDiagram
  participant User
  participant Panel as ModelHealthPanel
  participant RPC as RPC Client
  participant Backend as model_health Operation
  
  User->>Panel: View Model Health panel
  Panel->>RPC: callCoreRpc(openhuman.dashboard_model_health)
  RPC->>Backend: model_health(config)
  Backend->>Backend: Map registry to entries
  Backend->>Backend: Compute status for each
  Backend->>RPC: Return RpcOutcome{models, config}
  RPC->>Panel: Unwrap envelope/result
  Panel->>Panel: Render table, sort, filter
  User->>Panel: Click status dropdown
  Panel->>Panel: Filter rows by getStatus()
  User->>Panel: Click column header
  Panel->>Panel: Sort by column
  User->>Panel: Click swap on replace row
  Panel->>Panel: Open modal with candidates
  User->>Panel: Select candidate + Apply
  Panel->>Panel: Log intent, close modal

Estimated code review effort

🎯 3 (Moderate) | ⏱️ ~25 minutes

Possibly related PRs

tinyhumansai/openhuman#2250: Refactors DeveloperOptionsPanel to use i18n titleKey/descriptionKey lookups, which this PR then uses for the new model-health menu entry.
tinyhumansai/openhuman#2258: Modifies DeveloperOptionsPanel's menu item structure to support keyed i18n rendering, enabling the model-health menu addition in this PR.

Suggested reviewers

graycyrus
senamakel

Poem

🐰 A rabbit hops through models with glee,
Comparing health metrics with clarity,
Status badges bright, swap candidates light,
Tables that sort and filter just right—
The dashboard now sees what the models can be! ✨

🚥 Pre-merge checks | ✅ 5

✅ Passed checks (5 passed)

Check name	Status	Explanation
Description Check	✅ Passed	Check skipped - CodeRabbit’s high-level summary is enabled.
Title check	✅ Passed	The title 'feat(dashboard): per-model health comparison table' directly and concisely describes the main feature added: a new model health comparison table for the dashboard.
Linked Issues check	✅ Passed	The PR implements all key coding requirements from `#1852`: sortable/filterable model table, status badge logic, replacement candidate highlighting, swap modal integration, config-driven thresholds, local data sourcing, and telemetry/registry integration.
Out of Scope Changes check	✅ Passed	All changes directly support the model health feature: backend endpoints and config schemas, frontend panel components and routing, i18n translations, and test coverage.
Docstring Coverage	✅ Passed	Docstring coverage is 81.25% which is sufficient. The required threshold is 80.00%.

_{✏️ Tip: You can configure your own custom pre-merge checks in the settings.}

Warning

There were issues while running some tools. Please review the errors and either fix the tool's configuration or disable the tool if it's a critical failure.

🔧 ESLint

If the error stems from missing dependencies, add them to the package.json file. For unrecoverable errors (e.g., due to private dependencies), disable the tool in the CodeRabbit configuration.

ESLint skipped: no ESLint configuration detected in root package.json. To enable, add eslint to devDependencies.

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

coderabbitai

Actionable comments posted: 7

🤖 Prompt for all review comments with AI agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

Inline comments:
In `@app/src/components/settings/panels/__tests__/ModelHealthPanel.test.tsx`:
- Around line 99-108: The test "sorts by column" in ModelHealthPanel currently
clicks the column header but has no post-condition; update the test to verify
the sorted result by asserting a measurable change after calling fireEvent.click
on the 'settings.modelHealth.col.cost' header: for example, render
ModelHealthPanel (using the existing MOCK_RESPONSE), trigger the click, then
query the table rows (using screen.getAllByRole('row') or text queries like
screen.getAllByText for the cost cells) and assert that their order matches the
expected sorted order (ascending or descending) or that a sort indicator/ARIA
attribute on the header changed; modify the test to include these expect
assertions to validate sorting behavior.

In `@app/src/components/settings/panels/ModelHealthPanel.tsx`:
- Around line 311-315: The Apply Replacement button currently only calls
setSwapTarget(null) and doesn't run the swap flow; change its onClick to invoke
the replacement handler with the current swapTarget (e.g., call the existing
swap/apply function such as handleApplyReplacement or
performReplacement(swapTarget)), await/handle the result (show loading/disable
while in progress and show errors on failure), and then clear the swap state
(setSwapTarget(null)) and close the modal; ensure you reference the swapTarget
state and the replacement handler function when making the change.
- Around line 92-94: Replace the direct fetch call in ModelHealthPanel (the
await fetch(`${baseUrl}/models/health`, { headers: { Authorization: `Bearer
${token}` } })) with an in-process core RPC relay invocation: call
invoke('core_rpc_relay', ...) supplying the HTTP method, path "/models/health",
and the Authorization header (using the existing token), then await and parse
the returned RPC response the same way you parsed res before; in short, remove
the raw fetch and use invoke('core_rpc_relay', { method: 'GET', path:
'/models/health', headers: { Authorization: `Bearer ${token}` } }) and adapt the
handling where res is used.
- Around line 298-300: Replace the hard-coded labels 'CHEAPER' and 'BETTER' in
the ModelHealthPanel JSX (the span that renders {c.cost_per_1m_output <
swapTarget.cost_per_1m_output ? 'CHEAPER' : 'BETTER'}) with translated strings
via useT(); import/use the hook in the component (e.g., const t = useT()), add
translation keys like t('modelHealth.cheaper') and t('modelHealth.better') (or
existing appropriate keys) and use those in the ternary expression so the UI
strings go through the localization system.

In `@src/core/jsonrpc.rs`:
- Around line 865-866: The JSON-RPC transport currently registers domain routes
directly (.route("/models/health", get(model_health_handler)) and .route("/rpc",
post(rpc_handler))) — move this wiring into the controller registry flow: remove
the domain-specific route registrations from jsonrpc.rs and instead expose the
handlers via the controller registry (add the appropriate schema and handler
registration in schemas.rs and register the handlers in src/core/all.rs so
jsonrpc.rs simply mounts the controller registry endpoint); ensure
model_health_handler and rpc_handler are provided through the registry API so
jsonrpc.rs delegates to the registry rather than containing branch logic.
- Around line 1221-1234: The model list currently emits placeholder metrics;
update the mapping over cfg.model_registry (the closure that builds models Vec)
to look up real per-model metrics instead of hardcoded nulls/zeros: query your
metrics source (e.g., a model metrics collection or method on cfg such as
cfg.model_metrics.get(&entry.id) or cfg.get_model_stats(entry.id)) and populate
"quality_score" and "hallucination_rate" with the retrieved numeric values (or
serde_json::Value::Null if missing) and "agents_using" and "tasks_evaluated"
with the actual counts (or 0 default). Ensure you convert types to
serde_json::Value where needed and fall back to the previous defaults when
metrics are absent.
- Around line 1209-1212: The current code calls
load_config_with_timeout().await.unwrap_or_default(), which hides real load
failures by returning synthetic defaults; change this to explicitly handle the
Result from crate::openhuman::config::rpc::load_config_with_timeout() (e.g.,
match or ? operator) so that errors are logged and propagated instead of being
silently converted to defaults; replace the unwrap_or_default usage around cfg
(and subsequent use of mh_cfg) with error handling that returns or logs an error
and aborts initialization when config loading fails.

🪄 Autofix (Beta)

Fix all unresolved CodeRabbit comments on this PR:

Push a commit to this branch (recommended)
Create a new PR with the fixes

ℹ️ Review info

⚙️ Run configuration

Configuration used: Organization UI

Review profile: CHILL

Plan: Pro

Run ID: d25fe977-e5a3-4b44-b08b-e141c3db8d13

📥 Commits

Reviewing files that changed from the base of the PR and between 145e768 and f39a252.

📒 Files selected for processing (22)

app/src/components/settings/panels/DeveloperOptionsPanel.tsx
app/src/components/settings/panels/ModelHealthPanel.tsx
app/src/components/settings/panels/__tests__/ModelHealthPanel.test.tsx
app/src/lib/i18n/chunks/ar-5.ts
app/src/lib/i18n/chunks/bn-5.ts
app/src/lib/i18n/chunks/de-5.ts
app/src/lib/i18n/chunks/en-5.ts
app/src/lib/i18n/chunks/es-5.ts
app/src/lib/i18n/chunks/fr-5.ts
app/src/lib/i18n/chunks/hi-5.ts
app/src/lib/i18n/chunks/id-5.ts
app/src/lib/i18n/chunks/it-5.ts
app/src/lib/i18n/chunks/ko-5.ts
app/src/lib/i18n/chunks/pt-5.ts
app/src/lib/i18n/chunks/ru-5.ts
app/src/lib/i18n/chunks/zh-CN-5.ts
app/src/lib/i18n/en.ts
app/src/pages/Settings.tsx
src/core/jsonrpc.rs
src/openhuman/config/schema/dashboard.rs
src/openhuman/config/schema/mod.rs
src/openhuman/config/schema/types.rs

coderabbitai · 2026-05-28T07:25:53Z

+              <button
+                type="button"
+                className="flex-1 py-2 rounded-lg bg-blue-600 text-white text-xs font-semibold"
+                onClick={() => setSwapTarget(null)}>
+                {t('settings.modelHealth.modal.apply')}


⚠️ Potential issue | 🟠 Major | 🏗️ Heavy lift

Apply Replacement is currently a no-op.

Line 314 closes the modal but does not execute any swap flow, so the primary action does not perform what its label promises.

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the rest with a brief reason, keep changes minimal, and validate. In `@app/src/components/settings/panels/ModelHealthPanel.tsx` around lines 311 - 315, The Apply Replacement button currently only calls setSwapTarget(null) and doesn't run the swap flow; change its onClick to invoke the replacement handler with the current swapTarget (e.g., call the existing swap/apply function such as handleApplyReplacement or performReplacement(swapTarget)), await/handle the result (show loading/disable while in progress and show errors on failure), and then clear the swap state (setSwapTarget(null)) and close the modal; ensure you reference the swapTarget state and the replacement handler function when making the change.

coderabbitai · 2026-05-28T07:25:53Z

+    let models: Vec<serde_json::Value> = cfg
+        .model_registry
+        .iter()
+        .map(|entry| {
+            json!({
+                "id": entry.id,
+                "provider": entry.provider,
+                "cost_per_1m_output": entry.cost_per_1m_output,
+                "vision": entry.vision,
+                "quality_score": serde_json::Value::Null,
+                "hallucination_rate": serde_json::Value::Null,
+                "agents_using": 0,
+                "tasks_evaluated": 0,
+            })


⚠️ Potential issue | 🟠 Major | 🏗️ Heavy lift

The endpoint currently returns placeholder metrics instead of health data.

Lines 1230-1234 always emit quality_score: null, hallucination_rate: null, agents_using: 0, and tasks_evaluated: 0. That prevents meaningful per-model comparison/status decisions from backend data.

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the rest with a brief reason, keep changes minimal, and validate. In `@src/core/jsonrpc.rs` around lines 1221 - 1234, The model list currently emits placeholder metrics; update the mapping over cfg.model_registry (the closure that builds models Vec) to look up real per-model metrics instead of hardcoded nulls/zeros: query your metrics source (e.g., a model metrics collection or method on cfg such as cfg.model_metrics.get(&entry.id) or cfg.get_model_stats(entry.id)) and populate "quality_score" and "hallucination_rate" with the retrieved numeric values (or serde_json::Value::Null if missing) and "agents_using" and "tasks_evaluated" with the actual counts (or 0 default). Ensure you convert types to serde_json::Value where needed and fall back to the previous defaults when metrics are absent.

graycyrus

@Sathvik-1007 heads up — several CI jobs are still pending (Build Tauri App, Frontend Unit Tests, Rust Core Tests, E2E suites, coverage), so i'll hold off on a full sign-off until those complete. i did spot a couple things while reading through that CodeRabbit didn't catch:

No candidate selection in the swap modal — the candidates list in the modal renders each item as a plain div with no onClick. There's no state to track which candidate the operator picked. So even once the "Apply Replacement" button is wired up (CodeRabbit flagged it as a no-op), it still won't know which candidate to apply. The modal needs a selected-candidate state and each candidate card needs an onClick to set it.
replaceCandidates filter makes the BETTER label unreachable in practice — the filter requires c.cost_per_1m_output <= target.cost_per_1m_output, but the label BETTER is shown when c.cost_per_1m_output >= target.cost_per_1m_output. That means BETTER only renders when costs are exactly equal — a float equality edge case that will almost never happen in real config data. A model that's genuinely better quality at a higher cost will never appear as a candidate at all. If the intent is "better quality even at higher cost is worth showing", the filter and labelling logic need to be decoupled.

Fix the CI/CodeRabbit items first and i'll do a proper review after. Let me know if anything is unclear.

coderabbitai

Actionable comments posted: 2

🤖 Prompt for all review comments with AI agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

Inline comments:
In `@app/src/components/settings/panels/ModelHealthPanel.tsx`:
- Around line 139-140: The replacement-candidate predicate only checks
hallucination_rate and omits the cost ceiling, allowing more expensive
candidates; update the filter used where variables c and target are compared
(the replacement candidate selection) to require both lower hallucination and
equal-or-lower cost by adding a cost comparison (e.g. require (c.cost ??
Infinity) <= (target.cost ?? Infinity)) alongside the existing hallucination
comparison so candidates meet "lower hallucination and equal-or-lower cost"
before being accepted.
- Around line 289-293: The candidate items currently render as clickable divs
which are not keyboard-focusable; update the clickable element in
ModelHealthPanel (the element using setSelectedCandidate and comparing
selectedCandidate?.id) to be a semantic interactive control (e.g., a button) or
a div with proper accessibility attributes (role="button", tabIndex={0}) plus
key event handling for Enter/Space that calls setSelectedCandidate(c); ensure
focus and the selected styling logic still use selectedCandidate?.id so keyboard
users can select candidates and enable Apply.

🪄 Autofix (Beta)

Fix all unresolved CodeRabbit comments on this PR:

Push a commit to this branch (recommended)
Create a new PR with the fixes

ℹ️ Review info

⚙️ Run configuration

Configuration used: Organization UI

Review profile: CHILL

Plan: Pro

Run ID: da832573-80d1-4e58-8ee1-d5ed19023f1e

📥 Commits

Reviewing files that changed from the base of the PR and between 1163e72 and 5a39c58.

📒 Files selected for processing (1)

app/src/components/settings/panels/ModelHealthPanel.tsx

graycyrus

@Sathvik-1007 the two issues I flagged in the last review are both addressed — candidate selection is wired up and the cheaper/better filter split is correct. Good fixes.

Holding off on approval: CI still has pending jobs (Frontend Unit Tests, Rust Core Tests, coverage, E2E Appium runs), and there are open CodeRabbit threads that need resolution before this is ready. Once those are green and addressed I'll approve.

Replaces the bespoke `/models/health` HTTP route + handler in `src/core/jsonrpc.rs` with a controller-registry-backed RPC method `openhuman.dashboard_model_health` living in a new `src/openhuman/dashboard/` domain (mod.rs / ops.rs / schemas.rs / types.rs). The Rust core no longer has domain logic in the transport layer; CLI + JSON-RPC pick it up via the standard `RegisteredController` flow, with placeholder telemetry metrics documented as a follow-up contract. Frontend `ModelHealthPanel` is switched from raw `fetch` against the HTTP route to `callCoreRpc({ method: 'openhuman.dashboard_model_health' })`, unwrapping the `RpcOutcome` `{result, logs}` envelope. Modal candidate cards now use a semantic `<button role="radio">` instead of clickable `<div>`s, and the "Apply Replacement" button records the operator's swap intent via the debug logger (the backend agent → model rewire is a follow-up). Addresses CodeRabbit review feedback on PR tinyhumansai#2823 — controller-registry placement, in-process RPC over raw `fetch`, semantic interactive controls, and Apply-button no-op surfaced as a logged intent.

sanil-23 · 2026-05-28T14:19:22Z

Addressing the open CodeRabbit threads in commit 430f220:

Controller-registry path / placeholder metrics / unwrap_or_default() (src/core/jsonrpc.rs lines ~865 + 1190-1248)

The bespoke /models/health HTTP route and model_health_handler are removed from the transport layer.
New domain at src/openhuman/dashboard/ (mod.rs / ops.rs / schemas.rs / types.rs) exposes the data as openhuman.dashboard_model_health through the standard RegisteredController flow, wired up in src/core/all.rs. Per-domain unit tests cover the registry + ops + disabled-feature paths.
Config-load failure no longer collapses to a synthetic default — the handler returns Err("config unavailable: ...") (surfaced as a JSON-RPC error) and the warning is logged.
Telemetry metric fields (quality_score, hallucination_rate, agents_using, tasks_evaluated) remain placeholders (null/0) — the wiring to a local telemetry sink is out of scope for this PR; the contract is documented at the module level (src/openhuman/dashboard/mod.rs + ops.rs). The frontend treats null quality/hallucination as 'no signal' so the table is still useful for cost/vision comparison until the sink lands.

Use core RPC relay instead of direct fetch (ModelHealthPanel.tsx lines 93-95)

The raw fetch(${baseUrl}/models/health, ...) is replaced with callCoreRpc<RpcModelHealthPayload>({ method: 'openhuman.dashboard_model_health' }), which goes through the canonical JSON-RPC path. The component unwraps the {result, logs} envelope (forward-compatible with bare responses) and logs entry/exit via the openhuman:model-health debug namespace.

Apply Replacement no-op (ModelHealthPanel.tsx lines 311-315 / 336-339)

Apply remains UI-side (the backend agent → model rewire RPC is a follow-up, not part of this PR's scope), but it now records the operator's swap intent via debug('openhuman:model-health') so the action is observable in support logs and isn't silently dropped. Apply stays disabled={!selectedCandidate}. Cancel + selecting a candidate are both covered by new unit tests.

Use a semantic control for candidate cards (ModelHealthPanel.tsx lines 314-332)

The clickable <div>s are replaced with <button type="button" role="radio" aria-checked={isSelected}> inside a <div role="radiogroup">. Selection still highlights the chosen card and gates Apply.

sorts by column test post-condition (ModelHealthPanel.test.tsx line ~99)

Already had expect(rows[1].textContent).toContain('qwen-2.5-8b') asserting cheapest-first ordering. Also added a complementary toggle-direction test (expect(rows[1]).toContain('bad-model') for desc) and a radio-selection / Apply-disabled-until-selected test.

All 14 ModelHealthPanel tests pass (pnpm debug unit src/components/settings/panels/__tests__/ModelHealthPanel.test.tsx) and the 6 new openhuman::dashboard::* Rust tests pass under cargo test. Settings panels suite-wide: 394/394 green; full Vitest run: 3664/3664 green.

graycyrus

@Sathvik-1007 hey! the code looks good to me — the refactor to the controller registry is clean and all previous feedback has been addressed. two CI jobs are failing but both are infra flakes unrelated to your changes:

Rust Quality: runner failed to download tower/0.5.3 from crates.io (network error on the runner, not a clippy issue)
Rust Core Coverage: runner ran out of disk space

please retrigger those two jobs and once they're green i'll come back and approve. nothing else blocking on my end.

for context on the remaining open CodeRabbit thread about the no-op Apply button — the inline comment in the code makes the intent clear, which is fine for now as a documented follow-up. the placeholder metrics thread is outdated (points to old jsonrpc.rs that no longer exists).

Sathvik-1007 · 2026-05-28T14:49:12Z

@sanil-23 , Thank you for your extra contribution and for saving me a lot of time.

sanil-23

Verified the controller-registry refactor (pnpm compile, ESLint, Vitest 3664/3664, ModelHealthPanel 14/14, cargo check/test core + Tauri shell). Telemetry sink + agent→model rewire are noted follow-ups. All CodeRabbit threads bot-confirmed addressed. LGTM.

Clean

sanil-23 · 2026-05-28T15:06:26Z

@coderabbitai review

coderabbitai · 2026-05-28T15:06:35Z

✅ Actions performed

Review triggered.

Note: CodeRabbit is an incremental review system and does not re-review already reviewed commits. This command is applicable only when automatic reviews are paused.

Sathvik-1007 requested a review from a team May 28, 2026 07:19

coderabbitai Bot added feature Net-new user-facing capability or product behavior. rust-core Core Rust runtime in src/: CLI, core_server, shared infrastructure. labels May 28, 2026

coderabbitai Bot requested changes May 28, 2026

View reviewed changes

style: prettier Settings.tsx

0a6c884

coderabbitai Bot added the working A PR that is being worked on by the team. label May 28, 2026

graycyrus reviewed May 28, 2026

View reviewed changes

Comment thread app/src/components/settings/panels/ModelHealthPanel.tsx

Comment thread app/src/components/settings/panels/ModelHealthPanel.tsx

Sathvik-1007 added 2 commits May 28, 2026 14:08

fix: review — localize labels, error cfg, test assert

1163e72

fix: selectable candidates, broader filter

5a39c58

coderabbitai Bot previously requested changes May 28, 2026

View reviewed changes

Comment thread app/src/components/settings/panels/ModelHealthPanel.tsx Outdated

Comment thread app/src/components/settings/panels/ModelHealthPanel.tsx Outdated

Sathvik-1007 added 4 commits May 28, 2026 14:20

fix: restore cost ceiling, split cheaper/better

03ed8f4

test: cover missing lines for 80% gate

e5ed831

ci: retrigger windows

212ccfd

ci: retrigger flaky

3932641

graycyrus reviewed May 28, 2026

View reviewed changes

Merge branch 'main' into pr/2823

bf4b6a4

sanil-23 self-assigned this May 28, 2026

sanil-23 requested a review from graycyrus May 28, 2026 14:23

graycyrus reviewed May 28, 2026

View reviewed changes

sanil-23 approved these changes May 28, 2026

View reviewed changes

graycyrus approved these changes May 28, 2026

View reviewed changes

graycyrus merged commit fdd2219 into tinyhumansai:main May 28, 2026
33 of 36 checks passed

This was referenced May 28, 2026

feat(events): live domain event stream log panel #2653

Open

feat(intelligence): add architecture diagram viewer #2687

Open

Conversation

Sathvik-1007 commented May 28, 2026 • edited by coderabbitai Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Changes

Acceptance criteria

Configuration

Summary by CodeRabbit

Uh oh!

coderabbitai Bot commented May 28, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Review failed

Reviews paused

Walkthrough

Changes

Sequence Diagram(s)

Estimated code review effort

Possibly related PRs

Suggested reviewers

Poem

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

coderabbitai Bot May 28, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

coderabbitai Bot May 28, 2026

Choose a reason for hiding this comment

Uh oh!

graycyrus left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

graycyrus left a comment

Choose a reason for hiding this comment

Uh oh!

sanil-23 commented May 28, 2026

Uh oh!

graycyrus left a comment

Choose a reason for hiding this comment

Uh oh!

Sathvik-1007 commented May 28, 2026

Uh oh!

sanil-23 left a comment

Choose a reason for hiding this comment

Uh oh!

sanil-23 commented May 28, 2026

Uh oh!

coderabbitai Bot commented May 28, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Sathvik-1007 commented May 28, 2026 •

edited by coderabbitai Bot

Loading

coderabbitai Bot commented May 28, 2026 •

edited

Loading