test: expand unit + e2e coverage from test-map (batch 3) by senamakel · Pull Request #1778 · tinyhumansai/openhuman

senamakel · 2026-05-15T03:31:39Z

Summary

Continues the test-map.md coverage push from #1724 (batch 1) and #1752 (batch 2). Adds 124 Rust unit tests plus 3 mega-flow E2E scenarios across 7 high-priority features picked from test-map.md.

What's covered

Rust unit tests (all passing locally):

File	Tests	Feature (test-map heading)
`src/openhuman/update/ops_tests.rs`	11	Core lifecycle RPCs expose version, orchestrated update
`src/openhuman/service/mock_tests.rs`	15	Desktop service lifecycle and reconnect gating
`src/openhuman/notifications/bus_tests.rs`	14	Core→shell DomainEvent notification bridge
`src/openhuman/notifications/store_tests.rs`	13	CEF/CDP provider notifications dedup
`src/openhuman/tokenjuice/text_tests.rs`	26	TokenJuice Rust port for tool-output compaction
`src/openhuman/agent/harness/session/transcript_tests.rs`	+4	Conversation history restore — thread-scoped lookup

E2E (app/test/e2e/specs/mega-flow.spec.ts, +200 lines):

Scenario 12: openhuman.update_version RPC contract — semver, target_triple, openhuman-core-<triple> asset prefix; no outbound mock request to update host.
Scenario 13: openhuman.notification_ingest dedup — same payload twice yields ≤1 entry on notification_list.
Scenario 14: openhuman.threads_create_new → threads_message_append → threads_messages_list roundtrip.

Items reported as invalid / out-of-scope

service.shutdown flush-window delay — lives in ShutdownSubscriber, not the RPC handler; existing inline tests cover the publish contract.
Frontend ServiceBlockingGate state transitions — TS/React component, no Rust counterpart.
Turn-state mirror persistence + startup-interruption marking — no Rust persistence struct; UI hydration concern.
service.* mock-backend lifecycle as an E2E scenario — env var read at process boot, can't flip mid-session; no service namespace in any controller schema either way.
Pick 7 (conversation threads RPCs) — already fully covered by an existing conversations/store_tests.rs (37+ tests) from a prior batch; mega-flow scenario 14 added as a thin smoke instead.

Submission Checklist

Tests added or updated (happy path + at least one failure / edge case)
N/A: pure test additions; no product code changes; coverage gate doesn't apply.
N/A: no feature rows added/removed.
N/A: no feature IDs.
No new external network dependencies introduced
N/A: no release-surface change.
N/A: no linked issue.

Impact

Test-only PR. No runtime behavior change. Adds 124 Rust unit tests (all green via cargo test --lib) and 3 new mega-flow E2E scenarios in the existing single-session spec.

AI Authored PR Metadata

Linear Issue

Key: N/A
URL: N/A

Commit & Branch

Branch: test/expand-coverage-batch-3

Validation Run

cargo check --tests — clean (warnings pre-existing)
cargo test --lib bus_tests store_tests mock_tests ops_tests transcript_tests text_tests — 124 new tests, 0 failed
Pre-push hook (fmt + lint) — passed after auto-fix commit

Behavior Changes

None; test-only.

Parity Contract

N/A; no product code touched.

Summary by CodeRabbit

Tests
- Added end-to-end scenarios covering version checks, notification ingest/listing, and thread create/message flows.
- Expanded unit tests for notification generation, deduplication, broadcasting, and deep-link/identifier invariants.
- Added tests for the mock service backend, token/text processing utilities (grapheme/ANSI/regex rules), and update operation behaviors and results.

Adds Rust unit tests and mega-flow scenarios for 7 high-priority features picked from test-map.md, continuing the pattern of tinyhumansai#1724 (batch 1) and tinyhumansai#1752 (batch 2).

Rust (124 new tests, all passing):
- update/ops_tests.rs (11): update.version contract (CARGO_PKG_VERSION, target triple, openhuman-core-{triple} asset prefix, no remote calls); update.run orchestration result shapes (already_current, missing_asset, apply_failure, supervisor strategy); update_run rejected when rpc_mutations_enabled=false.
- service/mock_tests.rs (15): OPENHUMAN_SERVICE_MOCK truthy/falsy parsing; state machine (NotInstalled → Stopped → Running → Stopped → NotInstalled); forced-failure injection via JSON state file; dispatch routing to mock when env flag set.
- notifications/bus_tests.rs (14): event_to_notification mapping for WebhookProcessed (silent <400, emits ≥400, error wins over status, error text replaces status code, deep_link target); CronJobCompleted id format and deep_link; SubagentCompleted/Failed deep_links + error truncation; subscriber publish/subscribe roundtrip.
- notifications/store_tests.rs (13): exists_recent dedup window: true for identical (provider, account, title, body) within 60s, false past 60s boundary, false when any field differs, NULL account_id handling; insert_if_not_recent preserves first entry, allows re-insert after expiry, allows distinct provider/body.
- tokenjuice/text_tests.rs (26): strip_ansi multi-byte safety (CJK/emoji/Arabic/ZWJ); dedupe_adjacent edge cases; clamp_text_middle grapheme safety + omission marker; 3-layer overlay precedence (project > user > builtin); rule loader gracefully handles invalid regex (single bad pattern dropped, all-bad → empty, disk-loaded bad regex doesn't panic).
- agent/harness/session/transcript_tests.rs (+4): find_root_transcript for thread isolation, empty thread_id, newest-when-multiple, excludes subagent files.

E2E (mega-flow.spec.ts, +200 lines):
- Scenario 12: openhuman.update_version RPC contract (semver, target_triple, asset prefix); no outbound mock request to update host.
- Scenario 13: openhuman.notification_ingest dedup; same payload twice yields ≤1 entry on notification_list.
- Scenario 14: threads create_new → message_append → messages_list roundtrip on a single thread within one session.

Invalid / out-of-scope items (reported, not faked):
- service.shutdown flush-window delay logic: the delay lives in ShutdownSubscriber, not the RPC handler; the existing inline tests already cover the event-publish contract.
- Frontend ServiceBlockingGate state transitions: TS/React component, no Rust counterpart.
- Turn-state mirror persistence and startup interruption marking: no Rust persistence struct; handled at the UI hydration layer.
- service.* RPC mock-backend lifecycle scenario: env var read at process boot, can't flip mid-session; no service namespace in any controller schema either way.
- Pick 7 (conversation threads RPCs): already fully covered by an existing conversations/store_tests.rs (37+ tests) from a prior batch; not duplicated. Mega-flow scenario 14 still added as a thin end-to-end smoke through the threads namespace.
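The dedup-window behaviour listed for notifications/store_tests.rs can be illustrated with a small in-memory sketch. This is not the store from the PR: `Key`, `MemoryStore`, and the explicit `now` parameter are hypothetical stand-ins, and treating an age of exactly 60 seconds as expired is an assumption, since the description only says "false past 60s boundary".

```rust
/// Hypothetical dedup key: the four fields the description says are compared.
#[derive(Clone, Debug, PartialEq, Eq)]
struct Key {
    provider: String,
    account_id: Option<String>, // None models a NULL account_id
    title: String,
    body: String,
}

/// In-memory stand-in for the notification store (assumption, not the real type).
struct MemoryStore {
    window_secs: u64,
    entries: Vec<(Key, u64)>, // (key, insertion time in seconds)
}

impl MemoryStore {
    fn new(window_secs: u64) -> Self {
        MemoryStore { window_secs, entries: Vec::new() }
    }

    /// True when an identical key was inserted less than `window_secs` ago.
    /// Option equality treats None == None; a SQL-backed store would need an
    /// explicit IS NULL comparison to get the same result.
    fn exists_recent(&self, key: &Key, now: u64) -> bool {
        self.entries
            .iter()
            .any(|(k, at)| k == key && now.saturating_sub(*at) < self.window_secs)
    }

    /// Inserts only when no recent duplicate exists; returns whether it inserted.
    fn insert_if_not_recent(&mut self, key: Key, now: u64) -> bool {
        if self.exists_recent(&key, now) {
            return false;
        }
        self.entries.push((key, now));
        true
    }
}
```

Passing `now` explicitly keeps the window boundary testable without sleeping, which is one way a test suite could exercise the "re-insert after expiry" case deterministically.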

coderabbitai · 2026-05-15T03:31:55Z

📝 Walkthrough


Adds unit tests for notifications::bus (webhook translation, deep-link/ID invariants, publish/subscribe) and for tokenjuice text utilities and rule loader (ANSI stripping, dedupe, grapheme-safe truncation, overlay precedence, invalid-regex handling). All changes are test-only.

Changes

Notifications bus tests

Layer / File(s)	Summary
Event-to-notification boundaries `src/openhuman/notifications/bus_tests.rs`	Tests webhook->notification emission rules, identifier/deep-link invariants, broadcast/publish-subscribe behavior, and stable `NotificationBridgeSubscriber` naming.
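As a rough illustration of the webhook emission rules these tests pin down (silent below 400, emit at 400 and above, error wins over status, error text replaces the status code), here is a minimal sketch. The function name, its signature, and the "HTTP <code>" wording are assumptions for illustration; the real `event_to_notification` mapping is not shown in this PR page.

```rust
/// Hypothetical reduction of the webhook rule: returns the notification text,
/// or None when the event should stay silent.
fn webhook_notification_text(status: u16, error: Option<&str>) -> Option<String> {
    match error {
        // An error always emits, and its text replaces the status code.
        Some(text) => Some(text.to_string()),
        // No error: only failing statuses (>= 400) emit.
        None if status >= 400 => Some(format!("HTTP {}", status)),
        // No error and a status below 400: stay silent.
        None => None,
    }
}
```

Matching on the error first is what makes "error wins over status" hold even when the status itself looks successful.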

Tokenjuice text & rule loader tests

Layer / File(s)	Summary
Module imports and strip_ansi `src/openhuman/tokenjuice/text_tests.rs`	Adds module-level imports/docs and `strip_ansi` tests covering CJK, emoji, ZWJ, mixed scripts, and edge cases.
dedupe_adjacent `src/openhuman/tokenjuice/text_tests.rs`	Tests collapsing adjacent identical lines and multi-byte correctness.
clamp_text_middle & grapheme helpers `src/openhuman/tokenjuice/text_tests.rs`	Grapheme-safe truncation tests, emoji/ZWJ safety, omission marker, and grapheme-count helper round-trip.
Rule loader overlay precedence `src/openhuman/tokenjuice/text_tests.rs`	Tests project > user > builtin precedence and `RuleOrigin` selection using temp directories.
skip_patterns compile and load_rules end-to-end `src/openhuman/tokenjuice/text_tests.rs`	Tests invalid-regex handling for `skip_patterns` and an end-to-end load_rules test ensuring bad on-disk patterns don't block valid rules.
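The project > user > builtin precedence exercised by the rule-loader tests can be sketched as a last-writer-wins merge. This is an assumption about the shape of the behaviour, not the loader from the PR: `Origin`, `merge_layers`, and the rule ids used below are hypothetical, and rule bodies are reduced to plain strings.

```rust
use std::collections::BTreeMap;

/// Hypothetical stand-in for the origin a rule was loaded from.
#[derive(Clone, Copy, Debug, PartialEq, Eq)]
enum Origin {
    Builtin,
    User,
    Project,
}

/// Merges three layers of (rule id, rule body) pairs. Layers are applied from
/// lowest to highest precedence, so a later layer overwrites an earlier one.
fn merge_layers(
    builtin: &[(&str, &str)],
    user: &[(&str, &str)],
    project: &[(&str, &str)],
) -> BTreeMap<String, (Origin, String)> {
    let mut merged = BTreeMap::new();
    let layers = [
        (Origin::Builtin, builtin),
        (Origin::User, user),
        (Origin::Project, project),
    ];
    for (origin, layer) in layers {
        for (id, body) in layer {
            merged.insert(id.to_string(), (origin, body.to_string()));
        }
    }
    merged
}
```

Recording the origin next to each merged rule is what lets a test assert both which body won and where it came from.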

🎯 3 (Moderate) | ⏱️ ~25 minutes

"🐰 In burrows I test, with whiskers twitched bright,
Rules and texts kept tidy, not a grapheme in sight.
Notifications quiet where duplicates try,
Regex trimmed kindly, no panics — oh my!
Hooray for green tests through day and night."

🚥 Pre-merge checks | ✅ 5

✅ Passed checks (5 passed)

Check name	Status	Explanation
Description Check	✅ Passed	Check skipped - CodeRabbit’s high-level summary is enabled.
Title check	✅ Passed	The title directly and accurately summarizes the main change: expanding unit and e2e test coverage across multiple modules (batch 3 from test-map).
Docstring Coverage	✅ Passed	Docstring coverage is 100.00% which is sufficient. The required threshold is 80.00%.
Linked Issues check	✅ Passed	Check skipped because no linked issues were found for this pull request.
Out of Scope Changes check	✅ Passed	Check skipped because no linked issues were found for this pull request.


coderabbitai

Actionable comments posted: 4

🧹 Nitpick comments (1)

app/test/e2e/specs/mega-flow.spec.ts (1)
606-658: ⚡ Quick win

Add explicit failure diagnostics for the new E2E scenarios.

These scenarios log happy-path details, but they don’t capture structured diagnostics on assertion failure (request-log snapshot + accessibility dump), which slows triage for flaky/CI-only failures.

A small helper wrapper per scenario (try/catch → print getRequestLog() + dumpAccessibilityTree() → rethrow) would satisfy the diagnostics rule consistently.

As per coding guidelines, "Add failure diagnostics (request logs, dumpAccessibilityTree()) in E2E specs for faster debugging by agents".

Also applies to: 667-721, 731-790
🤖 Prompt for AI Agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@app/test/e2e/specs/mega-flow.spec.ts` around lines 606 - 658, The test lacks
structured failure diagnostics; wrap the body of the "it('update.version:
returns version...')" test (and the other affected specs) in a try/catch that on
catch logs getRequestLog() and await dumpAccessibilityTree(), then rethrows so
failures still fail the test; implement a small helper (e.g.
withFailureDiagnostics(async () => { ... })) that accepts the test async
function, runs it, and on error prints JSON.stringify(getRequestLog(), null, 2)
and the result of await dumpAccessibilityTree() before rethrowing; update the
calls around callOpenhumanRpc(), getRequestLog(), and dumpAccessibilityTree() in
this spec and the other noted specs (lines ~667-721 and ~731-790) to use the
helper.

🤖 Prompt for all review comments with AI agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

Inline comments:
In `@app/test/e2e/specs/mega-flow.spec.ts`:
- Around line 650-654: The current no-egress check only inspects getRequestLog()
from the mock server and misses direct outbound HTTP calls (so regressions in
openhuman.update_version could still egress); update the test to force all
outbound HTTP through a controllable interceptor and assert zero hits there —
e.g., configure the test runtime to use an HTTP proxy or set the update host
override to point to the mock server (or inject a mocked HTTP client used by
openhuman.update_version), then replace the getRequestLog() assertion with an
assertion against the proxy/interceptor's request log (or verify the injected
client received no requests) to ensure no real network egress occurs.

In `@src/openhuman/notifications/bus_tests.rs`:
- Around line 198-237: The test is order-dependent because it assumes the first
message from subscribe_core_notifications() is the one just published; change
the assertion to wait for and assert on the matching CoreNotificationEvent by
looping over rx.try_recv() (or rx.recv_timeout) until you find an event with id
== "test-123" (or until a short timeout/attempt limit) and ignore other
unrelated events; update the code around publish_core_notification(evt.clone()),
replacing the single rx.try_recv() assert with a loop that discards non-matching
events and only asserts the id/title/category when a matching
CoreNotificationEvent is received (fail if timeout/attempts exhausted).

In `@src/openhuman/tokenjuice/text_tests.rs`:
- Around line 126-145: The tests clamp_text_middle_output_is_valid_utf8 and
clamp_text_middle_does_not_split_emoji currently only assert UTF-8 validity
(tautological for String); change them to assert grapheme-boundary correctness:
call clamp_text_middle(&s, n) and then verify that the retained prefix and
suffix each end/start on full grapheme cluster boundaries (e.g., using
unicode_segmentation::UnicodeSegmentation::graphemes or GraphemeCursor) and that
concatenating those preserved graphemes equals the same sequence of grapheme
clusters from the original string at those ranges; additionally add a specific
check for ZWJ-family clusters so neither prefix nor suffix contains a partial
ZWJ sequence. Ensure you update both test functions
(clamp_text_middle_output_is_valid_utf8 and
clamp_text_middle_does_not_split_emoji) and the similar checks mentioned for
lines 183-191.
- Around line 401-406: The test currently uses an if-let that silently passes if
the "test/disk-bad-regex" rule is missing; change it to assert that a rule with
rule.id == "test/disk-bad-regex" is present and then assert its
compiled.skip_patterns is empty. Specifically, replace the if-let lookup over
rules.iter().find(|r| r.rule.id == "test/disk-bad-regex") with a firm assertion
that the find returned Some(...) (or use unwrap()/expect with a clear message),
then check bad.compiled.skip_patterns.is_empty() with the existing failure
message so the test fails when the bad-rule is dropped as well as when its
skip_patterns are non-empty.

---

Nitpick comments:
In `@app/test/e2e/specs/mega-flow.spec.ts`:
- Around line 606-658: The test lacks structured failure diagnostics; wrap the
body of the "it('update.version: returns version...')" test (and the other
affected specs) in a try/catch that on catch logs getRequestLog() and await
dumpAccessibilityTree(), then rethrows so failures still fail the test;
implement a small helper (e.g. withFailureDiagnostics(async () => { ... })) that
accepts the test async function, runs it, and on error prints
JSON.stringify(getRequestLog(), null, 2) and the result of await
dumpAccessibilityTree() before rethrowing; update the calls around
callOpenhumanRpc(), getRequestLog(), and dumpAccessibilityTree() in this spec
and the other noted specs (lines ~667-721 and ~731-790) to use the helper.
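The `test/disk-bad-regex` finding above (an `if let` lookup that silently passes when the rule is missing) comes down to replacing an optional check with a firm one. A minimal sketch, assuming a flattened `LoadedRule` stand-in rather than the real `rule.id` / `compiled.skip_patterns` layout, with a hypothetical helper name:

```rust
/// Hypothetical, flattened stand-in for a loaded rule.
struct LoadedRule {
    id: String,
    skip_patterns: Vec<String>,
}

/// Fails when the bad-regex rule is missing as well as when it kept patterns.
/// An `if let Some(bad) = ...` version would skip both checks if the rule
/// had been dropped, so the test would pass for the wrong reason.
fn check_bad_regex_rule(rules: &[LoadedRule]) {
    let bad = rules
        .iter()
        .find(|r| r.id == "test/disk-bad-regex")
        .expect("test/disk-bad-regex should be loaded, not silently dropped");
    assert!(
        bad.skip_patterns.is_empty(),
        "invalid skip_patterns should compile to an empty list"
    );
}
```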

🪄 Autofix (Beta)

Fix all unresolved CodeRabbit comments on this PR:

Push a commit to this branch (recommended)
Create a new PR with the fixes

ℹ️ Review info

⚙️ Run configuration

Configuration used: Organization UI

Review profile: CHILL

Plan: Pro

Run ID: 77d20c0c-c2bc-4aa6-a762-e3a3fc226e4a

📥 Commits

Reviewing files that changed from the base of the PR and between 6cb82a3 and e07e69e.

📒 Files selected for processing (12)

app/test/e2e/specs/mega-flow.spec.ts
src/openhuman/agent/harness/session/transcript_tests.rs
src/openhuman/notifications/bus.rs
src/openhuman/notifications/bus_tests.rs
src/openhuman/notifications/store.rs
src/openhuman/notifications/store_tests.rs
src/openhuman/service/mock.rs
src/openhuman/service/mock_tests.rs
src/openhuman/tokenjuice/mod.rs
src/openhuman/tokenjuice/text_tests.rs
src/openhuman/update/ops.rs
src/openhuman/update/ops_tests.rs

- notifications/bus_tests: drain up to 64 events when looking for the test event so the assertion is order-independent on the shared global broadcast bus (parallel tests can interleave).
- tokenjuice/text_tests: replace tautological from_utf8 checks in the three clamp_text_middle grapheme tests with real boundary assertions: every output grapheme must equal a complete grapheme of the source (or the omission marker), and ZWJ family clusters must never split.
- tokenjuice/text_tests: tighten invalid_regex_loaded_from_disk_is_skipped_not_fatal: assert the bad-regex rule is present (not silently dropped) and that its compiled skip_patterns is empty.
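The order-independent lookup described for notifications/bus_tests can be sketched roughly as follows. This is a stand-in, not the code from bus_tests.rs: it uses `std::sync::mpsc` instead of the shared broadcast bus, and `Event` and `drain_for_id` are hypothetical names. Only the cap of 64 events and the `test-123` id come from the review thread.

```rust
use std::sync::mpsc::{Receiver, TryRecvError};

/// Hypothetical stand-in for the notification event type.
#[derive(Clone, Debug, PartialEq)]
struct Event {
    id: String,
    title: String,
}

/// Reads at most `max_events` pending events and returns the first one whose
/// id matches, discarding unrelated events published by other tests.
fn drain_for_id(rx: &Receiver<Event>, id: &str, max_events: usize) -> Option<Event> {
    for _ in 0..max_events {
        match rx.try_recv() {
            Ok(evt) if evt.id == id => return Some(evt),
            Ok(_) => continue, // unrelated event: skip it
            Err(TryRecvError::Empty) | Err(TryRecvError::Disconnected) => return None,
        }
    }
    None
}
```

A test would then call something like `drain_for_id(&rx, "test-123", 64).expect(...)` and assert on the returned event, so that a miss fails the test instead of comparing against whichever event happened to arrive first.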

coderabbitai

🧹 Nitpick comments (1)

src/openhuman/tokenjuice/text_tests.rs (1)
137-142: ⚡ Quick win

Tighten the ASCII whitelist to avoid false positives in grapheme-safety assertion.

This continue path allows any ASCII grapheme, so unexpected ASCII corruption in clamped can still pass. Prefer asserting clamped as source_prefix + omission_marker + source_suffix at grapheme boundaries instead of broadly whitelisting ASCII.
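One way to express the suggested exact-match assertion is sketched below. It is an illustration under assumptions: `is_prefix_marker_suffix` is a hypothetical helper, the source is passed in already split into grapheme clusters (a real test would presumably get these from a segmentation library), and the marker text is a parameter because the actual omission marker is not shown here.

```rust
/// Returns true when `clamped` is exactly: some leading source clusters,
/// then `marker`, then some trailing source clusters.
/// `source` holds the source text already split into grapheme clusters.
fn is_prefix_marker_suffix(source: &[&str], clamped: &str, marker: &str) -> bool {
    for keep_head in 0..=source.len() {
        let head = source[..keep_head].concat();
        let rest = match clamped.strip_prefix(head.as_str()) {
            Some(rest) => rest,
            // Each longer head extends this one, so none of them can match.
            None => break,
        };
        let tail = match rest.strip_prefix(marker) {
            Some(tail) => tail,
            None => continue,
        };
        // The tail must be a whole-cluster suffix that does not overlap the head.
        if (keep_head..=source.len()).any(|start| source[start..].concat() == tail) {
            return true;
        }
    }
    false
}
```

Because both the head and the tail are rebuilt from whole clusters, a stray ASCII character or a partially kept ZWJ sequence in the output makes the check fail rather than slip through a whitelist.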
🤖 Prompt for AI Agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@src/openhuman/tokenjuice/text_tests.rs` around lines 137 - 142, The current
grapheme-safety check lets any ASCII grapheme (g) pass, which can hide
corruption; instead assert that clamped equals the concatenation of
source_prefix + omission_marker + source_suffix at grapheme cluster boundaries:
compute grapheme clusters for clamped and the original source, verify that
clamped is exactly the prefix clusters + omission_marker + suffix clusters
(using the same grapheme segmentation) rather than allowing any ASCII via
g.chars().all(...); update the test around the variables g, clamped,
source_prefix, omission_marker, and source_suffix to perform this exact-match
assertion on grapheme clusters.

🤖 Prompt for all review comments with AI agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

Nitpick comments:
In `@src/openhuman/tokenjuice/text_tests.rs`:
- Around line 137-142: The current grapheme-safety check lets any ASCII grapheme
(g) pass, which can hide corruption; instead assert that clamped equals the
concatenation of source_prefix + omission_marker + source_suffix at grapheme
cluster boundaries: compute grapheme clusters for clamped and the original
source, verify that clamped is exactly the prefix clusters + omission_marker +
suffix clusters (using the same grapheme segmentation) rather than allowing any
ASCII via g.chars().all(...); update the test around the variables g, clamped,
source_prefix, omission_marker, and source_suffix to perform this exact-match
assertion on grapheme clusters.

ℹ️ Review info

⚙️ Run configuration

Configuration used: Organization UI

Review profile: CHILL

Plan: Pro

Run ID: 0bfd1958-7213-41be-a9ac-3f36890c27c6

📥 Commits

Reviewing files that changed from the base of the PR and between e07e69e and 27f14d5.

📒 Files selected for processing (2)

src/openhuman/notifications/bus_tests.rs
src/openhuman/tokenjuice/text_tests.rs


senamakel added 2 commits May 14, 2026 20:29

chore: apply auto-fixes (fmt/lint)

e07e69e

senamakel requested a review from a team May 15, 2026 03:31

coderabbitai Bot requested changes May 15, 2026


Comment thread app/test/e2e/specs/mega-flow.spec.ts

Comment thread src/openhuman/notifications/bus_tests.rs

Comment thread src/openhuman/tokenjuice/text_tests.rs Outdated

Comment thread src/openhuman/tokenjuice/text_tests.rs Outdated

senamakel added 2 commits May 14, 2026 20:47

chore: apply auto-fixes (rustfmt)

27f14d5

coderabbitai Bot reviewed May 15, 2026


coderabbitai Bot approved these changes May 15, 2026


Merge remote-tracking branch 'upstream/main' into test/expand-coverag…

678d9c2

…e-batch-3

senamakel self-assigned this May 15, 2026

senamakel merged commit 609b6b7 into tinyhumansai:main May 15, 2026
21 checks passed

coderabbitai Bot mentioned this pull request May 18, 2026

fix(e2e): resolve 30+ failing E2E specs — i18n, assertion, and import fixes #2092

Open

12 tasks

senamakel merged 5 commits into tinyhumansai:main from senamakel:test/expand-coverage-batch-3
