test(e2e): make the harness actually exercise the UI past login by senamakel · Pull Request #1859 · tinyhumansai/openhuman

senamakel · 2026-05-15T22:06:53Z

Summary

Root cause discovered: post-login assertions in every E2E spec were silently dead. The packaged E2E binary built with vite build (prod-mode IS_DEV=false) ran the real app.restart() on every identity flip, killing the chromium-driver's CDP target. Specs reported "passing" because most assertions were RPC oracles that ran before the dead session blocked them.
Fix the session: build the E2E frontend in dev-like mode (vite build --mode development + a MODE === 'development' gate in restartApp) so identity flip falls back to window.location.reload(). The WebDriver session survives.
Fix onboarding detection: the old walkOnboarding matched a hardcoded list of step titles and silently skipped any step it didn't recognize ("Connect your Gmail" etc.). Replace with a testid-keyed loop (data-testid="onboarding-next-button") that advances until the button truly unmounts, including across the post-auth reload.
Fix BootCheckGate blocking auth: dismiss the "Choose core mode" modal before triggering the auth deep-link — without this, waitForAuthReadiness can't make progress and the spec wedges on the login screen with "Sign-in failed. Please try again.".
Add the reset primitive: new openhuman.test_reset Rust RPC + resetApp(userId) E2E helper that wipe sidecar state in-place between tests (auth + onboarding flag + cron jobs) so each spec starts from a fresh-install baseline without restarting the process.
Reference spec: cron-jobs-flow.spec.ts fully rewritten to drive real UI clicks — Settings → Cron Jobs → Pause / Resume / Refresh / Remove — with one trailing cron_list RPC as oracle. Passes 4/4 in ~30s.

Problem

The E2E harness wasn't catching post-login regressions. Specs hit triggerAuthDeepLinkBypass → app called app.restart() → CDP target died → every subsequent browser.execute / $() call returned "session terminated" — including the assertions. Because the failures all looked the same and most assertions were RPC oracles already happy from before the restart, broken UI shipped silently.

Verified by checking out the pre-existing cron-jobs-flow.spec.ts (commit 609b6b70) against the unmodified packaged build: same "session terminated" failure pattern at the same point. Not a regression from this branch — a latent harness flaw that this branch makes visible and fixes.

Solution

Three layered fixes, in order of leverage:

Dev-mode E2E bundle (app/scripts/e2e-build.sh + app/package.json + app/src/utils/tauriCommands/core.ts). The frontend now builds with vite build --mode development, the script overrides Tauri's beforeBuildCommand to a no-op so it doesn't clobber the dev dist, and restartApp gates on import.meta.env.MODE === 'development' (since vite build always produces DEV=false regardless of --mode). Result: identity flip becomes window.location.reload() and the WebDriver session survives.
Testid-keyed onboarding walker (app/test/e2e/helpers/shared-flows.ts). Replaces the title-list matcher with a loop that dispatches a synthetic MouseEvent on [data-testid="onboarding-next-button"] until it unmounts. Also dismisses BootCheckGate first (lifted out into the exported dismissBootCheckGateIfVisible). Survives the post-auth reload by treating button-gone + hash-still-on-/onboarding as "wait, don't claim victory yet."
Pre-deep-link BootCheckGate dismissal (app/test/e2e/helpers/deep-link-helpers.ts). Inlined into triggerAuthDeepLinkBypass (importing from shared-flows would be circular). Confirms bundled-core mode before the auth handler can race against waitForAuthReadiness.
openhuman.test_reset RPC (src/openhuman/test_support/). Wipes auth (active_user.toml + api_key), clears chat_onboarding_completed, and deletes all cron rows in-place. resetApp() helper races the call against an 8s budget so the very first spec of a run (sidecar not yet spawned) treats unreachability as "already pristine."
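
The 8s budget described for resetApp() can be pictured as a race between the RPC call and a timer. What follows is a minimal sketch under stated assumptions, not the helper in reset-app.ts: `callWithBudget`, `ResetOutcome`, and the decision to treat a rejected call the same as a timeout are hypothetical names and choices made for illustration.

```typescript
// Hypothetical result type: either the wipe ran, or the sidecar could not be
// reached within the budget.
type ResetOutcome<T> =
  | { kind: 'wiped'; summary: T }
  | { kind: 'unreachable' };

// Race an RPC call against a time budget (the PR text uses 8s for test_reset).
// Assumption: a timeout and a rejected call both mean "sidecar not up yet",
// which the PR treats as an already-pristine baseline.
async function callWithBudget<T>(
  call: () => Promise<T>,
  budgetMs: number,
): Promise<ResetOutcome<T>> {
  let timer: ReturnType<typeof setTimeout> | undefined;
  const timeout = new Promise<ResetOutcome<T>>((resolve) => {
    timer = setTimeout(() => resolve({ kind: 'unreachable' }), budgetMs);
  });
  // Promise.resolve().then(call) also captures synchronous throws from `call`.
  const attempt = Promise.resolve()
    .then(call)
    .then(
      (summary): ResetOutcome<T> => ({ kind: 'wiped', summary }),
      (): ResetOutcome<T> => ({ kind: 'unreachable' }),
    );
  try {
    return await Promise.race([attempt, timeout]);
  } finally {
    // Clear the timer so a fast RPC does not leave a pending timeout behind.
    if (timer !== undefined) clearTimeout(timer);
  }
}
```

A caller would reload the renderer only when the outcome is `wiped`, which matches the commit note about not discarding the "Choose core mode" acceptance when nothing was wiped.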

Submission Checklist

Tests added or updated — cron-jobs-flow.spec.ts is the reference rewrite; the helpers and the RPC are exercised by it end-to-end.
Diff coverage ≥ 80% — N/A: changes are all in test infrastructure (E2E helpers, test_support domain) and a build script. The cron RPC clear_all_jobs is exercised end-to-end by the reference spec. Frontend logic in restartApp adds one branch that is intentionally hit only in the E2E build.
Coverage matrix updated — N/A: harness-only change, no product features added/removed.
All affected feature IDs from the matrix are listed in the PR description under ## Related
No new external network dependencies introduced
Manual smoke checklist updated if this touches release-cut surfaces — N/A: shell restartApp only changes behavior under MODE === 'development', which is bound to the E2E build flag and not present in release bundles.
Linked issue closed via Closes #NNN in the ## Related section — none open; this is a discovery-driven fix.

Impact

Runtime/platform: desktop only. restartApp adds one extra check (import.meta.env.MODE === 'development') before invoking the Tauri restart_app command. Production bundles built with pnpm build (no --mode development) take the existing OS-restart path. This was verified by inspecting the dist output: grep "restartApp: invoking restart_app" matches in the prod-mode bundle, while grep "restartApp: dev-like" matches only in the E2E bundle.
Security: openhuman.test_reset is a destructive RPC always reachable while the sidecar is up. Acceptable for now because the sidecar binds to 127.0.0.1 and requires the per-launch bearer token written to ${tmpdir}/openhuman-e2e-rpc-token; release builds never write that token, so unauthenticated callers can't invoke it. A follow-up issue gates it behind an explicit env flag — see follow-up issues filed alongside this PR.
Performance: cron spec runtime ~30s end-to-end including auth + onboarding + 4 UI assertions. No measurable change for any other path.
Migration: none. Existing specs are unchanged; the shared helpers they call are the changed surface.

Closes:
Follow-up PR(s)/TODOs: a batch of GitHub issues filed alongside this PR documents the per-spec sweep work (mega-flow, memory-roundtrip, tool-, webhooks-, skill-execution, etc.) — each spec currently runs but doesn't actually drive the UI past login. The harness fixes here unblock that work; the per-spec conversion is the follow-up.

AI Authored PR Metadata (required for Codex/Linear PRs)

Authored by: Claude (Sonnet/Opus) in a Claude Code session
Human reviewer: @senamakel

Summary by CodeRabbit

Tests
- Enhanced end-to-end test infrastructure and helpers for more reliable flows and onboarding.
- Added a test-only app reset (wipe) to restore a fresh-install baseline (disabled in shipping builds).
- Improved dev-like restart behavior during E2E runs to speed iterative test cycles.
- Added robust handling to dismiss an onboarding/boot gate during automated runs.

…ec as real-UI reference

Establishes the new E2E pattern: one Appium session for the whole run, each spec begins with `resetApp(<userId>)` which calls the in-place `openhuman.test_reset` RPC, reloads the renderer, and walks the real onboarding UI. Specs then drive the product through clicks, not direct RPC calls.

Rust core
- src/openhuman/cron/store.rs: `clear_all_jobs(config) -> usize`
- src/openhuman/test_support/{mod,rpc,schemas}.rs: new domain exposing `openhuman.test_reset` — wipes auth (active_user.toml + api_key), clears chat_onboarding_completed, deletes all cron rows in-place.
- Wired into src/core/all.rs + src/openhuman/mod.rs.

E2E harness
- app/test/e2e/helpers/reset-app.ts: `resetApp(userId, { skipAuth? })` — RPC wipe -> storage clear + reload -> deep-link auth + onboarding walk.

Reference spec
- app/test/e2e/specs/cron-jobs-flow.spec.ts rewritten end-to-end: Settings -> Cron Jobs panel via real navigation; Pause/Resume/Refresh/Remove via button clicks; UI assertions only, with one trailing cron_list oracle to confirm the sidecar agrees.

Follow-ups: extend test_reset coverage as other domains are converted (memory, channels, skills, accounts, threads, notifications). Add data-testid hooks before the sweep so selectors don't depend on Tailwind class strings.

…re-mode modal

- Race the openhuman.test_reset RPC against an 8s budget. On the very first spec of a run the user-scoped sidecar hasn't spawned yet, so the RPC is unreachable — that's the same baseline state the wipe would have produced, so treat it as success and skip the wipe.
- Only reload the renderer when we actually wiped something, otherwise we'd throw away the in-app "Choose core mode" acceptance and wedge on the BootCheckGate modal.
- Add dismissCoreModeModalIfVisible() that retries with a synthetic MouseEvent (more reliable than raw .click() against React's onClick) and is called both after the optional reload and after the deep-link auth in case BootCheckGate re-mounts.

…river session

The packaged E2E binary was running `app.restart()` on every login (the identity-flip path in CoreStateProvider). That destroyed the CDP target the chromium-driver session was attached to, and every subsequent command failed with "session terminated" — silently masking ~all post-login assertions in the existing specs.

`vite build` always produces PROD=true / DEV=false regardless of --mode, so flipping the build mode alone doesn't change IS_DEV. Instead gate `restartApp` on `import.meta.env.MODE === 'development'` too, and have the E2E build script invoke `vite build --mode development` up-front with a no-op `beforeBuildCommand` so Tauri doesn't overwrite the dev-mode dist with a fresh prod build.

Also harden the onboarding walker: the shared `walkOnboarding` matches a fixed list of step *titles* and silently skips any step it doesn't recognize ("Connect your Gmail" etc.). Use the stable `data-testid="onboarding-next-button"` selector instead and keep advancing until the button unmounts.

cron-jobs-flow spec now passes 4/4 in 28s end-to-end.
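
The gate described in this commit can be sketched as follows. This is an illustrative sketch rather than the code in core.ts: the env values are passed in as a plain object instead of being read from `import.meta.env`, and `ViteEnvLike`, `RestartActions`, and `isDevLike` are hypothetical names introduced so the branch logic can run outside a bundler.

```typescript
// Shape of the two Vite env fields this sketch reads. In a Vite app these come
// from import.meta.env; here they are injected so the logic is testable.
interface ViteEnvLike {
  DEV: boolean;
  MODE: string;
}

// Hypothetical side effects, injected for illustration.
interface RestartActions {
  reloadWindow: () => void; // stands in for window.location.reload()
  nativeRestart: () => Promise<void>; // stands in for the Tauri restart_app command
}

// Per the commit message, `vite build` yields DEV=false even with
// --mode development, so MODE is checked as well.
function isDevLike(env: ViteEnvLike): boolean {
  return env.DEV || env.MODE === 'development';
}

async function restartApp(
  env: ViteEnvLike,
  actions: RestartActions,
): Promise<'reload' | 'native'> {
  if (isDevLike(env)) {
    // Dev-like bundle: reload in place so the WebDriver session survives.
    actions.reloadWindow();
    return 'reload';
  }
  // Production bundle: keep the existing OS-level restart path.
  await actions.nativeRestart();
  return 'native';
}
```

With `{ DEV: false, MODE: 'development' }`, which is what the E2E build is described as producing, the sketch takes the reload branch; with `{ DEV: false, MODE: 'production' }` it takes the native restart.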

walkOnboarding() was matching against a hardcoded list of step titles (Welcome, Install Skills, …) so any onboarding step that wasn't in ONBOARDING_OVERLAY_TEXTS (e.g. "Connect your Gmail") silently fell through. The walker reported success while the renderer stayed wedged behind onboarding, masking post-login breakage in ~every spec that uses completeOnboardingIfVisible. Replace the body with a testid-keyed loop: wait for data-testid="onboarding-next-button" to mount, click it via synthetic MouseEvent until it unmounts, bail if it stays disabled too long. resetApp now delegates to the shared walker (no more duplicated logic). This single helper change is the leverage point — every existing spec that already calls completeOnboardingIfVisible now actually advances past current and future onboarding steps.
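
A testid-keyed walker of this kind might look like the sketch below. It is not the code in shared-flows.ts: the page is abstracted behind a hypothetical `OnboardingProbe` interface (in a real spec its methods would wrap WebDriver calls that query `[data-testid="onboarding-next-button"]` and read the location hash), and the option names and poll limits are assumptions. The "button gone but hash still on onboarding" rule follows the PR description.

```typescript
type NextButtonState = 'absent' | 'disabled' | 'enabled';

// Hypothetical page probe, injected so the loop can run without a browser.
interface OnboardingProbe {
  nextButtonState(): Promise<NextButtonState>;
  clickNext(): Promise<void>;
  currentHash(): Promise<string>;
  sleep(ms: number): Promise<void>;
}

// Hypothetical tuning knobs.
interface WalkOptions {
  maxPolls: number; // overall safety cap on loop iterations
  maxDisabledPolls: number; // consecutive disabled polls tolerated before bailing
  pollMs: number;
}

// Clicks the next button until it unmounts and the app has left onboarding.
// Returns the number of clicks performed.
async function walkOnboarding(
  probe: OnboardingProbe,
  opts: WalkOptions,
): Promise<number> {
  let clicks = 0;
  let disabledStreak = 0;
  for (let poll = 0; poll < opts.maxPolls; poll++) {
    const state = await probe.nextButtonState();
    if (state === 'enabled') {
      await probe.clickNext();
      clicks++;
      disabledStreak = 0;
    } else if (state === 'disabled') {
      disabledStreak++;
      if (disabledStreak > opts.maxDisabledPolls) {
        throw new Error('onboarding next button stayed disabled');
      }
    } else {
      const hash = await probe.currentHash();
      // Button absent and the route is no longer onboarding: done.
      if (!hash.startsWith('#/onboarding')) return clicks;
      // Button absent but still on an onboarding route: probably a reload or
      // a step transition, so keep waiting instead of claiming victory.
      disabledStreak = 0;
    }
    await probe.sleep(opts.pollMs);
  }
  throw new Error('onboarding did not finish within the poll budget');
}
```

Because the loop keys on the button rather than on step titles, a newly added step needs no change to the walker as long as it renders the same test id.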

…-auth reload

Two changes that make existing specs (which don't use resetApp) actually work end-to-end, not just up to the login screen.

1. Pre-deep-link BootCheckGate dismissal in triggerAuthDeepLinkBypass. The handler calls `waitForAuthReadiness` which can't make progress while the "Choose core mode" modal is up — it eventually fails with "Sign-in failed. Please try again." and the spec is wedged on the login screen. Inlined here (rather than imported from shared-flows) to avoid a circular dependency.
2. walkOnboarding tolerates the post-auth reload. The deep-link auth triggers an identity flip → `restartApp` → reload, which re-mounts onboarding fresh for the new user namespace. Previously the walker exited the moment the next-button briefly unmounted between steps and the spec navigated mid-reload. Now it stays in the loop while the hash is still on `#/onboarding/*`.

Also factor `dismissBootCheckGateIfVisible` into shared-flows as the canonical export and have resetApp + walkOnboarding delegate to it.

Settings - Developer Options now 2/3 passing (was 0/3); cron-jobs-flow still 4/4. Remaining flake on the first test of a fresh launch is a warmup race, not a structural issue.

coderabbitai · 2026-05-15T22:07:07Z

No actionable comments were generated in the recent review. 🎉

ℹ️ Recent review info

⚙️ Run configuration

Configuration used: Organization UI

Review profile: CHILL

Plan: Pro

Run ID: e026cad8-bfb8-454c-9605-1181c9dd78a2

📥 Commits

Reviewing files that changed from the base of the PR and between 4d73f11 and b494e53.

📒 Files selected for processing (10)

Cargo.toml
app/scripts/e2e-build.sh
app/src-tauri/Cargo.toml
app/src/test/setup.ts
app/src/utils/config.ts
app/src/utils/tauriCommands/core.test.ts
app/src/utils/tauriCommands/core.ts
src/core/all.rs
src/openhuman/mod.rs
src/openhuman/test_support/rpc.rs

✅ Files skipped from review due to trivial changes (1)

app/src-tauri/Cargo.toml

🚧 Files skipped from review as they are similar to previous changes (5)

src/openhuman/mod.rs
app/src/utils/tauriCommands/core.ts
src/openhuman/test_support/rpc.rs
app/scripts/e2e-build.sh
src/core/all.rs

📝 Walkthrough

Walkthrough

This PR adds E2E testing infrastructure to streamline test isolation and startup navigation. A new backend test_reset RPC handler wipes persistent state (cron jobs, onboarding, API keys) in-place. Frontend build and helper layers enable dev-mode app bundles that reload via JavaScript instead of native restart, and introduce resetApp() to orchestrate state clearing, BootCheckGate dismissal, and onboarding completion. The cron-jobs spec is modernized from RPC-envelope testing to real UI flow interaction.

Changes

E2E Testing Infrastructure

Layer / File(s)	Summary
Backend test RPC infrastructure and state clearing `src/openhuman/test_support/{mod,rpc,schemas}.rs`, `src/openhuman/mod.rs`, `src/core/all.rs`, `src/openhuman/cron/store.rs`, `src/openhuman/cron/mod.rs`, `Cargo.toml`	Adds a feature-gated `test_support` module and `openhuman.test_reset` RPC that clears cron jobs, resets onboarding/API key and active user state, registers controller schemas, and wires them into core registry/catalog.
Frontend E2E build configuration and dev-like runtime `app/package.json`, `app/scripts/e2e-build.sh`, `app/src-tauri/Cargo.toml`, `app/src/utils/config.ts`, `app/src/utils/tauriCommands/core.ts`, `app/src/test/setup.ts`, `app/src/utils/tauriCommands/core.test.ts`	Adds `build:app:e2e` script and updates E2E build script to run a dev-mode frontend build and enable the `e2e-test-support` Cargo feature; introduces `IS_DEV_LIKE` to detect dev-like runtime and reload the window in that case.
E2E helper functions and onboarding flow `app/test/e2e/helpers/deep-link-helpers.ts`, `app/test/e2e/helpers/reset-app.ts`, `app/test/e2e/helpers/shared-flows.ts`	Adds `resetApp()` orchestrator that probes `openhuman.test_reset`, conditionally clears renderer storage and reloads, dismisses BootCheckGate, runs auth deep-link bypass, and walks onboarding. Adds inline and remote BootCheckGate dismissal helpers and rewrites `walkOnboarding()` to poll for the next button and dispatch synthetic mouse events.
Cron jobs spec real UI flow `app/test/e2e/specs/cron-jobs-flow.spec.ts`	Rewrites the cron-jobs E2E to drive the settings panel UI, using `resetApp()` for setup, browser helpers to interact with row action buttons, polling for Pause→Resume toggles, refresh persistence checks, and final removal verification via `openhuman.cron_list`.

Sequence Diagram

sequenceDiagram
  participant Spec as E2E Spec
  participant Reset as resetApp()
  participant RPC as openhuman.test_reset
  participant Renderer as App Renderer
  participant Gate as BootCheckGate Modal
  participant DeepLink as Auth Deep-Link
  participant Onboard as Onboarding Flow
  Spec->>Reset: resetApp(userId)
  Reset->>RPC: Call test_reset RPC with 8s timeout
  RPC->>RPC: Clear cron jobs, wipe config state
  Reset->>Renderer: Clear localStorage & sessionStorage
  Reset->>Renderer: Reload webview
  Reset->>Gate: dismissBootCheckGateIfVisible()
  Reset->>DeepLink: triggerAuthDeepLinkBypass(userId)
  DeepLink->>Gate: dismissBootCheckGateIfVisibleInline()
  DeepLink->>Reset: Auth complete
  Reset->>Onboard: walkOnboarding()
  Onboard->>Gate: dismissBootCheckGateIfVisible()
  Onboard->>Onboard: Poll onboarding next button, dispatch synthetic clicks
  Onboard->>Reset: Onboarding complete
  Reset->>Spec: Return userId
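
The sequence above can be read as a short orchestration function. The sketch below is an assumption-laden illustration, not the contents of reset-app.ts: every step is injected through a hypothetical `ResetDeps` interface, and the storage clear plus reload is made conditional on an actual wipe, as the commit messages and the changes table describe (the diagram draws it unconditionally).

```typescript
// Hypothetical dependency bundle; each member stands in for a real helper.
interface ResetDeps {
  testReset(): Promise<'wiped' | 'unreachable'>;
  clearStorageAndReload(): Promise<void>;
  dismissBootCheckGateIfVisible(): Promise<void>;
  triggerAuthDeepLinkBypass(userId: string): Promise<void>;
  walkOnboarding(): Promise<void>;
}

async function resetApp(userId: string, deps: ResetDeps): Promise<string> {
  // 1. Wipe sidecar state in place; an unreachable sidecar counts as pristine.
  const outcome = await deps.testReset();

  // 2. Reload only if something was wiped, so an already-accepted
  //    "Choose core mode" choice is not thrown away needlessly.
  if (outcome === 'wiped') {
    await deps.clearStorageAndReload();
  }

  // 3. Clear the gate before auth, then again afterwards in case it re-mounts.
  await deps.dismissBootCheckGateIfVisible();
  await deps.triggerAuthDeepLinkBypass(userId);
  await deps.dismissBootCheckGateIfVisible();

  // 4. Walk the real onboarding UI for the freshly authenticated user.
  await deps.walkOnboarding();
  return userId;
}
```

Injecting the steps keeps the ordering easy to check in isolation; the real helper presumably calls the harness functions directly.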

Estimated code review effort

🎯 3 (Moderate) | ⏱️ ~22 minutes

Possibly related issues

Stabilize E2E flake on first navigation after onboarding completes #1864: Changes to walkOnboarding and BootCheckGate dismissal map to the navigation-after-onboarding race described there.
Wire fix/e2e-tests changes into the all-flows CI runner #1865: Adds resetApp, test-support RPCs, and dev-mode E2E build/script updates that enable single-session all-flows runners requested there.
Convert RPC-heavy E2E specs to drive the UI past login #1860: Implements the E2E conversion pattern and cron-jobs UI-driven flow described in that issue.
Gate openhuman.test_reset behind an explicit E2E flag #1863: Adds build-time gating and feature wiring for openhuman.test_reset as requested; missing some runtime env guard requested in the issue.

Possibly related PRs

tinyhumansai/openhuman#1777: Overlaps on app/test/e2e/helpers/deep-link-helpers.ts changes to deep-link delivery.
tinyhumansai/openhuman#1779: Related BootCheckGate dismissal helpers invoked from waitForApp, complementary to dismissal logic added here.
tinyhumansai/openhuman#886: Overlaps pnpm/tauri build script and package.json script changes relevant to E2E build wiring.

Poem

🐰 I hopped through tests to clear the mess,

Doors unlatched, the gate's recess,
State wiped neat with RPC's cheer,
Onboarding trotted, all paths clear,
Cron dreams paused — the specs now bless.

🚥 Pre-merge checks | ✅ 5

✅ Passed checks (5 passed)

Check name	Status	Explanation
Description Check	✅ Passed	Check skipped - CodeRabbit’s high-level summary is enabled.
Title check	✅ Passed	The title accurately and specifically describes the main objective: fixing E2E tests to exercise UI past login by addressing the restart behavior issue.
Docstring Coverage	✅ Passed	Docstring coverage is 100.00% which is sufficient. The required threshold is 80.00%.
Linked Issues check	✅ Passed	Check skipped because no linked issues were found for this pull request.
Out of Scope Changes check	✅ Passed	Check skipped because no linked issues were found for this pull request.

Warning

There were issues while running some tools. Please review the errors and either fix the tool's configuration or disable the tool if it's a critical failure.

🔧 ESLint

If the error stems from missing dependencies, add them to the package.json file. For unrecoverable errors (e.g., due to private dependencies), disable the tool in the CodeRabbit configuration.

ESLint skipped: no ESLint configuration detected in root package.json. To enable, add eslint to devDependencies.

coderabbitai

Actionable comments posted: 3

🧹 Nitpick comments (2)

app/test/e2e/specs/cron-jobs-flow.spec.ts (2)
98-102: ⚡ Quick win

Position-based button selection is brittle.

Assuming labels[0] is the toggle button relies on DOM order which can change if the row layout is reorganized. Consider either:

Using data-testid="cron-job-toggle-${name}" for explicit targeting

Or filtering buttons by a known set of toggle labels (['Pause', 'Resume'])
♻️ Suggested approach using label filtering
-      // We care about the toggle button (first one in the row).
-      return labels[0] ?? null;
+      // Find the toggle button by its known labels.
+      const toggleLabels = ['Pause', 'Resume'];
+      return labels.find(l => toggleLabels.includes(l)) ?? null;
🤖 Prompt for AI Agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@app/test/e2e/specs/cron-jobs-flow.spec.ts` around lines 98 - 102, The current
selector assumes the first button in the row is the toggle by returning
labels[0], which is brittle; update the test in cron-jobs-flow.spec.ts to select
the toggle explicitly either by adding and querying a data-testid attribute
(e.g. querySelector(`[data-testid="cron-job-toggle-${name}"]`) using the cron
job name) or by filtering the buttons array (from
container.querySelectorAll('button') mapped to labels) for known toggle texts
like 'Pause' or 'Resume' and returning that match instead of labels[0]; adjust
the test to prefer the data-testid approach when available and fall back to
label-filtering using the labels variable.
64-70: ⚡ Quick win

Fragile class-based selector pattern.

The regex /text-sm font-semibold text-stone-900/ matches Tailwind utility classes which are prone to change during styling updates. Similarly, .closest('div.p-4') is coupled to layout implementation.

Consider adding data-testid attributes to the CoreJobList component's job rows and action buttons, then query by [data-testid="cron-job-row-${name}"] instead. This aligns with the testid-keyed approach used elsewhere in this PR for onboarding.
🤖 Prompt for AI Agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@app/test/e2e/specs/cron-jobs-flow.spec.ts` around lines 64 - 70, The test
uses fragile class-based selectors in the anonymous row-finder (the code
building rows via /text-sm font-semibold text-stone-900/ and
.closest('div.p-4')), so add stable data-testid attributes in the CoreJobList
component for job rows and action buttons (e.g.
data-testid="cron-job-row-<name>" and data-testid="cron-job-action-<name>") and
update the test's row lookup (the rows variable / container usage) to query by
document.querySelector(`[data-testid="cron-job-row-${name}"]`) and related
data-testid selectors instead of the regex and .closest layout-based selector.

🤖 Prompt for all review comments with AI agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

Inline comments:
In `@app/src/utils/tauriCommands/core.ts`:
- Around line 82-91: The code in core.ts reads import.meta.env.MODE and
import.meta.env.DEV directly; replace those direct env reads with the
centralized exports from app/src/utils/config.ts by importing the exported
values (e.g., MODE and DEV or a boolean like isDevMode) and use them when
computing isDevLike and in the debug message; update the reference in the
isDevLike expression (currently using IS_DEV and import.meta.env.MODE) and the
console.debug string (currently referencing import.meta.env.MODE and
import.meta.env.DEV) to use the config exports instead so no direct
import.meta.env access remains in core.ts and the logic still uses IS_DEV plus
the re-exported mode flag.

In `@src/core/all.rs`:
- Around line 185-186: The code is registering the destructive E2E RPC
openhuman.test_reset via
controllers.extend(crate::openhuman::test_support::all_test_support_registered_controllers()),
which exposes a state-wiping endpoint in production; change this so test-support
controllers are only added under a hard gate (e.g., compile-time cfg or an
explicit runtime opt-in), by removing the unconditional call to
all_test_support_registered_controllers() and only invoking it when a secure
guard is present (for example cfg(test) / cfg(feature = "e2e_test_support") or a
checked env/flag like ENABLE_E2E_TEST_SUPPORT), ensuring
openhuman::test_support::all_test_support_registered_controllers and the exposed
openhuman.test_reset remain unavailable in normal production builds.

In `@src/openhuman/test_support/rpc.rs`:
- Around line 33-77: The reset() flow lacks debug/trace instrumentation; add
log::debug or tracing::debug/trace calls at function entry/exit and before/after
each external step (Config::load_or_init, cron::clear_all_jobs, Config::save,
default_root_openhuman_dir, clear_active_user) and on branch points (values of
onboarding_was_completed, api_key_was_set) and errors, using stable
grep-friendly prefixes like "[test_reset]" and include contextual fields (e.g.,
cron_jobs_removed, root path, summary) so callers can correlate; keep final info
log but emit earlier trace/debug lines around creating ResetSummary and before
returning RpcOutcome::new to record the outcome.

ℹ️ Review info

⚙️ Run configuration

Configuration used: Organization UI

Review profile: CHILL

Plan: Pro

Run ID: 09e6a036-8b12-49ce-bae1-f2aa6700a76f

📥 Commits

Reviewing files that changed from the base of the PR and between 6cb9cbb and 4d73f11.

📒 Files selected for processing (14)

app/package.json
app/scripts/e2e-build.sh
app/src/utils/tauriCommands/core.ts
app/test/e2e/helpers/deep-link-helpers.ts
app/test/e2e/helpers/reset-app.ts
app/test/e2e/helpers/shared-flows.ts
app/test/e2e/specs/cron-jobs-flow.spec.ts
src/core/all.rs
src/openhuman/cron/mod.rs
src/openhuman/cron/store.rs
src/openhuman/mod.rs
src/openhuman/test_support/mod.rs
src/openhuman/test_support/rpc.rs
src/openhuman/test_support/schemas.rs

…s CodeRabbit feedback

Three review fixes from PR tinyhumansai#1859:

- **Critical**: gate `openhuman.test_reset` behind a new `e2e-test-support` cargo feature. Without the feature the controller is neither registered nor declared, so `try_invoke_registered_rpc` cannot route to it and `schema_for_rpc_method` returns None — the destructive wipe RPC simply does not exist in shipped binaries. The E2E build script and the Tauri shell crate both forward the feature on.
- **Major**: route `import.meta.env.MODE` through a new `IS_DEV_LIKE` export in `app/src/utils/config.ts` so consumer code (`restartApp`) doesn't reach into `import.meta.env` directly. Keeps the canonical env-reading boundary intact per the codebase convention.
- **Major**: add debug/trace instrumentation across every step of the reset lifecycle in `src/openhuman/test_support/rpc.rs`. Entry/exit, per-step start/ok, with the cron-row count and the resolved root dir surfaced so failures during the wipe are diagnosable from the log.

Also fixes the failing Frontend Unit Tests check: the `restartApp` test that asserted on the literal debug-log message broke when the message changed; restore the original "dev mode → window.location.reload()" text now that `IS_DEV_LIKE` carries the meaning.

Defer the testid nitpick to follow-up issue tinyhumansai#1861.

senamakel added 7 commits May 15, 2026 02:04

chore: apply auto-fixes

0f5e4e6

chore: apply auto-fixes

4d73f11

senamakel requested a review from a team May 15, 2026 22:06

coderabbitai Bot requested changes May 15, 2026

senamakel added 2 commits May 15, 2026 15:23

chore: apply auto-fixes

b494e53

aregmii mentioned this pull request May 15, 2026

fix(lint): make commands-tokens script fail with a clear error when ripgrep is missing + add to CONTRIBUTING prereqs #1867

Merged

12 tasks

coderabbitai Bot approved these changes May 15, 2026

senamakel merged commit 2d54e46 into tinyhumansai:main May 15, 2026
24 checks passed

coderabbitai Bot mentioned this pull request May 16, 2026

feat: redesign onboarding, boot, and LLM settings end-to-end #1885

Merged

12 tasks

senamakel mentioned this pull request May 16, 2026

test(e2e): convert ~22 stub specs to resetApp() UI-driven pattern #1889

Merged

This was referenced May 16, 2026

test(e2e): deep chat-harness coverage + streaming mock LLM + rust-e2e Linux lane #1892

Merged

feat(core): add authenticated static directory hosting #1966

Merged

feat(runtime): add javascript facade and skill creator agent #1971

Merged

senamakel merged 9 commits into tinyhumansai:main from senamakel:fix/e2e-tests
