fix(refresh): replace `codex exec` fallback with app-server JSON-RPC by Cmochance · Pull Request #33 · Cmochance/Codex_Account_Switch

Cmochance · 2026-05-10T06:48:27Z

Summary

Replace the codex exec "Reply with the single word OK." fallback in the per-card Refresh button with codex app-server's JSON-RPC account/read + account/rateLimits/read. Same backend data, no LLM round-trip, no quota consumed, returns within seconds instead of 30–90 s.
Drop the post-spawn JSONL session-file scan + cleanup that was only needed because the legacy codex exec had to provoke a session write to leak rate-limit data.
New shared module src-tauri/shared/runtime/codex_app_server.rs implements a stdio JSON-RPC 2.0 client (newline-delimited frames, mandatory initialize + initialized handshake, monotonic id matching, per-request + overall session deadlines, ChildGuard that always kills + reaps, dedicated reader + stderr-drain threads to avoid pipe back-pressure deadlocks). Verified against openai/codex codex-rs/app-server upstream protocol.
PlatformHooks::run_codex_auth_refresh (AppResult<()>) renamed to fetch_account_via_app_server (AppResult<AppServerSnapshot>); both mac and Windows process.rs build a Command and the shared client drives it.
Hardened three macOS discover_real_codex_cli_path_* tests that mutate HOME/PATH against parallel-execution races by threading them through the existing env_guard() mutex.

Compatibility

Requires codex ≥ 0.130.0 on the fallback path (the primary HTTP path is unchanged and works on all versions). Older CLIs surface a new APP_SERVER_METHOD_UNSUPPORTED error so users get an upgrade hint instead of an opaque hang.

Why

For users in slow-network regions (or whenever the direct HTTP refresh hit a transient 401 / GFW interference), every Refresh click that fell through to the legacy path wasted real ChatGPT quota and made the user wait 30–90 s for a model to reply with one token. The app-server protocol exposes the same data without the model spend.

Test plan

cargo test --lib mac: 84/84 pass (10 + 3 new tests on the JSON-RPC client cover routing, plan extraction, error mapping, partial-data branches).
Manual: open the app, click Refresh on a profile while disabling the network long enough for the HTTP fast path to fail (e.g. sudo pfctl or unplug Wi-Fi briefly), verify fallback now completes in seconds without a quota deduction.
Manual: same flow with codex < 0.130.0 — verify the user sees APP_SERVER_METHOD_UNSUPPORTED toast hinting at upgrade, not a hang.
CI: macOS + Windows runners.

Follow-ups (not in this PR)

drain_stderr_in_background writes the child's stderr straight to /dev/null; on a child crash mid-handshake the user sees APP_SERVER_PIPE_CLOSED with no context. Buffering the last N KB and appending to error messages is a small structural improvement.
try_refresh_via_chatgpt_api swallows every HTTP error and falls through. For some cases (e.g. matching the relogin-required token signatures) it could short-circuit to AUTH_REFRESH_RELOGIN_REQUIRED directly without a second app-server round-trip.
mac/runtime/refresh_runtime.rs and win/runtime/refresh_runtime.rs are now near-mirror images of each other — candidate for a shared helper in a separate refactor.

🤖 Generated with Claude Code

The per-card Refresh button's fallback path used to spawn `codex exec "Reply with the single word OK."` whenever the direct ChatGPT HTTP refresh failed. That path: * burned ~30–90 s on the user's clock (real LLM round-trip) * consumed real ChatGPT quota * relied on parsing the resulting session JSONL to extract rate limits, which only worked because codex happened to write that data on every request It is replaced by `codex app-server`'s JSON-RPC interface: `account/read` (refreshToken=true) for plan tier + auth.json refresh, followed by `account/rateLimits/read` for the live primary/secondary rate-limit windows. Same backend endpoint, no LLM round-trip, returns within seconds. Implementation lives in `shared/runtime/codex_app_server.rs`: a stdio JSON-RPC 2.0 client driving the upstream protocol verified against `openai/codex` `codex-rs/app-server` (newline-delimited JSON, mandatory `initialize` + `initialized` handshake, monotonic id matching with per-request and overall session deadlines, ChildGuard that always kills + reaps on every exit path, dedicated reader and stderr-drain threads to avoid pipe back-pressure deadlocks). Mac and Windows `process.rs` build the `Command` (Windows hides the console window via the existing `hide_console_window` helper); the shared client runs the rest. The `PlatformHooks` trait method renamed `run_codex_auth_refresh` → `fetch_account_via_app_server` with a return type that surfaces both plan + quota in one shot, so both `mac/runtime/refresh_runtime.rs` and `win/runtime/refresh_runtime.rs` drop the post-spawn JSONL scan and sandbox-cleanup dance. Requires `codex` ≥ 0.130.0 on the fallback path; older CLIs surface the new `APP_SERVER_METHOD_UNSUPPORTED` error so the user can upgrade instead of hitting an opaque hang. Also: hardened three `discover_real_codex_cli_path_*` macOS tests that mutate `HOME` / `PATH` against parallel-execution races by routing them through the existing `env_guard()` mutex (these were flaky before this PR; touching the same file made it cheap to fix). Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

Cmochance · 2026-05-10T06:48:45Z

用户手测清单（push 后请在本机跑一遍）

PR-1 替换的是"刷新按钮失败兜底"的路径，最常见触发条件是网络抖一下让直连 HTTP 失败。建议测试：

1. 正常路径（未变）

打开 App，正常点某张卡的"刷新"按钮
预期：和之前一样几秒内出新 quota（直连 HTTP 路径，未改）

2. 兜底路径（核心改动）

临时屏蔽掉 chatgpt.com 让 fast path 失败：

sudo pfctl -e
echo "block drop quick from any to any port 443" | sudo pfctl -ef -
# 或更简单：临时关 Wi-Fi 让一次刷新失败

点刷新
预期 (新行为)：几秒内（≤10s）出 quota；不再消耗任何 ChatGPT 额度；不再出现 30–90s 卡死的 spinner
取消 pfctl 屏蔽：sudo pfctl -d

3. 老版 codex CLI 兼容性

如果你装的 codex < 0.130.0：兜底路径会报 APP_SERVER_METHOD_UNSUPPORTED，提示升级
codex --version 看一下；当前主流是 ≥ 0.130

4. 已登录账号 token 失效

拿一个已经 expired 的 profile 点刷新
预期：和之前一样 toast 本账号会话已过期，请重新登录（错误码 AUTH_REFRESH_RELOGIN_REQUIRED，已在新代码里 mapping）

5. 没装 codex CLI

临时把 codex CLI rename 掉
预期：toast 报 REAL_CODEX_NOT_FOUND，弹出 Codex CLI 路径设置对话框（已有行为，未改）

CI 全绿后，等你手测确认 → squash merge。

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: d493819846

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

`handleRefreshProfile` was missing the symmetric counterpart of `handleLoginProfile`'s `isRefreshPending(profile)` guard. With the old code, clicking Refresh on a profile whose login was still in flight would push it onto the refresh queue; once the worker drained, refresh and login could race on writing the same per-profile `auth.json`. Atomic writes prevented torn data, but whichever lost the race got silently overwritten. Add `state.loginActiveProfile === profile` to the early-return. Cross-profile Refresh during a login is still allowed (different sandboxes, different `auth.json`s). Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

Cmochance · 2026-05-10T07:13:14Z

追加 commit：refresh ↔ login 同 profile 互锁

补一条 3 行守卫到 handleRefreshProfile：当同一个卡片的 login 还在飞时，refresh 点击直接 no-op，对称掉 handleLoginProfile 那边已有的 isRefreshPending 守卫。跨 profile 的 refresh 不受影响（不同沙箱，不同 auth.json）。

增量手测建议

卡 A 点 Login，OAuth 浏览器还没完成时，立刻点同卡 Refresh
预期：refresh 按钮点击没反应，等 login 完成后再点恢复正常
跨卡片：login A 在飞时点 Refresh B —— 预期正常进队列

Codex bot raised a P2 on PR #33: the new fallback's hard 8 s deadline on every JSON-RPC call could fail on the exact "slow network" scenarios it is meant to recover from, since `account/read` with `refreshToken: true` chains an OAuth refresh round-trip + account read on the server side, and either leg can legitimately take 5–15 s on a high-latency link. Split the per-method budget so `account/read` gets 25 s (covers chained OAuth refresh + read) while the lighter `account/rateLimits/read` GET keeps a 15 s ceiling — matching `chatgpt_api.rs:109` `HTTP_TIMEOUT`. Session ceiling raised to 60 s with margin to cover handshake + both calls at their per-method maxima. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

Bumps `package.json` to 1.5.8 (`version-sync.mjs` propagates to Cargo.toml + lockfiles) and stamps the 1.5.8 CHANGELOG entry. Patches landing as 1.5.8: - **#33** — Replace the legacy `codex exec "Reply with the single word OK."` refresh fallback with `codex app-server`'s JSON-RPC `account/read` + `account/rateLimits/read`. No more burning user quota for an LLM round-trip just to provoke a session-file write. Also closes a same-profile Refresh-vs-Login race and tunes the RPC timeouts so slow-network users still benefit from the fallback instead of tripping `APP_SERVER_TIMEOUT`. Co-authored-by: Claude Opus 4.7 <noreply@anthropic.com>

chatgpt-codex-connector Bot reviewed May 10, 2026

View reviewed changes

Comment thread src-tauri/shared/runtime/codex_app_server.rs Outdated

Cmochance merged commit 4775641 into main May 10, 2026
3 checks passed

Cmochance deleted the feat/refresh-app-server-rpc branch May 10, 2026 07:21

Cmochance mentioned this pull request May 10, 2026

chore(release): prep v1.5.8 #34

Merged

4 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix(refresh): replace `codex exec` fallback with app-server JSON-RPC#33

fix(refresh): replace `codex exec` fallback with app-server JSON-RPC#33
Cmochance merged 3 commits into
mainfrom
feat/refresh-app-server-rpc

Cmochance commented May 10, 2026

Uh oh!

Cmochance commented May 10, 2026

Uh oh!

chatgpt-codex-connector Bot left a comment

Uh oh!

Uh oh!

Cmochance commented May 10, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

Cmochance commented May 10, 2026

Summary

Compatibility

Why

Test plan

Follow-ups (not in this PR)

Uh oh!

Cmochance commented May 10, 2026

用户手测清单（push 后请在本机跑一遍）

1. 正常路径（未变）

2. 兜底路径（核心改动）

3. 老版 codex CLI 兼容性

4. 已登录账号 token 失效

5. 没装 codex CLI

Uh oh!

chatgpt-codex-connector Bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

Uh oh!

Cmochance commented May 10, 2026

追加 commit：refresh ↔ login 同 profile 互锁

增量手测建议

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant