fix: sanitize shell-derived env values before building auth headers by KYankee6 · Pull Request #398 · robinebers/openusage

KYankee6 · 2026-04-22T06:38:29Z

Description

Fixes the Z.ai failure mode where OpenUsage reads a valid API key from an interactive shell, but hidden ANSI/control bytes in shell startup output contaminate the resolved value and make the Authorization header invalid.

The fix hardens the shared host env resolver instead of patching only the Z.ai plugin:

- add explicit start/end markers around interactive-shell env reads
- sanitize ANSI/control sequences from resolved env values
- add regression tests for noisy shell output and inline control bytes

Related Issue

Fixes #358

Type of Change

Testing

- I ran `bun run build` and it succeeded
- I ran `bun run test` and all tests pass
- I tested the change locally with `bun tauri dev`
- I ran `cargo test --manifest-path src-tauri/Cargo.toml`
- I ran `cargo check --manifest-path src-tauri/Cargo.toml`

Checklist

- I read CONTRIBUTING.md
- I read CODE_OF_CONDUCT.md
- My PR targets the main branch
- I did not introduce new dependencies without justification

Summary by cubic

Sanitizes env values from interactive shells and direct env reads to prevent ANSI/control noise from corrupting API keys and breaking Authorization headers (notably for Z.ai). Hardens the shared host env resolver so all plugins benefit; also fixes empty-marker cases.

Bug Fixes
- Use start/end markers for interactive shell reads; parse only the marked value. If markers exist but payload is empty, return None (no fallback). Keep sanitized fallback only when markers are missing.
- Strip ANSI/control bytes and trim across shell and process env reads; added regression tests for noisy startup output, inline control bytes, and empty marked payload.
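The resolution order described in this summary (process env first, interactive shell as a fallback, both sanitized) might look roughly like the sketch below. It is not the PR's code: `resolve_env`, `clean`, and the injected `shell_read` closure are hypothetical names, and `clean` only drops control characters and trims, without the ANSI-sequence stripping the PR adds.

```rust
/// Hypothetical stand-in for the PR's sanitizer: drops control characters,
/// trims whitespace, and maps an empty result to `None`.
fn clean(raw: &str) -> Option<String> {
    let kept: String = raw.chars().filter(|c| !c.is_control()).collect();
    let trimmed = kept.trim();
    if trimmed.is_empty() {
        None
    } else {
        Some(trimmed.to_string())
    }
}

/// Resolve `name` from the process env first, then from a shell reader.
/// `shell_read` is injected so the fallback can be exercised without
/// spawning a real shell (an assumption made for this sketch).
fn resolve_env<F>(name: &str, shell_read: F) -> Option<String>
where
    F: FnOnce(&str) -> Option<String>,
{
    std::env::var(name)
        .ok()
        .and_then(|v| clean(&v))
        .or_else(|| shell_read(name).and_then(|v| clean(&v)))
}
```

Both paths go through the same `clean` step, which mirrors the summary's point that sanitization applies to process reads as well as shell reads.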

Written for commit 7cf7a6f. Summary will update on new commits.

OpenUsage falls back to reading whitelisted env vars from an interactive shell when the GUI process does not inherit them directly. Some zsh setups emit ANSI/control sequences on stdout during startup, which can stick to the resolved API key and make the Authorization header invalid.

This change wraps shell reads in explicit start/end markers, sanitizes resolved env values, and adds regression tests for noisy shell output and inline terminal control sequences.

Constraint: GUI launches may not inherit terminal-only env vars
Constraint: Interactive shell startup output can contain ANSI/control bytes
Rejected: Restrict fix to the Z.ai plugin | other providers share the same host env resolver
Rejected: Rely on shell config hygiene alone | the app still needs defensive parsing
Confidence: high
Scope-risk: narrow
Reversibility: clean
Directive: Treat shell stdout as untrusted whenever it is used as structured data
Tested: bun run build
Tested: bun run test
Tested: cargo test --manifest-path src-tauri/Cargo.toml
Tested: cargo check --manifest-path src-tauri/Cargo.toml
Not-tested: bun tauri dev manual UI verification
Related: robinebers#358
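To illustrate why a stray control byte makes the Authorization header invalid, here is a minimal sketch of the kind of check HTTP client libraries commonly apply to header values. The names `is_valid_header_value` and `bearer_header` are hypothetical, the rule is a conservative approximation rather than any particular library's behavior, and the `Bearer` scheme is used purely for illustration.

```rust
/// Returns true if every byte is acceptable in a header value under a
/// conservative rule: horizontal tab, or any byte >= 0x20 except DEL (0x7F).
/// ESC (0x1B), CR, and LF are all rejected by this rule.
fn is_valid_header_value(value: &str) -> bool {
    value.bytes().all(|b| b == b'\t' || (b >= 0x20 && b != 0x7f))
}

/// Builds an Authorization value, refusing keys that still carry
/// control bytes from shell output.
fn bearer_header(api_key: &str) -> Option<String> {
    let value = format!("Bearer {api_key}");
    is_valid_header_value(&value).then_some(value)
}
```

A key that is correct apart from a leading `ESC [ 0 m` fails this check, which matches the symptom in the linked issue: the key is valid but the request cannot be built.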

cubic-dev-ai

1 issue found across 1 file

Prompt for AI agents (unresolved issues)


Check if these issues are valid — if so, understand the root cause of each and fix them. If appropriate, use sub-agents to investigate and fix each issue separately.


<file name="src-tauri/src/plugin_engine/host_api.rs">

<violation number="1" location="src-tauri/src/plugin_engine/host_api.rs:84">
P1: Fallback parsing can misinterpret marker output for empty env vars and return `__OPENUSAGE_ENV_END__` as the value.</violation>
</file>


Interactive-shell env reads use start/end markers so GUI launches can recover terminal-only variables. An empty env var produced a complete marker block with no payload, but the parser treated that the same as a marker miss and fell back to whole-output parsing. That fallback could select the end marker itself as the resolved value.

This change distinguishes complete marker blocks from true fallback cases, returning None for empty marked output while preserving the no-marker sanitization path.

Constraint: GUI env recovery still needs a no-marker fallback for noisy shell output
Constraint: Empty env vars must stay empty instead of becoming marker text
Rejected: Remove fallback parsing entirely | would regress shell-noise recovery for GUI launches
Rejected: Special-case only the exact end marker string | leaves the real state distinction ambiguous
Confidence: high
Scope-risk: narrow
Reversibility: clean
Directive: Keep marker-aware parsing separate from generic stdout sanitization so empty payloads stay empty
Tested: cargo test --manifest-path src-tauri/Cargo.toml parse_interactive_shell_env_output -- --nocapture
Tested: cargo test --manifest-path src-tauri/Cargo.toml extract_marked_value -- --nocapture
Tested: cargo check --manifest-path src-tauri/Cargo.toml
Not-tested: bun tauri dev manual UI verification
Related: robinebers#358
Related: robinebers#398
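The state distinction this commit describes can be sketched as a three-way result instead of a plain `Option`. This is an illustration under assumptions, not the PR's implementation: the `Marked` enum, `extract_marked`, and `parse_shell_output` are hypothetical names, the start marker spelling is guessed from the end marker quoted in the review, and the "last non-empty line" fallback is a simplification.

```rust
const START_MARKER: &str = "__OPENUSAGE_ENV_START__"; // assumed spelling
const END_MARKER: &str = "__OPENUSAGE_ENV_END__";

#[derive(Debug, PartialEq)]
enum Marked {
    /// Both markers found with a non-empty payload between them.
    Value(String),
    /// Both markers found but nothing between them (unset or empty variable).
    Empty,
    /// Markers missing or incomplete; the caller may try a fallback.
    Missing,
}

fn extract_marked(stdout: &str) -> Marked {
    let Some(start) = stdout.find(START_MARKER) else {
        return Marked::Missing;
    };
    let after = &stdout[start + START_MARKER.len()..];
    let Some(end) = after.find(END_MARKER) else {
        return Marked::Missing;
    };
    let payload = after[..end].trim();
    if payload.is_empty() {
        Marked::Empty
    } else {
        Marked::Value(payload.to_string())
    }
}

fn parse_shell_output(stdout: &str) -> Option<String> {
    match extract_marked(stdout) {
        Marked::Value(v) => Some(v),
        // A complete but empty block is a definite answer: do not fall back,
        // otherwise the end marker line could be picked up as the value.
        Marked::Empty => None,
        // Simplified fallback: take the last non-empty line of the output.
        Marked::Missing => stdout
            .lines()
            .rev()
            .map(str::trim)
            .find(|l| !l.is_empty())
            .map(str::to_string),
    }
}
```

Collapsing `Empty` and `Missing` into one `None` is what allowed the fallback to run on a complete marker block; keeping them apart makes that path unreachable.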

validatedev · 2026-04-22T16:35:06Z

@codex review

Copilot

Pull request overview

Hardens the host-side environment variable resolver to prevent interactive-shell output (ANSI/control noise) from contaminating API keys and producing invalid Authorization headers (notably impacting the Z.ai provider).

Changes:

Adds start/end markers around interactive-shell env reads and parses only the marked payload.
Introduces sanitization to strip ANSI/control characters and normalize env values for both process and shell-derived reads.
Adds regression tests covering noisy shell output, inline control bytes, and empty marked payload handling.


Copilot · 2026-04-22T16:38:41Z

+    let mut cleaned = if let Ok(ansi_re) = regex_lite::Regex::new(r"\x1B\[[0-?]*[ -/]*[@-~]") {
+        ansi_re.replace_all(text, "").to_string()
    } else {
-        Some(trimmed.to_string())
+        text.to_string()
+    };


The ANSI/escape stripping regex only removes CSI sequences (`ESC [ ...`), but other common terminal escape sequences (e.g., OSC: `ESC ] ... BEL` / `ESC \`, single-char ESC sequences like `ESC ( B`) can survive. Because `retain` then drops the ESC control byte but leaves the rest of the sequence’s printable payload, `sanitize_env_value` can still return a contaminated key (e.g., a `"]0;title"` prefix). Consider expanding sanitization to remove OSC and other ESC sequences (or stripping any ESC-initiated sequence) before filtering control chars.
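One way to cover the cases listed in this comment is a small hand-written scanner that consumes any ESC-initiated sequence before dropping the remaining control characters. The sketch below is an assumption-laden illustration using only the standard library, not the regex-based code in the PR; `strip_escapes` is a hypothetical name, and the sequence grammar is simplified to CSI, OSC, and short ESC sequences with optional intermediates.

```rust
/// Removes CSI, OSC, and short ESC sequences, then drops other control
/// characters and trims. Returns `None` if nothing printable remains.
fn strip_escapes(input: &str) -> Option<String> {
    let mut out = String::with_capacity(input.len());
    let mut chars = input.chars().peekable();
    while let Some(c) = chars.next() {
        if c != '\u{1b}' {
            if !c.is_control() {
                out.push(c);
            }
            continue;
        }
        match chars.next() {
            // CSI: parameters/intermediates, then one final byte in '@'..='~'.
            Some('[') => {
                for n in chars.by_ref() {
                    if ('@'..='~').contains(&n) {
                        break;
                    }
                }
            }
            // OSC: runs until BEL or the two-character terminator ESC '\'.
            Some(']') => {
                while let Some(n) = chars.next() {
                    if n == '\u{7}' {
                        break;
                    }
                    if n == '\u{1b}' {
                        if chars.peek() == Some(&'\\') {
                            chars.next();
                        }
                        break;
                    }
                }
            }
            // Other escapes such as ESC ( B: intermediates ' '..='/' and
            // then a single final character, all of which are discarded.
            Some(mut n) => {
                while (' '..='/').contains(&n) {
                    match chars.next() {
                        Some(next) => n = next,
                        None => break,
                    }
                }
            }
            None => {}
        }
    }
    let trimmed = out.trim();
    if trimmed.is_empty() {
        None
    } else {
        Some(trimmed.to_string())
    }
}
```

With this approach an OSC title sequence in front of the key is consumed as a unit, so no `]0;title` remnant survives the later control-character filter.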

Copilot · 2026-04-22T16:38:41Z

+fn sanitize_env_value(text: &str) -> Option<String> {
+    let mut cleaned = if let Ok(ansi_re) = regex_lite::Regex::new(r"\x1B\[[0-?]*[ -/]*[@-~]") {
+        ansi_re.replace_all(text, "").to_string()


`sanitize_env_value` recompiles the same regex on every call. Since env resolution can happen frequently (and is used by both process and interactive-shell reads), consider compiling the regex once (e.g., via `OnceLock`/`Lazy`) and reusing it to avoid repeated compilation overhead.
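The compile-once pattern suggested here can be sketched with `std::sync::OnceLock`. To stay free of external crates, the example uses a hypothetical `Pattern` struct as a stand-in for a compiled regex, and a counter that exists only to make the single initialization observable; none of these names come from the PR.

```rust
use std::sync::atomic::{AtomicUsize, Ordering};
use std::sync::OnceLock;

/// Counts how often the expensive setup runs (for demonstration only).
static BUILDS: AtomicUsize = AtomicUsize::new(0);

/// Hypothetical stand-in for a compiled pattern such as a regex object.
struct Pattern {
    escape: char,
}

/// Stand-in for the costly compilation step.
fn build_pattern() -> Pattern {
    BUILDS.fetch_add(1, Ordering::SeqCst);
    Pattern { escape: '\u{1b}' }
}

/// Returns the shared pattern, building it on first use only.
fn pattern() -> &'static Pattern {
    static PATTERN: OnceLock<Pattern> = OnceLock::new();
    PATTERN.get_or_init(build_pattern)
}

/// Example caller: every call reuses the same `Pattern` instance.
fn contains_escape(text: &str) -> bool {
    text.contains(pattern().escape)
}
```

Calling `contains_escape` repeatedly runs `build_pattern` once; with a real regex the same shape would hold, with the compiled regex stored in the `OnceLock`.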

Copilot · 2026-04-22T16:38:41Z

+    let script = format!(
+        "printf '{}\\n'; printenv {}; printf '{}\\n'",
+        START_MARKER, name, END_MARKER
+    );


The interactive-shell script interpolates `name` directly into the command string (`printenv {}`) without quoting/escaping. Today this is mitigated by the JS-side whitelist, but this helper is generic and could be reused from Rust with non-whitelisted input later. Consider adding a strict env-var-name validation (e.g., `[A-Z0-9_]+`) before building the script and/or quoting the argument in a shell-safe way (`printenv -- ...`) to prevent command injection.
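A validation step of the kind suggested in this comment could look like the sketch below. It is a hypothetical illustration: `is_safe_env_name` and `build_env_script` are made-up names, the start marker spelling is assumed, `ZAI_API_KEY` in the usage note is only an example name, and the rule is slightly stricter than `[A-Z0-9_]+` in that it also rejects a leading digit.

```rust
const START_MARKER: &str = "__OPENUSAGE_ENV_START__"; // assumed spelling
const END_MARKER: &str = "__OPENUSAGE_ENV_END__";

/// Accepts only names made of A-Z, 0-9, and '_', not starting with a digit.
/// Anything containing spaces, quotes, ';', '$', etc. is rejected.
fn is_safe_env_name(name: &str) -> bool {
    let mut chars = name.chars();
    matches!(chars.next(), Some(c) if c.is_ascii_uppercase() || c == '_')
        && chars.all(|c| c.is_ascii_uppercase() || c.is_ascii_digit() || c == '_')
}

/// Builds the marker-wrapped shell script, or `None` for an unsafe name.
fn build_env_script(name: &str) -> Option<String> {
    if !is_safe_env_name(name) {
        return None;
    }
    Some(format!(
        "printf '{}\\n'; printenv {}; printf '{}\\n'",
        START_MARKER, name, END_MARKER
    ))
}
```

Under these assumptions `build_env_script("ZAI_API_KEY")` yields a script, while an input such as `"FOO; rm -rf ~"` returns `None` before any string is handed to the shell.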

chatgpt-codex-connector · 2026-04-22T16:38:50Z

Codex Review: Didn't find any major issues. Delightful!

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you:

- Open a pull request for review
- Mark a draft as ready
- Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

robinebers · 2026-04-28T07:57:39Z

Great, thank you!

Interactive-shell env reads use start/end markers so GUI launches can recover terminal-only variables. An empty env var produced a complete marker block with no payload, but the parser treated that the same as a marker miss and fell back to whole-output parsing. That fallback could select the end marker itself as the resolved value.

This change distinguishes complete marker blocks from true fallback cases, returning None for empty marked output while preserving the no-marker sanitization path.

Constraint: GUI env recovery still needs a no-marker fallback for noisy shell output
Constraint: Empty env vars must stay empty instead of becoming marker text
Rejected: Remove fallback parsing entirely | would regress shell-noise recovery for GUI launches
Rejected: Special-case only the exact end marker string | leaves the real state distinction ambiguous
Confidence: high
Scope-risk: narrow
Reversibility: clean
Directive: Keep marker-aware parsing separate from generic stdout sanitization so empty payloads stay empty
Tested: cargo test --manifest-path src-tauri/Cargo.toml parse_interactive_shell_env_output -- --nocapture
Tested: cargo test --manifest-path src-tauri/Cargo.toml extract_marked_value -- --nocapture
Tested: cargo check --manifest-path src-tauri/Cargo.toml
Not-tested: bun tauri dev manual UI verification
Related: robinebers#358
Related: robinebers#398
(cherry picked from commit 7cf7a6f)

KYankee6 requested review from davidarny, robinebers and validatedev as code owners April 22, 2026 06:38

github-actions Bot added the rust label (Pull requests that update rust code) Apr 22, 2026

cubic-dev-ai Bot reviewed Apr 22, 2026


Comment thread src-tauri/src/plugin_engine/host_api.rs Outdated

KYankee6 mentioned this pull request Apr 22, 2026

Z.AI provider fails with “Usage request failed. Check your connection.” despite valid API key #358

Closed

KYankee6 force-pushed the fix/zai-env-header-noise branch from 050513d to 5a7de06 Compare April 22, 2026 06:45

validatedev requested a review from Copilot April 22, 2026 16:34

Copilot started reviewing on behalf of validatedev April 22, 2026 16:35

Copilot AI reviewed Apr 22, 2026


robinebers merged commit 77ef460 into robinebers:main Apr 28, 2026
6 checks passed

robinebers merged 2 commits into robinebers:main from KYankee6:fix/zai-env-header-noise

4 participants
