Refactor: direct spawn_executable (remove shell escaping) by ppsplus-bradh · Pull Request #42 · guess/claude_code

ppsplus-bradh · 2026-03-27T04:03:02Z

Summary

Replace /bin/sh -c command string invocation with direct Port.open({:spawn_executable, ...}) using native :args, :env, and :cd port options
Delete build_shell_command/4, shell_escape/1, and @shell_safe_pattern — shell escaping is no longer needed
MCP tool DSL: move description from positional argument to inside the block

Motivation

open_cli_port/4 previously spawned the CLI by building a single shell command string:

cd /path && KEY1='val1' KEY2='val2' exec /path/to/claude --flag arg

This required every component (env values, paths, arguments) to be shell-escaped via shell_escape/1. That function has already had one bug (#38) where the trigger list missed characters like !, #, <, >, ?, [, ], *, ~, and tab. Even after the safelist fix (#39), hand-rolled shell escaping remains a maintenance burden and a source of correctness risk.

Erlang's Port.open/2 with {:spawn_executable, path} passes arguments directly to execvp(3) — no shell is involved. This is the approach recommended by the Erlang Security WG and what Elixir's own System.cmd/3 uses internally.

Concern	Before (`sh -c`)	After (direct)
Arguments	Concatenated + escaped string	`{:args, ["--flag", "value"]}` — passed directly
Env vars	`KEY='escaped_val'` prefix in shell string	`{:env, [{~c"KEY", ~c"val"}]}` — set by Erlang runtime
Working dir	`cd /path &&` prefix in shell string	`{:cd, ~c"/path"}` — set by Erlang runtime
Special chars	Must be escaped for shell interpretation	Not interpreted — passed as-is
Process tree	sh → exec → CLI	CLI directly

Changes

open_cli_port/4 — rewritten to spawn the CLI binary directly:

defp open_cli_port(executable, args, state, opts) do
  exe_path = executable |> String.to_charlist() |> :os.find_executable()
  if !exe_path, do: raise "CLI executable not found: #{executable}"

  env_list = prepare_env(state)
  port_opts = maybe_add_cd([{:args, args}, {:env, env_list}, :binary, :exit_status, :stderr_to_stdout], opts)
  port = Port.open({:spawn_executable, exe_path}, port_opts)
  {:ok, port}
end

Deleted: build_shell_command/4, shell_escape/1, @shell_safe_pattern

prepare_env/1 — returns [{charlist(), charlist()}] for Erlang's native :env port option.

MCP tool DSL — tool/3 replaced with tool/2. Description moves inside the block:

# Before
tool :add, "Add two numbers" do ... end

# After
tool :add do
  description "Add two numbers"
  ...
end

feat: environment variable filtering with filter_env, allowed_env, disallowed_env #44 — Environment variable filtering (extracted from this PR into a separate PR)
bug: MCP Server tool macro does not define description/0 on generated modules #45 — MCP Server tool macro description/0 not exported (test issue discovered during this work)

Test plan

All 1485 tests pass (env filtering tests moved to feat: environment variable filtering with filter_env, allowed_env, disallowed_env #44)
mix compile --warnings-as-errors clean
mix format --check-formatted clean
New tests verify special characters in env vars pass through without escaping
Manual integration testing with real CLI binary in Kubernetes (ClaudeRun BEAM runtime)

ppsplus-bradh · 2026-03-27T04:04:46Z

The shell escaping was stuck in my brain. 🤷‍♂️

ppsplus-bradh · 2026-03-27T04:05:28Z

The shell escaping was stuck in my brain. 🤷‍♂️

The env filtering was a bonus (also stuck in my brain).

ppsplus-bradh · 2026-03-27T04:25:51Z

From smoke testing, FWIW.

col · 2026-03-27T11:51:36Z

Quick question - does this still allow passing additional env vars via the 'env' option without them being filtered?

I definitely agree with avoiding leaking env vars from the parent Beam process. I hit that issue just a few hours ago so this is perfectly timed! 🙏

ppsplus-bradh · 2026-03-27T13:53:05Z

Quick question - does this still allow passing additional env vars via the 'env' option without them being filtered?

I definitely agree with avoiding leaking env vars from the parent Beam process. I hit that issue just a few hours ago so this is perfectly timed! 🙏

Yes, sir, it does! https://github.com/guess/claude_code/pull/42/changes#diff-4efd7a87a453012236311f25c8f78325c50fec63c901258131124c0bb7c3a4c7L499 The user_env is still merged in as it was previously.

Eliminates shell escaping entirely by spawning the CLI binary directly via Port.open with native :args, :env, and :cd options instead of building a concatenated command string for /bin/sh -c.

Prevents leaking sensitive host environment (SSH keys, database URLs, cloud credentials) to the CLI subprocess. Filters by CLI-recognized prefixes (ANTHROPIC_, CLAUDE_CODE_, CLAUDE_, VERTEX_REGION_), an explicit allowlist of non-namespaced CLI vars, and essential system vars (PATH, HOME, etc.). User-provided :env bypasses the filter.

The test expected exactly {:unhealthy, :provisioning} but on fast runners the adapter can resolve (and fail) before the assertion, landing in :not_connected. Accept either state since both are valid unhealthy states during startup without a real CLI.

ppsplus-bradh · 2026-03-27T15:08:42Z

CI failure seems to be another race condition in the test suite. Reviewing the test suite for any other potential/similar conditions, and may open a new PR depending on scope to address the race conditions.

Direct spawn_executable resolves faster than sh -c, making it more likely that the adapter fails before the stream request is queued. Accept both :stream_init_error and :stream_error in session_test.exs and session_adapter_test.exs.

ppsplus-bradh · 2026-03-27T15:18:22Z

Addressed two race conditions introduced by the fact that spawn_executable resolves faster than sh -c. Investigation did uncover some other potentially brittle tests. Will create a separate PR for consideration on addressing those. Also, if there's any issue with this PR being dual purpose, let me know and I can separate the two tasks (refactoring out the sh -c and filtering env vars) so they may be considered individually.

guess · 2026-03-28T05:02:11Z

wow thanks, @ppsplus-bradh !

looking at this, i'm wondering if it's worth automatically allowing sdk-known cli env vars by default? maybe it'll be better for the user to specify which env vars they want to leak from the parent beam process?

the way it is now we'd have to maintain this list as they add or remove supported env vars. i think the system-critical ones are good to have & i'm on the fence about the prefixed ones. what do you think?

we could potentially have :env and :passthrough_env or something like that. then we can set a list as a sensible default, like:

[
  "HTTP_PROXY",
  {:prefix, "CLAUDE_CODE_"},
  ...
]

And if people don't want anything to leak they can just set it to []

open to whatever you think would be best, just thinking out loud :)

col · 2026-03-28T09:28:11Z

:passthrough_env with a sensible default sounds like a good plan with enough flexibility to cater for all reasonable use cases. 👍

I have cases where I want it to pass through some non-Claude env vars and some cases where I need to block some.

Add `allowed_env` option that accepts a list of environment variable names to pass through from the system environment to the CLI, beyond the built-in allowlist. Unlike `env` (key-value pairs), `allowed_env` takes only keys — values are read from System.get_env() at spawn time. This enables applications to forward specific env vars (e.g. DATABASE_URL, custom config) without hardcoding values in the `env` map, while still benefiting from the security filtering that excludes RELEASE_*, SSH keys, and other sensitive process-level vars. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Runs `mix format` after every Write or Edit tool use on Elixir files, ensuring code is always formatted before it reaches git. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Complete the env control surface with two new options: - `filter_env` (boolean, default true) — when true, applies the built-in allowlist (ANTHROPIC_*, CLAUDE_*, PATH, HOME, etc.). When false, passes all system env vars through unfiltered. - `disallowed_env` (list of strings) — keys to exclude from the CLI environment. Works in both filtered and unfiltered modes. Combined with the existing `allowed_env` and `env` options, this gives users full control over what reaches the CLI: # Filtered (default): built-in allowlist + extras filter_env: true, allowed_env: ["DATABASE_URL"] # Unfiltered: everything minus exclusions filter_env: false, disallowed_env: ["RELEASE_COOKIE", "SECRET_KEY"] # Explicit overrides always win regardless of mode env: %{"FORCE_THIS" => "value"} Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

ppsplus-bradh · 2026-03-28T16:35:33Z

Hey guys. This is Claude. Brad asked me to outline our thinking around a more comprehensive and flexible env var filtering system.

Environment Variable Control

The env filtering introduced earlier in this PR (built-in allowlist that reduced 91 vars down to 9) was a good security default, but it locked users into a single mode. We've expanded it into a complete control surface with three complementary options that follow the SDK's existing naming conventions (allowed_tools/disallowed_tools → allowed_env/disallowed_env):

Option	Type	Default	Purpose
`filter_env`	`boolean`	`true`	Toggle built-in allowlist filtering on/off
`allowed_env`	`[String.t()]`	`[]`	Additional keys to pass through (additive to built-in allowlist)
`disallowed_env`	`[String.t()]`	`[]`	Keys to exclude (works in both modes)
`env`	`%{String.t() => String.t()}`	`%{}`	Explicit key-value overrides (always applied, highest priority)

Two operating modes for different use cases

Filtered mode (filter_env: true, default) — start from a secure minimum and add what you need:

System.get_env()  (91 vars in a typical BEAM release)
  → filter: key in (built-in allowlist OR allowed_env) AND key not in disallowed_env  (→ ~9 + extras)
  → Map.new()

Unfiltered mode (filter_env: false) — start from everything and remove what you don't want:

System.get_env()  (91 vars)
  → reject: key in disallowed_env  (→ 91 minus exclusions)
  → Map.new()

Notably, if a user simply sets filter_env: false with no other options, the system behaves exactly as it did before this PR — all system env vars pass through to the CLI unchanged. This makes the filtering opt-in by default with zero breaking changes for existing users who want the previous behavior.

Both paths then merge identically:

  → Map.merge(sdk_env_vars())        # CLAUDE_CODE_ENTRYPOINT, SDK_VERSION
  → Map.merge(user_env)              # explicit :env key-value overrides (highest priority)
  → maybe_put_api_key()              # ANTHROPIC_API_KEY from :api_key option
  → maybe_put_file_checkpointing()   # optional checkpoint flag

The merge order is intentional — :env overrides always win. Even if you disallowed_env a key, you can still force it through via :env with an explicit value. disallowed_env filters the inherited system environment, not the user's explicit intent.

Examples:

# Default: secure minimum, just add DATABASE_URL
ClaudeCode.start_link(allowed_env: ["DATABASE_URL"])

# Everything except secrets (matches pre-filter behavior + exclusions)
ClaudeCode.start_link(filter_env: false, disallowed_env: ["RELEASE_COOKIE", "GITHUB_SSH_KEY"])

# Pre-filter behavior exactly (no filtering at all)
ClaudeCode.start_link(filter_env: false)

# Filtered + force a specific override regardless of filtering
ClaudeCode.start_link(
  allowed_env: ["MY_CONFIG"],
  disallowed_env: ["CLAUDE_CODE_SOME_FLAG"],
  env: %{"FORCED_VAR" => "always_set"}
)

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

ppsplus-bradh · 2026-03-28T17:04:51Z

Alright, @guess and @col. Let me know what you think of that. Comment above.

ppsplus-bradh · 2026-03-28T17:06:36Z

And I swear, I'll break stuff like this apart next time. 🤣 Single concern PRs.

guess · 2026-03-28T18:07:26Z

Looking into this further, I think we should keep the behaviour the same (inherit all env), since that is also what the Python SDK is doing, with the exception of filtering out CLAUDECODE variable. So maybe we should follow suit and do that as well by default.

I think splitting this into 4 options will get confusing and a single additional option could support 99% of use-cases:

# everything gets inherited (current behaviour)
ClaudeCode.start_link()

# nothing gets inherited from parent
ClaudeCode.start_link(inherit_env: [])

# only these get inherited from parent
ClaudeCode.start_link(inherit_env: ["CLAUDE_CODE_SOME_FLAG"])

And we still have :env to be able to pass in whatever environment overrides on top of these options.

guess · 2026-03-28T18:13:20Z

@ppsplus-bradh We could also just remove the env var filtering from this PR, keeping the behaviour the same so we can merge this and #43 in. Then we can make a sep. focused PR for the env filtering changes?

ppsplus-bradh · 2026-03-28T18:18:24Z

@ppsplus-bradh We could also just remove the env var filtering from this PR, keeping the behaviour the same so we can merge this and #43 in. Then we can make a sep. focused PR for the env filtering changes?

Yeah... My head was going there. On it.

Remove filter_env, allowed_env, disallowed_env, and all filtering infrastructure from this PR. These will be submitted as a separate PR to keep the shell escaping refactor focused. build_env now passes System.get_env() through unfiltered, matching the pre-refactor behavior. The spawn_executable change is the sole focus of this PR. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

ppsplus-bradh · 2026-03-28T19:12:43Z

@guess 2 PRs now. Separate concerns.

ppsplus-bradh marked this pull request as ready for review March 27, 2026 04:03

ppsplus-bradh added 5 commits March 27, 2026 09:04

Refactor port spawning to use direct spawn_executable instead of sh -c

5f33513

Eliminates shell escaping entirely by spawning the CLI binary directly via Port.open with native :args, :env, and :cd options instead of building a concatenated command string for /bin/sh -c.

retrigger CI

8ca98e6

Update CHANGELOG with spawn_executable refactor and env filtering

1e103c0

ppsplus-bradh force-pushed the refactor/direct-spawn-executable branch from a4df881 to 1e103c0 Compare March 27, 2026 14:07

Fix race conditions in provisioning error tests

ee47e21

Direct spawn_executable resolves faster than sh -c, making it more likely that the adapter fails before the stream request is queued. Accept both :stream_init_error and :stream_error in session_test.exs and session_adapter_test.exs.

ppsplus-bradh mentioned this pull request Mar 27, 2026

Harden timing-sensitive tests: replace Process.sleep with proper synchronization #43

Merged

5 tasks

ppsplus-bradh and others added 5 commits March 28, 2026 10:22

style: apply mix format to allowed_env changes

ccbed13

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

chore: add PostToolUse hook for auto mix format on .ex/.exs files

4fd06c0

Runs `mix format` after every Write or Edit tool use on Elixir files, ensuring code is always formatted before it reaches git. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

docs: update changelog with full env control surface

30b932c

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

ppsplus-bradh and others added 2 commits March 28, 2026 11:37

fix: combine filter+reject into single Enum.filter for Credo

8a97958

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

ci: re-trigger upstream CI after fork validation

13004c9

This was referenced Mar 28, 2026

feat: environment variable filtering with filter_env, allowed_env, disallowed_env #44

Closed

bug: MCP Server tool macro does not define description/0 on generated modules #45

Closed

ppsplus-bradh changed the title ~~Refactor: direct spawn_executable + env var filtering~~ Refactor: direct spawn_executable (remove shell escaping) Mar 28, 2026

guess approved these changes Mar 28, 2026

View reviewed changes

guess merged commit e896ca3 into guess:main Mar 28, 2026
2 checks passed

guess mentioned this pull request Mar 29, 2026

Add :inherit_env option for environment variable control #47

Merged

7 tasks

Conversation

ppsplus-bradh commented Mar 27, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Motivation

Changes

Related

Test plan

Uh oh!

ppsplus-bradh commented Mar 27, 2026

Uh oh!

ppsplus-bradh commented Mar 27, 2026

Uh oh!

ppsplus-bradh commented Mar 27, 2026

Uh oh!

col commented Mar 27, 2026

Uh oh!

ppsplus-bradh commented Mar 27, 2026

Uh oh!

ppsplus-bradh commented Mar 27, 2026

Uh oh!

ppsplus-bradh commented Mar 27, 2026

Uh oh!

guess commented Mar 28, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

col commented Mar 28, 2026

Uh oh!

ppsplus-bradh commented Mar 28, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

ppsplus-bradh commented Mar 28, 2026

Uh oh!

ppsplus-bradh commented Mar 28, 2026

Uh oh!

guess commented Mar 28, 2026

Uh oh!

guess commented Mar 28, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

ppsplus-bradh commented Mar 28, 2026

Uh oh!

ppsplus-bradh commented Mar 28, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

ppsplus-bradh commented Mar 27, 2026 •

edited

Loading

guess commented Mar 28, 2026 •

edited

Loading

ppsplus-bradh commented Mar 28, 2026 •

edited

Loading

guess commented Mar 28, 2026 •

edited

Loading