add interactive gateway demo by ericcurtin · Pull Request #849 · docker/model-runner

ericcurtin · 2026-04-08T13:47:35Z

Introduces demos/gateway/ with a step-by-step shell demo for the model-cli gateway command. Each step pauses for Enter, types commands character-by-character as 'docker model gateway', and covers health, auth, chat completions, streaming, embeddings, load balancing, fallbacks, and OpenAI SDK compatibility.

sourcery-ai

Hey - I've left some high level feedback:

The demo script assumes the model-cli binary exists at a hard-coded path (${REPO_ROOT}/model-cli/target/release/model-cli); consider either checking for its existence with a helpful error message or allowing an override via an environment variable (e.g., MODEL_CLI_BIN).
Several steps depend on python3 and the openai package being available; you already skip the SDK step when openai is missing, but it may be more robust to add an early prerequisite check (for python3 and core Python usage) and exit with a clear message if they are not installed.

Prompt for AI Agents

Please address the comments from this code review:

## Overall Comments
- The demo script assumes the `model-cli` binary exists at a hard-coded path (`${REPO_ROOT}/model-cli/target/release/model-cli`); consider either checking for its existence with a helpful error message or allowing an override via an environment variable (e.g., `MODEL_CLI_BIN`).
- Several steps depend on `python3` and the `openai` package being available; you already skip the SDK step when `openai` is missing, but it may be more robust to add an early prerequisite check (for `python3` and core Python usage) and exit with a clear message if they are not installed.

Sourcery is free for open source - if you like our reviews please consider sharing them ✨

_{Help me be more useful! Please click 👍 or 👎 on each comment and I'll use the feedback to improve your reviews.}

gemini-code-assist

Code Review

This pull request introduces a new interactive demo for the model-cli gateway command, providing configuration examples and a bash script to walk users through features like load balancing, retries, and fallbacks. The review identified a critical security issue regarding the use of eval in the demo script's run_step function, which should be refactored to avoid potential command injection.

gemini-code-assist · 2026-04-08T13:56:30Z

demos/gateway/demo.sh

+run_step() {
+    local description="$1"
+    local command="$2"
+
+    echo
+    printf "${dim}# %s${reset}\n" "$description"
+    printf "${bold}${green}\$${reset} "
+    typewrite "$command" 0.035
+    printf "${dim} ▌${reset}"   # blinking-cursor illusion
+
+    # Wait for Enter
+    read -r -s _
+    printf "\r${bold}${green}\$${reset} ${white}%s${reset}\n" "$command"
+
+    # Actually run it
+    eval "$command"
+}


The run_step function uses eval on a command string that has had variables expanded into it. This is a critical security and correctness issue. If a variable (like $API_KEY) contains shell metacharacters (e.g., a single quote), it can break the command syntax or lead to arbitrary code execution.

According to the repository style guide, which prioritizes security, eval should be avoided.

I recommend removing this function and inlining its logic at its two call sites (in Step 3 and Step 4). This will make variable handling explicit and safe for each command, which is consistent with how more complex commands are handled in other steps of this script.

Here is an example of how the health check in Step 3 could be rewritten safely:

section "Step 3 — Health check" local description="The gateway exposes /health — no auth required" local command="curl -s http://localhost:${GATEWAY_PORT}/health | python3 -m json.tool" echo printf "${dim}# %s\n" "$description" printf "${bold}${green}\\$${reset} " typewrite "$command" 0.035 printf "${dim} ▌${reset}" # blinking-cursor illusion # Wait for Enter read -r -s _ printf "\r${bold}${green}\\$${reset} ${white}%s${reset}\n" "$command" # Actually run it curl -s "http://localhost:${GATEWAY_PORT}/health" | python3 -m json.tool

References

The use of eval presents a significant security risk (potential for command injection), which goes against the security principles outlined in the style guide. The guide states a priority on security, asking "What are the threat surfaces? Are trust boundaries respected? ... Think about the OWASP top 10..." ^(link)

Introduces demos/gateway/ with a step-by-step shell demo for the model-cli gateway command. Each step pauses for Enter, types commands character-by-character as 'docker model gateway', and covers health, auth, chat completions, streaming, embeddings, load balancing, fallbacks, and OpenAI SDK compatibility. Signed-off-by: Eric Curtin <eric.curtin@docker.com>

vllm-metal uses ZMQ IPC sockets at temporary paths under /private/var/folders (the macOS TMPDIR) for internal inter-process communication between API server workers. The Python sandbox profile only allowed network-bind for Unix sockets matching the inference.*-[0-9]+\.sock$ pattern and TCP loopback, which caused a ZMQError: Operation not permitted when vllm-metal tried to bind those sockets. Allow network-bind on paths under /private/var/folders so vllm-metal can create its internal ZMQ IPC sockets in the system temp directory.

sourcery-ai bot reviewed Apr 8, 2026

View reviewed changes

gemini-code-assist bot reviewed Apr 8, 2026

View reviewed changes

ericcurtin force-pushed the some-fix5 branch 2 times, most recently from 33c2c7b to 1c7f10c Compare April 9, 2026 12:37

ilopezluna approved these changes Apr 9, 2026

View reviewed changes

ericcurtin force-pushed the some-fix5 branch from 1c7f10c to 2e8c00c Compare April 9, 2026 13:05

ericcurtin merged commit 9a168e7 into main Apr 9, 2026
13 of 14 checks passed

ericcurtin deleted the some-fix5 branch April 9, 2026 13:56

ericcurtin restored the some-fix5 branch April 9, 2026 14:01

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

add interactive gateway demo#849

add interactive gateway demo#849
ericcurtin merged 2 commits intomainfrom
some-fix5

ericcurtin commented Apr 8, 2026

Uh oh!

sourcery-ai bot left a comment

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

gemini-code-assist bot Apr 8, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

ericcurtin commented Apr 8, 2026

Uh oh!

sourcery-ai bot left a comment

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

gemini-code-assist bot Apr 8, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants