fix(mcpcatalog): tell the model to proceed after enabling an OAuth server by trungutt · Pull Request #2925 · docker/docker-agent

trungutt · 2026-05-29T08:40:49Z

Problem

When a user asks something that requires an OAuth-protected catalog
server (e.g. "List all pages by David Gageot in Notion"), the model:

calls search_remote_mcp_servers,
calls enable_remote_mcp_server for notion,
receives a tool result ending with "auth: OAuth — an authorization URL will be elicited on the next turn.",
then narrates to the user "OAuth will happen on your next request, please ask me again" and stops.

The user must repeat their original prompt to actually get an answer:

The Notion server is now enabled and ready to use. When you make your next request, it will initiate OAuth authorization. […] Would you like me to proceed with searching your Notion workspace for pages and David Gageot's comments?

Why this happens

The runtime actually drives the OAuth flow inside the same loop iteration:

processToolCalls runs handleEnable, which only registers the toolset.
reprobe (pkg/runtime/loop.go:608-610) calls getTools → Tools() on the catalog → first-call lazy Start() on the freshly enabled toolset (mcpcatalog.go:371-372).
Start() runs the MCP handshake, the 401 triggers the OAuth elicitation, the user grants, Start() returns success and the server's tools become available.
The next loop iteration calls the model with the new tools already in the prompt.

So by the time the next LLM turn begins, OAuth is already complete. The model just doesn't know that — it only sees the previous tool result, which literally said "elicited on the next turn".

Fix

This PR rewrites the two strings the model uses to reason about the post-enable state:

handleEnable's OAuth-branch success message now tells the model the host triggers any required authorization transparently the first time the server's tools are used, and that the model should continue with the user's original request now.
Instructions() step 3 is hardened from a soft "Use the newly activated tools as you would any other" into an explicit "Immediately proceed with the user's ORIGINAL request […]; do NOT stop and ask the user to repeat themselves; do NOT narrate the OAuth flow".

No runtime behaviour changes; this is a prompt fix only

…rver When the model called enable_remote_mcp_server for an OAuth server, the tool result said "auth: OAuth — an authorization URL will be elicited on the next turn" and the instructions softly suggested using the activated tools. The runtime actually drives the OAuth flow inside the same loop iteration (via reprobe + Tools() → Start() → elicitation), so by the time the next LLM call begins the tools are live and OAuth is already complete. But the model still read the previous tool result and narrated to the user "OAuth will happen on your next request, ask me again" — forcing the user to repeat themselves. This change rewrites two strings the model relies on: - The OAuth branch of handleEnable's success message now tells the model the host triggers any required authorization transparently and that the model should continue with the user's original request immediately. - Step 3 of the Instructions() workflow is hardened from "use the newly activated tools as you would any other" into an explicit "proceed with the user's ORIGINAL request, do NOT stop and ask". No runtime behaviour changes; this is purely a prompt fix.

docker-agent

Assessment: 🟡 NEEDS ATTENTION

This is a prompt-only fix that rewrites two LLM-facing instruction strings in mcpcatalog.go. No runtime logic is changed. Two findings surfaced during review — both are edge-case wording concerns rather than code bugs.

- Step 3 of Instructions(): make the timing explicit. The previous "Immediately proceed … by calling the newly activated server's tools" wording contradicted step 2's "adds the server's tools to your set on the *next* turn" and could lead a literal model to try to call the new tools in the same response. Reworded to "On your NEXT turn (once the activated server's tools appear in your set), go straight to the user's ORIGINAL request using those tools". - handleEnable's OAuth-branch result message: drop "transparently", which overstated seamlessness for the unmanaged-OAuth path (which requires a browser redirect and explicit grant). Reworded to "the host will surface any required authorization (browser redirect or in-app dialog) the first time the server's tools are used, and resume the tool call once access is granted".

rumpl · 2026-05-29T08:50:34Z

Hold on, why does the LLM need to know that there is an oauth thing?

@rumpl

Per @rumpl's review feedback on docker#2925: the LLM has no agency over the OAuth flow — it can't trigger it, grant it, or recover from it. Every OAuth-related string the catalog exposes to the model is a leaky abstraction, and the original bug (model stops and asks the user to repeat themselves after enabling an OAuth-protected server) is a direct consequence of leaking 'auth: OAuth — elicited on the next turn' in the enable result. This commit removes OAuth as a model-visible concept from every LLM-facing string in the catalog: - handleEnable: drop the per-auth-type switch. The success message is now a single line saying 'tools will appear on your next turn, proceed with the user's original request'. The only conditional emission is the WARNING for unresolved env vars, which is the one case the model has actual agency over (it can ask the user to set the variable and retry). - Instructions() workflow: step 2 drops the 'Authentication (OAuth or API key) is deferred' sentence; step 3 drops 'do not narrate the OAuth flow' (the model no longer learns OAuth exists, so the instruction can shrink to 'do not narrate connection setup'); step 5 reframes reset_remote_mcp_server_auth as 'if a previously enabled server starts rejecting requests, clear any persisted credentials' without mentioning OAuth tokens or authorization URLs. - Tool descriptions: enable_remote_mcp_server no longer mentions 'OAuth flow or API-key check'; reset_remote_mcp_server_auth no longer mentions 'OAuth credentials' or 'fresh authorization flow'. - ResetAuthArgs jsonschema and handleResetAuth result strings: drop every 'OAuth credentials' and 'fresh authorization flow' reference. Internal field names (oauthSuccessHandler, managedOAuth, etc.), Go package-doc comments, and slog debug messages still mention OAuth — those are developer-facing, not LLM-facing, and removing them would obscure intent for anyone reading the package. Tests updated: - TestEnableAddsServerAndFiresChangeNotification now asserts the result does NOT contain 'OAuth' or 'authorization', protecting against a regression where someone re-adds the leaky wording. - TestEnableAPIKeyEnvPresent no longer expects 'auth: API key' (the line is gone); still asserts no WARNING. - TestResetAuthClearsCredentials and TestResetAuthNoOpForNonOAuth updated for the new 'cleared credentials' / 'no persisted credentials' wording. The unresolved-header WARNING test still passes because the WARNING branch survived the rewrite under the unified 'len(missing) > 0' check.

trungutt · 2026-05-29T09:14:18Z

@rumpl Good question — and you're right, it shouldn't. The original wording (auth: OAuth — elicited on the next turn) caused this bug precisely because it leaked OAuth as a concept the model then tried to reason about. My first patch only rewrote that sentence to be less misleading; that left the leaky abstraction in place.

c3c85b3 removes OAuth as an LLM-visible concept from the catalog entirely:

handleEnable no longer branches on server.Auth.Type. The success message is a single line — "the server's tools will appear on your next turn. Proceed with the user's original request; the host handles any required connection setup." The only conditional emission left is the WARNING: env var(s) NOT set line, which is the one case the model has actual agency (it can ask the user to set the var and retry).
Instructions() drops "Authentication (OAuth or API key) is deferred" from step 2, drops "do not narrate the OAuth flow" from step 3 (the model no longer learns OAuth exists, so the instruction can just say "do not narrate connection setup"), and reframes step 5 (reset_auth) as "if a previously enabled server starts rejecting requests, clear any persisted credentials" — no mention of OAuth tokens or authorization URLs.
enable_remote_mcp_server and reset_remote_mcp_server_auth tool descriptions, ResetAuthArgs jsonschema, and handleResetAuth result strings all lose every OAuth/authorization reference.

Internal field names (oauthSuccessHandler, managedOAuth, …), Go package-doc comments, and slog debug messages still say OAuth — those are developer-facing and removing them would obscure intent for anyone reading the package.

A new test assertion (NotContains("OAuth"), NotContains("authorization") in TestEnableAddsServerAndFiresChangeNotification) guards against re-adding the leak.

PTAL.

docker-agent

Assessment: 🟢 APPROVE

This is a prompt/message-only fix that removes OAuth-specific wording from LLM-facing tool results and instructions so the model proceeds with the user's original request instead of stopping to ask them to repeat themselves.

What was reviewed:

pkg/tools/builtin/mcpcatalog/mcpcatalog.go — handleEnable message consolidation, handleResetAuth message updates, Instructions() step 3 hardening, tool description rewrites
pkg/tools/builtin/mcpcatalog/mcpcatalog_test.go — updated string assertions to match new production strings

Findings summary:

✅ No high or medium severity bugs found
✅ Auth-type consolidation in handleEnable correctly preserves env-var warnings for all relevant auth types (api_key, custom headers); the removed OAuth-specific branch was informational-only and its removal is intentional
✅ All updated test assertions ("enabled", "cleared credentials", "no persisted credentials", assert.NotContains for "OAuth" / "authorization") correctly match the new production string literals
ℹ️ Minor (not flagged): handleResetAuth's early-exit message "has no persisted credentials" is slightly imprecise for api_key servers (they have credentials, just not ones stored by this tool) — worth a follow-up comment, but not a bug and consistent with the PR's intent of not leaking auth-type details to the model

The fix correctly addresses the root cause: the previous "auth: OAuth — an authorization URL will be elicited on the next turn" string was the signal causing models to pause and narrate instead of proceeding. The new wording and explicit step-3 instruction ("do NOT stop and ask the user to repeat themselves") should resolve the reported behavior.

docker-agent

Assessment: 🟡 NEEDS ATTENTION

One medium-severity issue found in the changed code. The rest of the prompt-text changes look correct and well-motivated.

docker-agent · 2026-05-29T11:49:02Z


 	msg := strings.Builder{}
-	fmt.Fprintf(&msg, "enabled %q (%s) — connection deferred until the host next enumerates tools.\n", id, server.Title)
+	fmt.Fprintf(&msg, "enabled %q (%s) — the server's tools will appear on your next turn. Proceed with the user's original request; the host handles any required connection setup.\n", id, server.Title)


[MEDIUM] Contradictory model instructions when API-key env vars are missing

When missing is non-empty, the message assembled in handleEnable simultaneously tells the model two conflicting things:

"Proceed with the user's original request; the host handles any required connection setup."

"WARNING: the following env var(s) are NOT set: … Ask the user to set them, then call disable_remote_mcp_server and enable_remote_mcp_server for this id again, otherwise tool calls will fail."

An LLM receiving both instructions in the same tool-result has no explicit prioritisation signal — the Proceed clause comes first and is unconditional, while the WARNING appears later but doesn't say "ignore the previous instruction". Depending on which instruction the model weighs more heavily it may either:

dutifully proceed and immediately fail every downstream tool call (env vars not set), or

correctly pause and ask the user to set the vars but now via a path the new wording was designed to remove.

Suggested fix: move the "Proceed with the user's original request" sentence into the else branch (i.e., only emit it when len(missing) == 0), so it is never paired with a WARNING that contradicts it:

msg := strings.Builder{} fmt.Fprintf(&msg, "enabled %q (%s) — the server's tools will appear on your next turn.\n", id, server.Title) fmt.Fprintf(&msg, "endpoint: %s\n", server.URL) if len(missing) > 0 { fmt.Fprintf(&msg, "WARNING: the following env var(s) are NOT set: %s. Ask the user to set them, then call %s and %s for this id again, otherwise tool calls will fail.\n", strings.Join(missing, ", "), ToolNameDisable, ToolNameEnable) } else { fmt.Fprintf(&msg, "Proceed with the user's original request; the host handles any required connection setup.\n") }

docker-agent reviewed May 29, 2026

View reviewed changes

Comment thread pkg/tools/builtin/mcpcatalog/mcpcatalog.go Outdated

Comment thread pkg/tools/builtin/mcpcatalog/mcpcatalog.go Outdated

aheritier added area/mcp MCP protocol, MCP tool servers, integration kind/fix PR fixes a bug (maps to fix: commit prefix) labels May 29, 2026

trungutt requested a review from docker-agent May 29, 2026 09:46

docker-agent reviewed May 29, 2026

View reviewed changes

trungutt marked this pull request as ready for review May 29, 2026 11:45

trungutt requested a review from a team as a code owner May 29, 2026 11:45

docker-agent reviewed May 29, 2026

View reviewed changes

dgageot approved these changes May 29, 2026

View reviewed changes

dgageot merged commit 6683688 into docker:main May 29, 2026
10 checks passed

aheritier mentioned this pull request May 29, 2026

docs: document --auth-token flag, OAuth callback security note, and TUI notification UX #2932

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix(mcpcatalog): tell the model to proceed after enabling an OAuth server#2925

fix(mcpcatalog): tell the model to proceed after enabling an OAuth server#2925
dgageot merged 3 commits into
docker:mainfrom
trungutt:mcpcatalog-oauth-proceed

trungutt commented May 29, 2026 •

edited

Loading

Uh oh!

docker-agent left a comment

Uh oh!

Uh oh!

Uh oh!

rumpl commented May 29, 2026

Uh oh!

trungutt commented May 29, 2026

Uh oh!

docker-agent left a comment

Uh oh!

docker-agent left a comment

Uh oh!

docker-agent May 29, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

Conversation

trungutt commented May 29, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Problem

Why this happens

Fix

Uh oh!

docker-agent left a comment

Choose a reason for hiding this comment

Assessment: 🟡 NEEDS ATTENTION

Uh oh!

Uh oh!

Uh oh!

rumpl commented May 29, 2026

Uh oh!

trungutt commented May 29, 2026

Uh oh!

docker-agent left a comment

Choose a reason for hiding this comment

Assessment: 🟢 APPROVE

Uh oh!

docker-agent left a comment

Choose a reason for hiding this comment

Assessment: 🟡 NEEDS ATTENTION

Uh oh!

docker-agent May 29, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

trungutt commented May 29, 2026 •

edited

Loading