fix: make session MCP startup lazy by efrazer-oai · Pull Request #22901 · openai/codex

efrazer-oai · 2026-05-15T21:05:11Z

Summary

App-server thread startup used to build the session's MCP pool up front. thread/start and thread/resume waited for that work, including required MCP server startup, before returning.

This change keeps one MCP pool per session, but removes that startup wait. thread/start and thread/resume now return after session configuration, while MCP initialization begins in the background. The first real MCP consumer still awaits the shared session pool if startup has not finished yet.

This keeps the user-visible startup path fast without changing how MCP consumers observe readiness, and it preserves the early mcpServer/startupStatus/updated notifications that clients already rely on.

This also keeps the forked-thread boundary intact. We still do not share one process-wide MCP pool because MCP runtime state depends on session config, cwd, environment selection, and approval state.

Design decisions

Keep the singleton at session scope. Forked threads can carry different MCP config and runtime context, so a process-wide singleton would mix unrelated state.
Start initialization in the background after session configuration. This removes MCP startup from the thread/start and thread/resume critical path while still emitting startup status updates promptly.
Preserve explicit refresh. Refresh still rebuilds and swaps the shared session pool, and it still uses the active turn context so turn-local environment overrides keep working.
Move readiness waits to consumers. Tool building, app tool discovery, and MCP resource/tool calls now await the shared pool on demand instead of making thread creation wait.

Testing

Targeted codex-core tests covering deferred refresh and idempotent lazy initialization.
Targeted codex-app-server tests covering thread/start, thread/resume, and MCP status notifications.
Scoped clippy autofix and formatting for the touched crates.

efrazer-oai · 2026-05-15T21:05:23Z

@codex review

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 599f53bc67

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

chatgpt-codex-connector · 2026-05-15T21:09:59Z

+                pending_mcp_server_refresh: Mutex::new(None),
+                mcp_connection_manager_initialized: std::sync::atomic::AtomicBool::new(false),


Start MCP initialization for idle thread starts

With the manager now left uninitialized here and no replacement startup path after SessionConfigured, a client that only calls thread/start or thread/resume and then listens for MCP server status never triggers ensure_mcp_connection_manager_initialized(). That means mcpServer/startupStatus/updated notifications are no longer emitted for configured servers until a later turn/tool listing happens; the existing thread_start_emits_mcp_server_status_updated_notifications workflow still waits for those notifications immediately after ThreadStartResponse, so it will time out for optional/broken MCP configs.

Useful? React with 👍 / 👎.

efrazer-oai · 2026-05-15T21:26:59Z

@codex review

efrazer-oai · 2026-05-15T21:32:20Z

@codex review

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 68dc2fda2f

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

chatgpt-codex-connector · 2026-05-15T21:37:21Z

+            client_elicitation_capability: mcp_config.client_elicitation_capability,
+            elicitation_reviewer: None,


Pass the Guardian reviewer during lazy MCP init

When the session MCP manager is first created through the new lazy/startup path, elicitation_reviewer is hard-coded to None, whereas the old startup path passed Some(sess.mcp_elicitation_reviewer()). In sessions using Guardian approval routing, MCP servers that emit Guardian MCP-tool elicitations during normal tool calls will now bypass review_guardian_mcp_elicitation for the lifetime of the initial manager and fall back to raw ElicitationRequest handling until a refresh happens.

Useful? React with 👍 / 👎.

chatgpt-codex-connector · 2026-05-15T21:37:21Z

-        let (refreshed_manager, cancel_token) = McpConnectionManager::new(
-            &mcp_servers,
+
+        self.replace_mcp_connection_manager(ReplaceMcpConnectionManagerArgs {


Serialize refresh with lazy MCP initialization

This refresh path can run while the background startup task is still inside ensure_mcp_connection_manager_initialized(), but it does not take the initialization lock or otherwise invalidate the in-flight initial build. If an app-server client sends RefreshMcpServers before the startup prewarm completes and the next turn applies it, the refresh and the original lazy init can race; whichever replace_mcp_connection_manager finishes last wins, so the original config can overwrite the just-refreshed server set.

Useful? React with 👍 / 👎.

chatgpt-codex-connector · 2026-05-15T21:37:21Z

+        .unwrap_or_else(|err| {
+            panic!("session MCP environment selections should remain valid: {err}")
+        })


Return an error for invalid MCP environments

With a stored/sticky MCP environment id that is no longer known, this lazy initialization path now panics inside the spawned startup task instead of returning the InvalidRequest error that the old Session::new path produced. That leaves app-server thread/start/thread/resume apparently successful, and the first later MCP consumer can panic again while trying to initialize the manager.

Useful? React with 👍 / 👎.

chatgpt-codex-connector · 2026-05-15T21:37:21Z

-                    anyhow::bail!("required MCP servers failed to initialize: {details}");
-                }
-            }
+            sess.start_mcp_connection_manager_initialization();


Cancel lazy MCP initialization on shutdown

Starting MCP initialization as a detached task here means an immediate Op::Shutdown can drain the still-uninitialized manager and emit ShutdownComplete while this task continues holding the session alive. When it later reaches replace_mcp_connection_manager, it can create and swap in real MCP clients after shutdown, leaving stdio MCP server processes running and possibly emitting startup events for a closed thread.

Useful? React with 👍 / 👎.

chatgpt-codex-connector · 2026-05-15T21:37:21Z

+        self.replace_mcp_connection_manager(ReplaceMcpConnectionManagerArgs {
+            submit_id: INITIAL_SUBMIT_ID.to_owned(),
+            approval_policy: &session_configuration.approval_policy,
+            permission_profile: session_configuration.permission_profile(),
+            runtime_environment: self.session_mcp_runtime_environment(&session_configuration),
+            config: &config,
+            mcp_servers,
+            store_mode: config.mcp_oauth_credentials_store_mode,
+            auth: auth.as_ref(),
+            host_owned_codex_apps_enabled: host_owned_codex_apps_enabled(
+                &mcp_config,
+                auth.as_ref(),
+            ),
+            client_elicitation_capability: mcp_config.client_elicitation_capability,
+            elicitation_reviewer: None,
+        })


Preserve required MCP server failures

After moving startup behind lazy initialization, the old required_startup_failures(...) check is no longer called anywhere, so a configured required = true server that fails to start only produces startup events and the turn proceeds with the missing tools. For setups that mark MCP servers required because the agent depends on them, the first MCP/tool-building consumer should still fail or otherwise stop the turn instead of silently continuing without the required server.

Useful? React with 👍 / 👎.

chatgpt-codex-connector · 2026-05-15T21:37:21Z

+        {
+            let mut guard = self.services.mcp_startup_cancellation_token.lock().await;
+            guard.cancel();
+            *guard = CancellationToken::new();
+        }


Honor interrupts that arrive before token replacement

If the user interrupts an idle session while the lazy MCP init task is still computing provenance/auth status, cancel_mcp_startup() cancels the placeholder token, but this block then discards that cancelled token and installs a fresh uncancelled one before spawning MCP clients. In that timing window, the interrupt no longer cancels startup, so MCP processes can still be launched after the user asked to stop idle startup work.

Useful? React with 👍 / 👎.

fix: make session MCP startup lazy

8bd391e

chatgpt-codex-connector Bot reviewed May 15, 2026

View reviewed changes

fix: align lazy MCP startup with current main

95d02e8

efrazer-oai force-pushed the fix/lazy-session-mcp-startup branch from 599f53b to 95d02e8 Compare May 15, 2026 21:26

fix: prewarm MCP startup notifications in background

68dc2fd

chatgpt-codex-connector Bot reviewed May 15, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: make session MCP startup lazy#22901

fix: make session MCP startup lazy#22901
efrazer-oai wants to merge 3 commits into
mainfrom
fix/lazy-session-mcp-startup

efrazer-oai commented May 15, 2026 •

edited

Loading

Uh oh!

efrazer-oai commented May 15, 2026

Uh oh!

chatgpt-codex-connector Bot left a comment

Uh oh!

chatgpt-codex-connector Bot May 15, 2026

Uh oh!

efrazer-oai commented May 15, 2026

Uh oh!

efrazer-oai commented May 15, 2026

Uh oh!

chatgpt-codex-connector Bot left a comment

Uh oh!

chatgpt-codex-connector Bot May 15, 2026

Uh oh!

chatgpt-codex-connector Bot May 15, 2026

Uh oh!

chatgpt-codex-connector Bot May 15, 2026

Uh oh!

chatgpt-codex-connector Bot May 15, 2026

Uh oh!

chatgpt-codex-connector Bot May 15, 2026

Uh oh!

chatgpt-codex-connector Bot May 15, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

		pending_mcp_server_refresh: Mutex::new(None),
		mcp_connection_manager_initialized: std::sync::atomic::AtomicBool::new(false),

		client_elicitation_capability: mcp_config.client_elicitation_capability,
		elicitation_reviewer: None,

Conversation

efrazer-oai commented May 15, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Design decisions

Testing

Uh oh!

efrazer-oai commented May 15, 2026

Uh oh!

chatgpt-codex-connector Bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

chatgpt-codex-connector Bot May 15, 2026

Choose a reason for hiding this comment

Uh oh!

efrazer-oai commented May 15, 2026

Uh oh!

efrazer-oai commented May 15, 2026

Uh oh!

chatgpt-codex-connector Bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

chatgpt-codex-connector Bot May 15, 2026

Choose a reason for hiding this comment

Uh oh!

chatgpt-codex-connector Bot May 15, 2026

Choose a reason for hiding this comment

Uh oh!

chatgpt-codex-connector Bot May 15, 2026

Choose a reason for hiding this comment

Uh oh!

chatgpt-codex-connector Bot May 15, 2026

Choose a reason for hiding this comment

Uh oh!

chatgpt-codex-connector Bot May 15, 2026

Choose a reason for hiding this comment

Uh oh!

chatgpt-codex-connector Bot May 15, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

efrazer-oai commented May 15, 2026 •

edited

Loading