fix: recover from ErrSessionMissing when remote MCP server restarts by dgageot · Pull Request #2212 · docker/docker-agent

dgageot · 2026-03-22T10:46:51Z

hi all - i /think/ we're observing an issue where a long-lived docker agent (via serve) fails to reconnect to MCP servers after the MCP server restarts - not recovering from errSessionMissing

Assisted-By: docker-agent

docker-agent

Assessment: 🟡 NEEDS ATTENTION

This PR adds reconnection logic to recover from ErrSessionMissing when MCP servers restart. The implementation is mostly sound, but there's one timing issue worth addressing.

docker-agent · 2026-03-22T10:53:14Z

pkg/tools/mcp/mcp.go

+		return nil
+	case <-ctx.Done():
+		return ctx.Err()
+	case <-time.After(sessionMissingRetryTimeout):


⚠️ MEDIUM: Timeout not coordinated with retry backoff

The sessionMissingRetryTimeout is set to 30 seconds, but tryRestart() uses exponential backoff that can take up to 31 seconds total (1+2+4+8+16 seconds across 5 retry attempts).

This creates a race condition where:

A tool call encounters ErrSessionMissing and calls forceReconnectAndWait()

watchConnection is in the middle of a backoff sleep (e.g., the 16-second sleep on the 5th retry)

The tool call times out at 30 seconds and returns an error to the user

A second later, the reconnect completes successfully

The next identical tool call succeeds

Impact: Non-deterministic failures where tool calls fail with "timed out waiting for MCP server reconnection" even though the server successfully reconnects moments later.

Recommendation: Either increase sessionMissingRetryTimeout to 35-40 seconds to account for the maximum backoff duration, or coordinate the timeout with the actual retry logic (e.g., calculate remaining backoff time).

Assisted-By: docker-agent

dgageot marked this pull request as ready for review March 22, 2026 10:46

dgageot requested a review from a team as a code owner March 22, 2026 10:46

docker-agent bot reviewed Mar 22, 2026

View reviewed changes

fix: recover from ErrSessionMissing when remote MCP server restarts

02c372e

Assisted-By: docker-agent

dgageot force-pushed the mcp-reconnect branch from 6e70f65 to 02c372e Compare March 22, 2026 11:01

rumpl approved these changes Mar 22, 2026

View reviewed changes

dgageot merged commit 0c2bf5d into docker:main Mar 22, 2026
4 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: recover from ErrSessionMissing when remote MCP server restarts#2212

fix: recover from ErrSessionMissing when remote MCP server restarts#2212
dgageot merged 1 commit intodocker:mainfrom
dgageot:mcp-reconnect

dgageot commented Mar 22, 2026

Uh oh!

docker-agent bot left a comment

Uh oh!

docker-agent bot Mar 22, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

dgageot commented Mar 22, 2026

Uh oh!

docker-agent bot left a comment

Choose a reason for hiding this comment

Assessment: 🟡 NEEDS ATTENTION

Uh oh!

docker-agent bot Mar 22, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants