Python: skip hosted tools test on transient upstream MCP errors by moonbox3 · Pull Request #5296 · microsoft/agent-framework

moonbox3 · 2026-04-16T02:45:36Z

Summary

The test_anthropic_client_integration_hosted_tools test calls Anthropic's API with a hosted MCP tool pointed at learn.microsoft.com/api/mcp. Anthropic's backend connects to that URL server-side, so when MS Learn rate-limits or returns 5xx, the test fails with errors we can't retry around — and it blocks the merge queue.

This catches BadRequestError, InternalServerError, APIConnectionError, and APITimeoutError from the Anthropic SDK and pytest.skips so upstream outages don't break CI. The test still runs and validates the full hosted-tools path when the upstream MCP server is healthy.

Also broadens the image integration test assertion to use word-boundary matching with common building synonyms, since the model legitimately describes the same image with varied vocabulary.

Test plan

Unit tests pass locally (113 passed)
Verify misc-integration CI passes (hosted tools test skips on upstream error or passes when healthy)

The local MCP server can't be used for hosted tools tests because Anthropic's backend needs to reach the MCP URL from their infrastructure (not localhost on the CI runner). Revert to learn.microsoft.com/api/mcp but catch BadRequestError, InternalServerError, APIConnectionError, and APITimeoutError and pytest.skip so upstream outages don't block the merge queue.

moonbox3 · 2026-04-16T02:48:11Z

Python Test Coverage Report •

File	Stmts	Miss	Cover	Missing
TOTAL	27670	3200	88%

report-only-changed-files is enabled. No files were changed during this commit :)

Python Unit Test Overview

Tests	Skipped	Failures	Errors	Time
5593	20 💤	0 ❌	0 🔥	1m 29s ⏱️

Copilot

Pull request overview

Updates Anthropic Python integration testing to reduce CI failures caused by transient upstream MCP/hosted-tools errors when exercising the full hosted-tools path.

Changes:

Switch the hosted-tools integration test from a LOCAL_MCP_URL-based MCP server to https://learn.microsoft.com/api/mcp.
Wrap the hosted-tools request in exception handling that pytest.skips on certain Anthropic SDK errors.

Copilot · 2026-04-16T02:49:08Z

+    except (
+        anthropic.BadRequestError,


Catching anthropic.BadRequestError and skipping can mask real regressions (a 400 typically indicates an invalid request/tool schema rather than a transient upstream outage). Consider limiting the skip behavior to clearly-transient failures (e.g., connection/timeout/5xx) or inspecting the error details and only skipping when the 400 is explicitly caused by the hosted MCP call failing; otherwise re-raise so genuine test failures still fail CI.

Suggested change

except (

anthropic.BadRequestError,

except anthropic.BadRequestError as e:

error_message = str(e)

error_body = getattr(e, "body", None)

if isinstance(error_body, dict):

error_details = error_body.get("error")

if isinstance(error_details, dict):

detail_message = error_details.get("message")

if isinstance(detail_message, str) and detail_message:

error_message = f"{error_message} {detail_message}".strip()

normalized_error_message = error_message.lower()

is_hosted_mcp_failure = "mcp" in normalized_error_message and any(

marker in normalized_error_message

for marker in ("unavailable", "failed", "failure", "timeout", "timed out")

)

if is_hosted_mcp_failure:

pytest.skip(f"Upstream MCP server unavailable: {e}")

raise

except (

Copilot · 2026-04-16T02:49:08Z

+    import anthropic



import anthropic is added inside the test function, but this module is already required at import-time in this file (e.g., from anthropic.types.beta import ... near the top). Consider moving this import to the module import section (or importing only the needed exception types) for consistent import ordering and to avoid a redundant local import.

moonbox3 added the python label Apr 16, 2026

Copilot AI review requested due to automatic review settings April 16, 2026 02:45

moonbox3 self-assigned this Apr 16, 2026

Copilot started reviewing on behalf of moonbox3 April 16, 2026 02:46 View session

moonbox3 merged commit fe4cd3c into microsoft:main Apr 16, 2026
34 checks passed

Copilot AI reviewed Apr 16, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Python: skip hosted tools test on transient upstream MCP errors#5296

Python: skip hosted tools test on transient upstream MCP errors#5296
moonbox3 merged 1 commit intomicrosoft:mainfrom
moonbox3:fix-anthropic-test

moonbox3 commented Apr 16, 2026

Uh oh!

Uh oh!

moonbox3 commented Apr 16, 2026

Uh oh!

Copilot AI left a comment

Uh oh!

Copilot AI Apr 16, 2026

Uh oh!

Copilot AI Apr 16, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

-    except (
-        anthropic.BadRequestError,
+    except anthropic.BadRequestError as e:
+        error_message = str(e)
+        error_body = getattr(e, "body", None)
+        if isinstance(error_body, dict):
+            error_details = error_body.get("error")
+            if isinstance(error_details, dict):
+                detail_message = error_details.get("message")
+                if isinstance(detail_message, str) and detail_message:
+                    error_message = f"{error_message} {detail_message}".strip()
+        normalized_error_message = error_message.lower()
+        is_hosted_mcp_failure = "mcp" in normalized_error_message and any(
+            marker in normalized_error_message
+            for marker in ("unavailable", "failed", "failure", "timeout", "timed out")
+        )
+        if is_hosted_mcp_failure:
+            pytest.skip(f"Upstream MCP server unavailable: {e}")
+        raise
+    except (

Conversation

moonbox3 commented Apr 16, 2026

Summary

Test plan

Uh oh!

Uh oh!

moonbox3 commented Apr 16, 2026

Python Unit Test Overview

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Copilot AI Apr 16, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Apr 16, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants