feat: Native Anthropic adapter with shared HTTP client infrastructure by nabinchha · Pull Request #426 · NVIDIA-NeMo/DataDesigner

nabinchha · 2026-03-17T19:04:14Z

Summary

Fourth PR in the model facade overhaul series (plan, architecture notes). Adds the native Anthropic Messages API adapter and routes provider_type="anthropic" through the native client path. Extracts shared native httpx transport, lifecycle, timeout, and error-wrapping logic into HttpModelClient and http_helpers.py so AnthropicClient and OpenAICompatibleClient share the same HTTP machinery while keeping provider-specific translation separate.

Previous PRs:

PR-1: feat: canonical model client types, protocols, and LiteLLM bridge adapter #359 - canonical model client types, protocols, and LiteLLM bridge adapter
PR-2: refactor: Decouple ModelFacade from LiteLLM via ModelClient adapter #373 - decouple ModelFacade from LiteLLM via ModelClient adapter
PR-3: feat: Native OpenAI adapter with retry and AIMD throttle infrastructure #402 - native OpenAI adapter with retry and AIMD throttle infrastructure

Changes

Added

anthropic.py — native Anthropic Messages API adapter with /v1 endpoint handling
anthropic_translation.py — request/response translation (system message lifting, tool turns, image blocks, empty text block filtering)
http_helpers.py — shared HTTP transport utilities
http_model_client.py — abstract base for native HTTP adapters with lazy client init and close semantics
model-facade-overhaul-pr-4-architecture-notes.md — architecture notes for this PR
ProviderErrorKind.QUOTA_EXCEEDED — new error kind for credit/billing failures (e.g. Anthropic "credit balance is too low")
ModelQuotaExceededError — user-facing error for quota/billing issues
_attach_provider_message helper — surfaces raw provider error messages in formatted output for HTTP 400 errors

Changed

openai_compatible.py — refactored to inherit from HttpModelClient
factory.py — routes provider_type="anthropic" to native adapter, preserves bridge fallback
errors.py (client) — extended _looks_like_unsupported_params_error for mutually exclusive params, added _looks_like_quota_exceeded_error
errors.py (model) — FormattedLLMErrorMessage now includes optional provider_message (shown before Cause for 400s); _raise_from_provider_error handles QUOTA_EXCEEDED and attaches provider messages selectively

Fixed

Anthropic tool calling flow: empty text blocks ({"type": "text", "text": ""}) in assistant messages with tool calls are now filtered out — Anthropic API rejects them (f5b0b39e)
Anthropic /v1 endpoint handling: _get_messages_route() avoids /v1/v1/messages duplication when endpoint already includes /v1
Error classification: Anthropic "credit balance is too low" (HTTP 400) now maps to ModelQuotaExceededError instead of generic ModelBadRequestError; "temperature and top_p cannot both be specified" maps to ModelUnsupportedParamsError

Attention Areas

Reviewers: Please pay special attention to the following:

anthropic_translation.py — request/response translation, especially system message lifting, tool turns, image block conversion, and empty text block filtering
http_model_client.py — shared lazy client initialization and close semantics across native adapters
factory.py — anthropic routing and bridge fallback behavior
errors.py (model) — _attach_provider_message helper and selective provider message surfacing logic

Test plan

uv run ruff check on all changed source files
uv run pytest on all new and updated test files
E2E smoke test against Anthropic endpoint (text generation, structured output, tool calling, image context)

Description updated with AI

…acade-guts-pr1

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

…provements - Wrap all LiteLLM router calls in try/except to normalize raw exceptions into canonical ProviderError at the bridge boundary (blocking review item) - Extract reusable response-parsing helpers into clients/parsing.py for shared use across future native adapters - Add async image parsing path using httpx.AsyncClient to avoid blocking the event loop in agenerate_image - Add retry_after field to ProviderError for future retry engine support - Fix _to_int_or_none to parse numeric strings from providers - Create test conftest.py with shared mock_router/bridge_client fixtures - Parametrize duplicate image generation and error mapping tests - Add tests for exception wrapping across all bridge methods

…larity - Parse RFC 7231 HTTP-date strings in Retry-After header (used by Azure and Anthropic during rate-limiting) in addition to numeric delay-seconds - Clarify collect_non_none_optional_fields docstring explaining why f.default is None is the correct check for optional field forwarding - Add tests for HTTP-date and garbage Retry-After values

- Fix misleading comment about prompt field defaults in _IMAGE_EXCLUDE - Handle list-format detail arrays in _extract_structured_message for FastAPI/Pydantic validation errors - Document scope boundary for vision content in collect_raw_image_candidates

…el-facade-guts-pr2

...s/data-designer-engine/src/data_designer/engine/models/clients/adapters/http_model_client.py

...ta-designer-engine/src/data_designer/engine/models/clients/adapters/anthropic_translation.py

andreatgretel · 2026-03-18T15:51:32Z

(AR) Warning: AnthropicClient not exported from adapters __init__.py

packages/data-designer-engine/src/data_designer/engine/models/clients/adapters/__init__.py

What: __all__ exports OpenAICompatibleClient but omits AnthropicClient, creating an inconsistent public surface.

Why: Both are peer adapters in the factory routing table. The asymmetry suggests AnthropicClient is secondary, which does not match the design.

Suggestion:

Add AnthropicClient to __all__ alongside OpenAICompatibleClient.

andreatgretel · 2026-03-18T15:51:37Z

(AR) Warning: Missing litellm_bridge env override test for anthropic provider

packages/data-designer-engine/tests/engine/models/clients/test_factory.py

What: test_bridge_env_override_forces_bridge_for_openai_provider exists but there is no equivalent test for the anthropic provider type.

Why: The env-var bypass is part of the routing contract. A refactor moving the check below the anthropic branch would go undetected.

Suggestion:

Add a parallel test verifying DATA_DESIGNER_MODEL_BACKEND=litellm_bridge overrides the anthropic provider to bridge.

andreatgretel · 2026-03-18T15:51:40Z

(AR) Warning: Missing ConnectionError and non-JSON response tests for AnthropicClient

packages/data-designer-engine/tests/engine/models/clients/test_anthropic.py

What: test_openai_compatible.py has transport_connection_error and non_json_response tests; test_anthropic.py does not.

Why: Symmetric test coverage ensures adapter-specific code (headers, payload building) does not short-circuit shared error paths.

Suggestion:

Add test_transport_connection_error_raises_provider_error and test_non_json_response_raises_provider_error to test_anthropic.py.

andreatgretel · 2026-03-18T15:51:44Z

(AR) Suggestion: Mock helpers duplicated across three test files

packages/data-designer-engine/tests/engine/models/clients/test_anthropic.py

What: _mock_httpx_response, _make_sync_client, and _make_async_client are defined independently in multiple test files with different resp.text defaults.

Why: AGENTS.md says shared fixtures belong in conftest.py. The text default difference (json.dumps vs empty string) may mask subtle bugs.

Suggestion:

Move common mock helpers to conftest.py and reconcile the text default.

andreatgretel · 2026-03-18T15:51:47Z

(AR) Suggestion: Factory tests don't assert resolved API key is forwarded

packages/data-designer-engine/tests/engine/models/clients/test_factory.py

What: Factory routing tests only check isinstance(client, ...) but don't verify the resolved API key reaches the client.

Why: Testing only the return type exercises ~30% of the function's routing logic while leaving key resolution untested.

Suggestion:

Add assert client._api_key == 'resolved-key' in the routing tests.

andreatgretel · 2026-03-18T15:52:02Z

(AR) This PR adds a native Anthropic Messages API adapter with clean separation between translation logic and HTTP transport, extracts shared infrastructure into an HttpModelClient base class, and updates the factory routing. The review covered all 12 changed files (+2455/-236 lines), ran linting (ruff, all passed) and tests (135 passed in 3.50s).

The architecture is well-executed — the stateless translation module is independently testable, the DRY refactoring of HttpModelClient removes real duplication without over-abstracting, and test coverage is thorough across translation, lifecycle, and error paths. The one critical finding is that close() accesses httpx.AsyncClient._transport, a private attribute that could break on httpx upgrades and prevent sync client cleanup. The remaining warnings cover a blocking threading.Lock in async paths (not urgent until AsyncTaskScheduler lands), unused _model_id state, an __init__.py export inconsistency, and a few test coverage gaps.

Verdict: Ship it (with nits) — 1 critical, 5 warnings, 3 suggestions.

- Handle /v1 in Anthropic endpoint gracefully to avoid path duplication - Add QUOTA_EXCEEDED provider error kind for credit/billing failures - Extend UNSUPPORTED_PARAMS detection for mutually exclusive params - Surface raw provider message in formatted errors for 400 status codes - Consolidate provider message helpers into single _attach_provider_message Made-with: Cursor

- Fix close() double-close of shared transport by closing self._transport directly instead of accessing private aclient._transport (critical) - Add TODO for threading.Lock → asyncio.Lock split (plan-346) - Remove unused _model_id from HttpModelClient and all callers - Export AnthropicClient from adapters __init__.py - Filter empty text blocks in translate_tool_result_content join - Move mock helpers to conftest.py with consistent json.dumps text default - Add __init__.py files to enable absolute imports from test conftest - Add bridge env override test for anthropic provider - Add ConnectionError and non-JSON response tests for AnthropicClient - Assert secret_resolver.resolve called with correct key ref in factory tests Made-with: Cursor

nabinchha · 2026-03-18T19:45:49Z

Addressed all review findings in b8add5d:

Re: AnthropicClient not exported from adapters __init__.py (#426 (comment))
→ Added AnthropicClient to __all__ in adapters/__init__.py.

Re: Missing litellm_bridge env override test for anthropic provider (#426 (comment))
→ Added test_bridge_env_override_forces_bridge_for_anthropic_provider in test_factory.py.

Re: Missing ConnectionError and non-JSON response tests for AnthropicClient (#426 (comment))
→ Added test_transport_connection_error_raises_provider_error and test_non_json_response_raises_provider_error in test_anthropic.py.

Re: Mock helpers duplicated across three test files (#426 (comment))
→ Moved mock_httpx_response, make_mock_sync_client, make_mock_async_client to conftest.py with consistent json.dumps text default. Added __init__.py files to enable absolute imports from tests.engine.models.clients.conftest.

Re: Factory tests don't assert resolved API key is forwarded (#426 (comment))
→ Provider fixtures now set api_key and tests assert secret_resolver.resolve is called with the correct key ref (avoids accessing private _api_key).

andreatgretel · 2026-03-19T14:33:01Z

...s/data-designer-engine/src/data_designer/engine/models/clients/adapters/http_model_client.py

+                )
+            return self._aclient
+
+    def close(self) -> None:


I think close() still leaves the real AsyncClient open after an async request path. I hit the live Anthropic path with acompletion(), then called close(), and client._aclient.is_closed stayed False while aclose() flips it to True. Maybe worth tightening close() so sync teardown fully releases async-owned resources too.

packages/data-designer-engine/src/data_designer/engine/models/errors.py

andreatgretel · 2026-03-19T14:33:26Z

I ran some extra QA on this branch with live Anthropic smoke tests through both the native path and the LiteLLM bridge for text generation, tool calling, multimodal input, and an unsupported embedding call. Text and tool flows looked good on both paths, and native multimodal worked in the live check. Just out of curiosity, I also ran a small live latency check on Claude Sonnet; in that sample I did not see a consistent latency difference in favor of the native path. Not blocking, just sharing the extra validation data point. The unsupported-capability case still hits the generic error path noted inline above.

andreatgretel · 2026-03-19T14:45:28Z

quick docs question: now that provider_type="anthropic" routes to the native adapter, are you planning to update the model-provider docs in a follow-up? a couple of spots still read as if provider_type only means “openai-compatible API format”.

andreatgretel · 2026-03-19T15:03:20Z

...s/data-designer-engine/src/data_designer/engine/models/clients/adapters/http_model_client.py

+            # releases the connection pool without re-closing the transport.
+            client.close()
+
+    async def aclose(self) -> None:


same transport leak as the close() one from earlier, but on the async side. close() got explicit transport cleanup in b8add5d but aclose() doesn't have it. since the transport is injected, httpx won't close it for you - needs the same capture-and-close pattern.

andreatgretel

lgtm. clean code, good test coverage, e2e works against live anthropic.

two nits inline - the close/aclose transport leak and the UNSUPPORTED_CAPABILITY error mapping. not blocking.

- Add ModelUnsupportedCapabilityError so unsupported operations (e.g. Anthropic embeddings/image-generation) surface a specific error instead of falling through to generic ModelAPIError - Forward the provider's operation-specific message in the cause - Add parametrized test case for the new error path

nabinchha and others added 30 commits February 19, 2026 15:50

plans for model facade overhaul

ab30a2d

update plan

43824ea

add review

2a5f1e4

address feedback + add more details after several self reviews

f945d5b

update plan doc

dfa3817

Merge branch 'main' into nm/overhaul-model-facade-guts

5b18f74

address nits

0f449a7

Merge branch 'nm/overhaul-model-facade-guts' into nm/overhaul-model-f…

37f092a

…acade-guts-pr1

Add cannonical objects

08e57f8

Merge branch 'main' into nm/overhaul-model-facade-guts-pr1

3ab18ee

self-review feedback + address

34349c7

add LiteLLMRouter protocol to strongly type bridge router param

6aae4b6

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

simplify some things

2a53d37

add a protol for http response like object

4e2f3af

move HttpResponse

b1c85f2

update PR-1 architecture notes for lifecycle and router protocol

f6dc769

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Merge branch 'main' into nm/overhaul-model-facade-guts-pr1

b6b4028

Use contextlib to dry out some code

ba22397

Merge branch 'main' into nm/overhaul-model-facade-guts-pr1

c390912

add PR-2 architecture notes for model facade overhaul

828cc49

save progress on pr2

89a6d4e

Merge branch 'main' into nm/overhaul-model-facade-guts-pr1

e527503

Merge branch 'nm/overhaul-model-facade-guts-pr1' into nm/overhaul-mod…

f6fa447

…el-facade-guts-pr2

small refactor

b8579c2

address feedback

61024c0

Merge branch 'nm/overhaul-model-facade-guts-pr1' into nm/overhaul-mod…

d47d508

…el-facade-guts-pr2

Address greptile comment in pr1

49a45ba

Merge branch 'main' into nm/overhaul-model-facade-guts-pr4

d3e4482

andreatgretel reviewed Mar 18, 2026

View reviewed changes

...s/data-designer-engine/src/data_designer/engine/models/clients/adapters/http_model_client.py Outdated Show resolved Hide resolved

andreatgretel reviewed Mar 18, 2026

View reviewed changes

...s/data-designer-engine/src/data_designer/engine/models/clients/adapters/http_model_client.py Show resolved Hide resolved

andreatgretel reviewed Mar 18, 2026

View reviewed changes

...s/data-designer-engine/src/data_designer/engine/models/clients/adapters/http_model_client.py Outdated Show resolved Hide resolved

andreatgretel reviewed Mar 18, 2026

View reviewed changes

...ta-designer-engine/src/data_designer/engine/models/clients/adapters/anthropic_translation.py Outdated Show resolved Hide resolved

nabinchha added 2 commits March 18, 2026 13:20

nabinchha added 2 commits March 18, 2026 13:46

Update license headers

bed3c54

Fix anthropic tool call flow

f5b0b39

nabinchha requested a review from andreatgretel March 18, 2026 19:54

Merge branch 'main' into nm/overhaul-model-facade-guts-pr4

05dab75

andreatgretel reviewed Mar 19, 2026

View reviewed changes

packages/data-designer-engine/src/data_designer/engine/models/errors.py Outdated Show resolved Hide resolved

andreatgretel reviewed Mar 19, 2026

View reviewed changes

andreatgretel previously approved these changes Mar 19, 2026

View reviewed changes

nabinchha dismissed andreatgretel’s stale review via 8778678 March 19, 2026 16:04

andreatgretel approved these changes Mar 19, 2026

View reviewed changes

Merge branch 'main' into nm/overhaul-model-facade-guts-pr4

3e8cac4

nabinchha merged commit a9a16ae into main Mar 19, 2026
47 checks passed

nabinchha deleted the nm/overhaul-model-facade-guts-pr4 branch March 19, 2026 17:18

Conversation

nabinchha commented Mar 17, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Changes

Added

Changed

Fixed

Attention Areas

Test plan

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

andreatgretel commented Mar 18, 2026

Uh oh!

andreatgretel commented Mar 18, 2026

Uh oh!

andreatgretel commented Mar 18, 2026

Uh oh!

andreatgretel commented Mar 18, 2026

Uh oh!

andreatgretel commented Mar 18, 2026

Uh oh!

andreatgretel commented Mar 18, 2026

Uh oh!

nabinchha commented Mar 18, 2026

Uh oh!

andreatgretel Mar 19, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

andreatgretel commented Mar 19, 2026

Uh oh!

andreatgretel commented Mar 19, 2026

Uh oh!

andreatgretel Mar 19, 2026

Choose a reason for hiding this comment

Uh oh!

andreatgretel left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

nabinchha commented Mar 17, 2026 •

edited

Loading