sdk(0.1.4): accept bare cm_<id> in chat.completions, drop qualified_name (#10)
Customers had to address the same custom model in two different ways:
model.id (cm_abc123) for control-plane calls, model.qualified_name
(custom:cm_abc123) for chat. The custom: prefix is a wire-level
routing detail of the model gateway — it shouldn't have leaked into
public ergonomics.
This PR collapses both paths to model.id:
- chat.completions.create (sync + async) now normalizes the model
param: bare cm_<hex> ids get the gateway's "custom:" prefix
prepended before delegating to the openai client. Pre-prefixed
strings ("custom:cm_...") and first-party catalog ids
("meta-llama/...", "Qwen/...") are passed through unchanged. Any
non-string is also passed through so we don't shadow the openai
client's own TypeError messaging.
- _extract_custom_model_id (used by auto-wake to find which model
to wake) is unchanged in behavior — it now sees the
post-normalization value, so the cold-start retry loop continues
to fire on bare-cm calls.
- CustomModel.qualified_name property removed. Migration is
model.qualified_name -> model.id; the rare caller that needed
the wire-level form can write f"custom:{model.id}".
No backend change, no protocol change, no latency cost — the wire
request is byte-identical to the prefixed form. Pre-1.0 patch
release because (a) qualified_name was undocumented outside one
example and shipped two weeks ago in 0.1.0, (b) the new behavior
is purely additive on the input side.
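The migration for downstream code is mechanical; a minimal illustration, using a hypothetical stand-in for the SDK's CustomModel class:

```python
from dataclasses import dataclass

# Hypothetical stand-in for the SDK's CustomModel, shown only to
# illustrate the migration; the real class lives in the graphn SDK.
@dataclass
class CustomModel:
    id: str

model = CustomModel(id="cm_abc123")

# Before (0.1.0-0.1.3): chat calls took model.qualified_name
# ("custom:cm_abc123"). After (0.1.4): pass model.id everywhere;
# the SDK prepends the prefix itself.
chat_model = model.id

# The rare caller that still needs the wire-level form builds it by hand:
wire_model = f"custom:{model.id}"
```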
README, docs/cold-starts.md, scripts/e2e.py, and every example
under examples/ updated to use model.id consistently.
examples/openai_compat.py keeps the explicit custom:cm_... form
with a note explaining the prefix is the cost of bypassing
graphn.Client.
Tests: added a parametrized test covering bare cm, prefixed cm,
HuggingFace-style, and non-string inputs through the normalizer;
updated the auto-wake test to feed the bare id and assert the
prefixed form goes out on the wire; removed the qualified_name
property test. 43/43 pass, ruff clean.
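The wire assertion in the auto-wake test can be sketched with a recording stub in place of the wrapped openai client — all names here are hypothetical, not the SDK's real internals:

```python
import re

_BARE_CM_ID = re.compile(r"^cm_[0-9a-f]+$")

def _normalize_model_param(model: object) -> object:
    # Assumed normalizer, as described in the PR body.
    if isinstance(model, str) and _BARE_CM_ID.fullmatch(model):
        return f"custom:{model}"
    return model

class RecordingDelegate:
    """Stands in for the wrapped openai client; records what hits the wire."""
    def __init__(self) -> None:
        self.sent_models: list[str] = []

    def create(self, *, model, **kwargs):
        self.sent_models.append(model)
        return {"model": model}

def chat_create(delegate, *, model, **kwargs):
    # The SDK normalizes before delegating, so the wire always sees the
    # prefixed form for custom models.
    return delegate.create(model=_normalize_model_param(model), **kwargs)

# Feed the bare id, assert the prefixed form goes out on the wire.
delegate = RecordingDelegate()
chat_create(delegate, model="cm_abc123")
assert delegate.sent_models == ["custom:cm_abc123"]
```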
Pre-merge e2e against beta — passed. Auto-wake (cold-start retry) was not exercised live because both models in the workspace were warm; that path is untouched by this PR — it consumes the post-normalization model string.
A few notes beyond the summary above:
- qualified_name is being removed because the surface that justified its existence — passing it to chat — is now redundant. If its removal bites anyone in the wild, we'll restore it as a deprecated alias.
- Companion docs PR for the BYOM tutorial on agent-foundry: voltagepark/agent-foundry#818.
- The e2e flow covers cm_<id> import → wait_until_ready → chat completion.
- Release mechanics: the auto-tag job creates v0.1.4, build + publish ship to PyPI in the same workflow run, and pip install graphn==0.1.4 works about a minute later.