[MISC] Fix zero cost tracking for OpenAI-compatible adapter by chandrasekharan-zipstack · Pull Request #1985 · Zipstack/unstract

chandrasekharan-zipstack · 2026-05-22T12:38:00Z

What

LLM usage recorded by the OpenAI-compatible (custom_openai) adapter always has cost_in_dollars = 0.

Why

validate_model prefixes the model with custom_openai/. _record_usage prices via litellm.cost_per_token(model="custom_openai/..."), but custom_openai is a generic passthrough provider with no entry in LiteLLM's price map — the lookup fails and cost falls back to 0.

How

Mirror the Azure adapter's existing cost_model mechanism (model = routing identity, cost_model = pricing identity):

When api_base points at OpenAI's own HTTPS API host, set cost_model to the bare model name (prefix stripped). LiteLLM's price map then resolves real OpenAI rates.
For any other endpoint, cost_model is left unset. Non-OpenAI gateways (vLLM, self-hosted, third-party resellers) serve same-named models at different — or no — market prices, so guessing OpenAI rates would be confidently wrong. Cost stays 0 (honest "unresolved"), unchanged from today.

cost_model is popped in LLM.__init__ and never reaches litellm.completion(), so routing is unaffected.

Can this PR break any existing features. If yes, please list possible items. If no, please explain why. (PS: Admins do not merge the PR without this section filled)

No. The change only adds a cost_model key on the OpenAI-endpoint path; cost_model is consumed solely by cost lookup and never sent to litellm.completion(). All other endpoints behave exactly as before (cost 0). Routing, request params, and reasoning handling are untouched.

Related Issues or PRs

Follow-up to #1983 (OpenAI gpt-5 / o-series support in the openai-compatible adapter).

Notes on Testing

4 new cases in test_openai_compatible_adapter.py: cost_model set for the OpenAI endpoint, openai/ sub-prefix preserved, cost_model absent for other gateways, and cost_model stable across validate() re-validation. Full suite: 22 tests pass.

Screenshots

Checklist

I have read and understood the Contribution Guidelines.

🤖 Generated with Claude Code

The OpenAI-compatible adapter prefixes the model with `custom_openai/`, which has no entry in LiteLLM's price map, so `cost_per_token` fails and usage is recorded with cost 0. Mirror the Azure adapter's `cost_model` approach: when the endpoint is OpenAI's own API, set `cost_model` to the bare model name so pricing resolves. Other gateways serve same-named models at different prices, so their cost is intentionally left unresolved rather than guessed. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

coderabbitai · 2026-05-22T12:38:16Z

Walkthrough

Adds URL parsing and an endpoint-detection helper to identify the official OpenAI API host, and updates OpenAICompatibleLLMParameters.validate() to populate cost_model (by removing a leading custom_openai/ prefix) only when api_base targets api.openai.com; adds tests covering official vs third-party gateway behavior and revalidation stability.

Changes

OpenAI endpoint conditional cost model assignment

Layer / File(s)	Summary
Endpoint detection helper `unstract/sdk1/src/unstract/sdk1/adapters/base1.py`	Adds `urlparse` import and `_is_openai_api_endpoint(api_base)` to detect HTTPS api.openai.com endpoints.
Conditional cost model assignment in validation `unstract/sdk1/src/unstract/sdk1/adapters/base1.py`	Updates `OpenAICompatibleLLMParameters.validate()` to set `validated["cost_model"]` by stripping a leading `custom_openai/` from `validated["model"]` only when `api_base` is the official OpenAI endpoint.
Test coverage for cost model behavior `unstract/sdk1/tests/test_openai_compatible_adapter.py`	Adds four pytest cases validating cost_model is set for official OpenAI endpoints, preserves existing `openai/` subprefixes, remains unset for non-OpenAI gateways, and is stable across revalidation.

Estimated code review effort

🎯 3 (Moderate) | ⏱️ ~20 minutes

🚥 Pre-merge checks | ✅ 4 | ❌ 1

❌ Failed checks (1 warning)

Check name	Status	Explanation	Resolution
Docstring Coverage	⚠️ Warning	Docstring coverage is 22.22% which is insufficient. The required threshold is 80.00%.	Write docstrings for the functions missing them to satisfy the coverage threshold.

✅ Passed checks (4 passed)

Check name	Status	Explanation
Title check	✅ Passed	The title "[MISC] Fix zero cost tracking for OpenAI-compatible adapter" clearly summarizes the main change: fixing zero cost tracking for the OpenAI-compatible adapter.
Linked Issues check	✅ Passed	Check skipped because no linked issues were found for this pull request.
Out of Scope Changes check	✅ Passed	Check skipped because no linked issues were found for this pull request.
Description check	✅ Passed	The pull request description comprehensively covers all required sections with clear explanation of the problem, solution, testing, and impact assessment.

_{✏️ Tip: You can configure your own custom pre-merge checks in the settings.}

✨ Finishing Touches

📝 Generate docstrings

Create stacked PR
Commit on current branch

🧪 Generate unit tests (beta)

Create PR with unit tests
Commit unit tests in branch fix-openai-compatible-cost-model

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

greptile-apps · 2026-05-22T12:42:21Z

Greptile Summary

This PR fixes zero-cost tracking for the custom_openai adapter by mirroring the Azure adapter's cost_model mechanism: when api_base points at OpenAI's own HTTPS host, cost_model is set to the bare model name (prefix stripped) so LiteLLM's price map resolves correctly.

Adds _is_openai_api_endpoint() helper using urlparse with an explicit HTTPS-scheme guard to detect OpenAI's own host.
Sets cost_model in validate() only for OpenAI's endpoint; other gateways remain at unresolved cost to avoid guessing rates for third-party or self-hosted providers.
Adds four new tests covering cost-model injection, sub-prefix preservation, non-OpenAI gateways, and re-validation idempotency.

Confidence Score: 5/5

Safe to merge — the change is narrowly scoped to cost tracking and cannot affect routing or API calls.

The cost_model key is stripped by LLM.__init__ before reaching litellm.completion(), so there is no path by which this change affects actual API routing or model dispatch. The HTTPS-scheme guard correctly prevents non-HTTPS lookalike URLs from matching. The slice operation is always safe because validate_model guarantees the prefix is present before the cost block runs. Re-validation idempotency is explicitly tested. No correctness, security, or data-integrity concerns were found.

No files require special attention.

Important Files Changed

Filename	Overview
unstract/sdk1/src/unstract/sdk1/adapters/base1.py	Adds `_is_openai_api_endpoint()` helper (HTTPS + hostname check) and a `cost_model` injection block in `OpenAICompatibleLLMParameters.validate()` — logic is correct, safe, and idempotent.
unstract/sdk1/tests/test_openai_compatible_adapter.py	Four new test cases added covering OpenAI endpoint cost-model resolution, sub-prefix preservation, non-OpenAI gateway (no cost_model), and re-validation idempotency — all previously raised concerns are now covered.

Flowchart

%%{init: {'theme': 'neutral'}}%%
flowchart TD
    A["validate(adapter_metadata)"] --> B["validate_model()\nmodel = custom_openai/<name>"]
    B --> C["Pydantic model_dump()\nvalidated dict (no cost_model)"]
    C --> D{"_is_openai_api_endpoint\n(api_base)?"}
    D -- "Yes\nscheme==https AND\nhostname==api.openai.com" --> E["validated['cost_model'] =\nmodel[len('custom_openai/'):]"]
    D -- "No\nother gateway / None" --> F["cost_model unset\n(stays 0 — honest)"]
    E --> G["return validated"]
    F --> G
    G --> H["LLM.__init__ pops cost_model\nfor pricing lookup only"]
    H --> I["litellm.cost_per_token\n(model=cost_model)"]

_{Reviews (2): Last reviewed commit: "[MISC] Address review: require HTTPS sch..." | Re-trigger Greptile}

- _is_openai_api_endpoint now requires the https scheme so a plain-http host cannot trigger cost-model resolution. - Add a test asserting cost_model survives validate() re-validation. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

sonarqubecloud · 2026-05-22T13:23:30Z

Quality Gate passed

Issues
0 New issues
0 Accepted issues

Measures
0 Security Hotspots
0.0% Coverage on New Code
0.0% Duplication on New Code

See analysis details on SonarQube Cloud

github-actions · 2026-05-22T13:23:44Z

Test Results

Summary

✅ Runner Tests: 11 passed, 0 failed (11 total)
✅ SDK1 Tests: 347 passed, 0 failed (347 total)

Runner Tests - Full Report

filepath	function	$$\textcolor{#23d18b}{\tt{passed}}$$	SUBTOTAL
$$\textcolor{#23d18b}{\tt{runner/src/unstract/runner/clients/test\_docker.py}}$$	$$\textcolor{#23d18b}{\tt{test\_logs}}$$	$$\textcolor{#23d18b}{\tt{1}}$$	$$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{runner/src/unstract/runner/clients/test\_docker.py}}$$	$$\textcolor{#23d18b}{\tt{test\_cleanup}}$$	$$\textcolor{#23d18b}{\tt{1}}$$	$$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{runner/src/unstract/runner/clients/test\_docker.py}}$$	$$\textcolor{#23d18b}{\tt{test\_cleanup\_skip}}$$	$$\textcolor{#23d18b}{\tt{1}}$$	$$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{runner/src/unstract/runner/clients/test\_docker.py}}$$	$$\textcolor{#23d18b}{\tt{test\_client\_init}}$$	$$\textcolor{#23d18b}{\tt{1}}$$	$$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{runner/src/unstract/runner/clients/test\_docker.py}}$$	$$\textcolor{#23d18b}{\tt{test\_get\_image\_exists}}$$	$$\textcolor{#23d18b}{\tt{1}}$$	$$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{runner/src/unstract/runner/clients/test\_docker.py}}$$	$$\textcolor{#23d18b}{\tt{test\_get\_image}}$$	$$\textcolor{#23d18b}{\tt{1}}$$	$$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{runner/src/unstract/runner/clients/test\_docker.py}}$$	$$\textcolor{#23d18b}{\tt{test\_get\_container\_run\_config}}$$	$$\textcolor{#23d18b}{\tt{1}}$$	$$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{runner/src/unstract/runner/clients/test\_docker.py}}$$	$$\textcolor{#23d18b}{\tt{test\_get\_container\_run\_config\_without\_mount}}$$	$$\textcolor{#23d18b}{\tt{1}}$$	$$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{runner/src/unstract/runner/clients/test\_docker.py}}$$	$$\textcolor{#23d18b}{\tt{test\_run\_container}}$$	$$\textcolor{#23d18b}{\tt{1}}$$	$$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{runner/src/unstract/runner/clients/test\_docker.py}}$$	$$\textcolor{#23d18b}{\tt{test\_get\_image\_for\_sidecar}}$$	$$\textcolor{#23d18b}{\tt{1}}$$	$$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{runner/src/unstract/runner/clients/test\_docker.py}}$$	$$\textcolor{#23d18b}{\tt{test\_sidecar\_container}}$$	$$\textcolor{#23d18b}{\tt{1}}$$	$$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{TOTAL}}$$		$$\textcolor{#23d18b}{\tt{11}}$$	$$\textcolor{#23d18b}{\tt{11}}$$

SDK1 Tests - Full Report

chandrasekharan-zipstack self-assigned this May 22, 2026

greptile-apps Bot reviewed May 22, 2026

View reviewed changes

Comment thread unstract/sdk1/src/unstract/sdk1/adapters/base1.py Outdated

Comment thread unstract/sdk1/tests/test_openai_compatible_adapter.py

chandrasekharan-zipstack requested review from pk-zipstack and vishnuszipstack May 22, 2026 13:21

vishnuszipstack approved these changes May 22, 2026

View reviewed changes

Deepak-Kesavan approved these changes May 22, 2026

View reviewed changes

chandrasekharan-zipstack merged commit f08cd08 into main May 22, 2026
8 checks passed

chandrasekharan-zipstack deleted the fix-openai-compatible-cost-model branch May 22, 2026 13:45

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[MISC] Fix zero cost tracking for OpenAI-compatible adapter#1985

[MISC] Fix zero cost tracking for OpenAI-compatible adapter#1985
chandrasekharan-zipstack merged 2 commits into
mainfrom
fix-openai-compatible-cost-model

chandrasekharan-zipstack commented May 22, 2026 •

edited

Loading

Uh oh!

coderabbitai Bot commented May 22, 2026 •

edited

Loading

❌ Failed checks (1 warning)

Uh oh!

greptile-apps Bot commented May 22, 2026 •

edited

Loading

Greptile Summary

Confidence Score: 5/5

Important Files Changed

Flowchart

Uh oh!

Uh oh!

Uh oh!

sonarqubecloud Bot commented May 22, 2026

Uh oh!

github-actions Bot commented May 22, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

chandrasekharan-zipstack commented May 22, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What

Why

How

Can this PR break any existing features. If yes, please list possible items. If no, please explain why. (PS: Admins do not merge the PR without this section filled)

Related Issues or PRs

Notes on Testing

Screenshots

Checklist

Uh oh!

coderabbitai Bot commented May 22, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Walkthrough

Changes

Estimated code review effort

❌ Failed checks (1 warning)

Uh oh!

greptile-apps Bot commented May 22, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Greptile Summary

Confidence Score: 5/5

Important Files Changed

Flowchart

Uh oh!

Uh oh!

Uh oh!

sonarqubecloud Bot commented May 22, 2026

Quality Gate passed

Uh oh!

github-actions Bot commented May 22, 2026

Test Results

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

chandrasekharan-zipstack commented May 22, 2026 •

edited

Loading

coderabbitai Bot commented May 22, 2026 •

edited

Loading

greptile-apps Bot commented May 22, 2026 •

edited

Loading