UN-3415 [REFACTOR] Replace CostCalculationHelper with litellm.cost_per_token by chandrasekharan-zipstack · Pull Request #1906 · Zipstack/unstract

chandrasekharan-zipstack · 2026-04-07T21:09:12Z

What

Replace the custom CostCalculationHelper in platform-service with litellm's built-in cost_per_token(), moving cost calculation to sdk1's Audit class (caller-side).

Why

CostCalculationHelper fetched pricing data from an external URL, cached it in file storage with a TTL, and did manual price lookups — all of which litellm already handles natively with its bundled pricing database (2645+ models).
This removes an external HTTP dependency, file storage dependency, and simplifies the platform service to pure storage.

How

audit.py (sdk1): Compute cost via litellm.cost_per_token() using the full model name (e.g. azure/gpt-4o) before stripping the provider prefix for DB storage. Send pre-computed cost_in_dollars in the payload to platform-service.
platform.py (platform-service): Read cost_in_dollars directly from the payload instead of computing it. Removed CostCalculationHelper import, provider variable, and input token branching logic.
Deleted cost_calculation.py: No longer needed — was only used by the /usage endpoint.
Cleaned up env.py: Removed MODEL_PRICES_URL, MODEL_PRICES_TTL_IN_DAYS, MODEL_PRICES_FILE_PATH env vars.
Cleaned up utils.py: Removed orphaned format_float_positional function.

Can this PR break any existing features. If yes, please list possible items. If no, please explain why. (PS: Admins do not merge the PR without this section filled)

Low risk. The model name stored in the DB remains in the same stripped format as before (e.g. gpt-4o, not azure/gpt-4o). API deployment responses and dashboard queries are unaffected.
Unknown/custom models return cost=0.0 (same behavior as before via the except Exception fallback).
The platform-service /usage endpoint is backward-compatible: if cost_in_dollars is not in the payload (e.g. from an older SDK), it defaults to 0.0.

Database Migrations

None

Env Config

Removed from platform-service: MODEL_PRICES_URL, MODEL_PRICES_TTL_IN_DAYS, MODEL_PRICES_FILE_PATH (no longer needed)

Relevant Docs

litellm cost_per_token

Related Issues or PRs

N/A

Dependencies Versions

No new dependencies. litellm is already a transitive dependency of platform-service via unstract-sdk1.

Notes on Testing

Verified locally: platform-service starts cleanly, /usage endpoint accepts requests and stores cost correctly.
Compared cost values from litellm.cost_per_token() against previous CostCalculationHelper output for azure/gpt-4o — values match.
Key test cases to verify:
- OpenAI models: gpt-4o, gpt-4o-mini
- Azure models: azure/gpt-4o
- Anthropic models: claude-sonnet-4-20250514
- Embedding models: text-embedding-3-small
- Unknown/custom models: should return cost=0.0

Screenshots

N/A

Checklist

I have read and understood the Contribution Guidelines.

Move cost calculation from platform-service to sdk1's Audit class, using litellm's built-in cost_per_token() instead of a custom helper that fetched pricing data from an external URL. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

coderabbitai · 2026-04-07T21:09:21Z

Walkthrough

Server-side cost calculation and related utilities were removed; cost is now computed in the SDK using LiteLLM and sent in the usage payload. The platform controller accepts cost_in_dollars from requests instead of deriving it server-side.

Changes

Cohort / File(s)	Summary
Server-side cost removal `platform-service/src/unstract/platform_service/helper/cost_calculation.py`, `platform-service/src/unstract/platform_service/utils.py`, `platform-service/src/unstract/platform_service/env.py`, `platform-service/src/unstract/platform_service/controller/platform.py`	Deleted `CostCalculationHelper`, removed `format_float_positional()`, removed `Env` constants `MODEL_PRICES_URL`, `MODEL_PRICES_TTL_IN_DAYS`, `MODEL_PRICES_FILE_PATH`. Updated `usage()` to read `cost_in_dollars` from request payload and removed server-side cost/token calculation logic.
Client-side cost addition (SDK) `unstract/sdk1/src/unstract/sdk1/audit.py`	Added LiteLLM `cost_per_token` usage and module logging; changed token accounting for embeddings vs. non-embeddings; preserved full `model_name` for cost lookup while sending a stripped `display_model_name`; included `cost_in_dollars` in POST payload with exception-safe fallback to `0.0`.

Sequence Diagram

sequenceDiagram
    participant SDK as SDK (Client)
    participant LiteLLM as LiteLLM
    participant PlatformService as Platform Service

    rect rgba(100, 150, 200, 0.5)
    Note over SDK,PlatformService: Old Flow (Server-side cost calc)
    SDK->>PlatformService: POST /usage (model, tokens)
    PlatformService->>PlatformService: Load pricing (cache/URL)
    PlatformService->>PlatformService: Calculate cost
    PlatformService-->>SDK: Response (includes computed cost)
    end

    rect rgba(150, 200, 100, 0.5)
    Note over SDK,PlatformService: New Flow (Client-side cost calc)
    SDK->>LiteLLM: cost_per_token(model, prompt_tokens, completion_tokens)
    LiteLLM-->>SDK: cost_in_dollars
    SDK->>PlatformService: POST /usage (model, tokens, cost_in_dollars)
    PlatformService->>PlatformService: Accept cost from payload
    PlatformService-->>SDK: Response
    end

Estimated code review effort

🎯 3 (Moderate) | ⏱️ ~25 minutes

🚥 Pre-merge checks | ✅ 3

✅ Passed checks (3 passed)

Check name	Status	Explanation
Description check	✅ Passed	The description is comprehensive and well-structured, covering all template sections including What, Why, How, breaking changes, migrations, env config, testing notes, and dependencies.
Docstring Coverage	✅ Passed	Docstring coverage is 100.00% which is sufficient. The required threshold is 80.00%.
Title check	✅ Passed	The title accurately describes the main refactoring objective: replacing CostCalculationHelper with litellm.cost_per_token, which is the primary purpose of this PR.

_{✏️ Tip: You can configure your own custom pre-merge checks in the settings.}

✨ Finishing Touches

📝 Generate docstrings

Create stacked PR
Commit on current branch

🧪 Generate unit tests (beta)

Create PR with unit tests
Commit unit tests in branch refactor/litellm-cost-calculation

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

greptile-apps · 2026-04-07T21:11:35Z

Greptile Summary

This PR refactors cost calculation by replacing the custom CostCalculationHelper (which fetched pricing from an external URL and cached it in file storage) with litellm's built-in cost_per_token(), moving the computation to sdk1's Audit class before the payload is sent to platform-service. Platform-service is simplified to pure storage — it now reads a pre-computed cost_in_dollars from the payload with a safe 0.0 default for backward compatibility with older SDK versions.

Confidence Score: 5/5

Safe to merge — clean refactor with no functional regressions and a safe backward-compatible default for older SDK clients.

No P0 or P1 issues found. The embedding-token completion_tokens concern from the previous review thread is fully addressed. Cost calculation is correct (litellm.cost_per_token returns total cost, not rate), exception handling is appropriate with a 0.0 fallback, and the DB storage format is unchanged. All remaining findings are P2 or absent.

No files require special attention.

Important Files Changed

Filename	Overview
unstract/sdk1/src/unstract/sdk1/audit.py	Cost calculation added using litellm.cost_per_token(); embedding events correctly zero completion_tokens; full model name used for cost lookup before stripping provider prefix for DB storage.
platform-service/src/unstract/platform_service/controller/platform.py	Simplified /usage endpoint: reads pre-computed cost_in_dollars from payload (default 0.0), removing CostCalculationHelper and input-token branching logic.
platform-service/src/unstract/platform_service/helper/cost_calculation.py	Deleted — external HTTP fetch, file-storage TTL caching, and manual pricing logic fully replaced by litellm's bundled pricing database.
platform-service/src/unstract/platform_service/env.py	Removed three required env vars (MODEL_PRICES_URL, MODEL_PRICES_TTL_IN_DAYS, MODEL_PRICES_FILE_PATH) that are no longer needed.
platform-service/src/unstract/platform_service/utils.py	Removed the now-orphaned format_float_positional helper function that was only used by the deleted CostCalculationHelper.

Sequence Diagram

sequenceDiagram
    participant Tool as SDK1 Tool
    participant Audit as Audit.push_usage_data()
    participant LiteLLM as litellm cost_per_token
    participant Platform as platform-service /usage

    Tool->>Audit: push_usage_data(model_name, token_counter, event_type)
    Audit->>Audit: compute prompt_tokens and completion_tokens
    Note over Audit: For embedding events, completion_tokens set to 0
    alt model_name is set and known
        Audit->>LiteLLM: cost_per_token(model, prompt_tokens, completion_tokens)
        LiteLLM-->>Audit: (prompt_cost, completion_cost)
        Audit->>Audit: cost_in_dollars = sum of costs
    else unknown or no model
        Audit->>Audit: cost_in_dollars = 0.0 (exception fallback)
    end
    Audit->>Audit: strip provider prefix from model_name for DB storage
    Audit->>Platform: POST /usage with cost_in_dollars in payload
    Platform->>Platform: read cost_in_dollars from payload (default 0.0)
    Platform->>Platform: INSERT into token_usage table
    Platform-->>Audit: 200 OK

_{Reviews (3): Last reviewed commit: "Merge branch 'main' into refactor/litell..." | Re-trigger Greptile}

coderabbitai

🧹 Nitpick comments (1)

platform-service/src/unstract/platform_service/controller/platform.py (1)
219-219: Consider validating the cost_in_dollars value from the payload.

The endpoint now trusts the client-provided cost_in_dollars value without validation. While the endpoint is protected by authentication, consider adding basic type validation to ensure data integrity:

The value could be a non-numeric type (string, None beyond default, dict, etc.)

The value could be negative, which doesn't make sense for costs
💡 Optional validation
-    cost_in_dollars = payload.get("cost_in_dollars", 0.0)
+    cost_in_dollars = payload.get("cost_in_dollars", 0.0)
+    if not isinstance(cost_in_dollars, (int, float)) or cost_in_dollars < 0:
+        cost_in_dollars = 0.0
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@platform-service/src/unstract/platform_service/controller/platform.py` at
line 219, Validate the payload's cost_in_dollars after the line where
cost_in_dollars = payload.get("cost_in_dollars", 0.0): ensure it's a numeric
value and non-negative by attempting to coerce to float (or checking isinstance
int/float) and rejecting invalid input; if coercion fails or the value is < 0,
return a 400-style validation error (or set a safe default and log) from the
controller function that contains this variable so downstream logic only sees a
validated non-negative float.

🤖 Prompt for all review comments with AI agents

Verify each finding against the current code and only fix it if needed.

Nitpick comments:
In `@platform-service/src/unstract/platform_service/controller/platform.py`:
- Line 219: Validate the payload's cost_in_dollars after the line where
cost_in_dollars = payload.get("cost_in_dollars", 0.0): ensure it's a numeric
value and non-negative by attempting to coerce to float (or checking isinstance
int/float) and rejecting invalid input; if coercion fails or the value is < 0,
return a 400-style validation error (or set a safe default and log) from the
controller function that contains this variable so downstream logic only sees a
validated non-negative float.

ℹ️ Review info

⚙️ Run configuration

Configuration used: Organization UI

Review profile: CHILL

Plan: Pro

Run ID: 2b5e73f7-f615-4b4d-937b-8c09f0b2db10

📥 Commits

Reviewing files that changed from the base of the PR and between c9ffd9d and 7df2f85.

📒 Files selected for processing (5)

platform-service/src/unstract/platform_service/controller/platform.py
platform-service/src/unstract/platform_service/env.py
platform-service/src/unstract/platform_service/helper/cost_calculation.py
platform-service/src/unstract/platform_service/utils.py
unstract/sdk1/src/unstract/sdk1/audit.py

💤 Files with no reviewable changes (3)

platform-service/src/unstract/platform_service/utils.py
platform-service/src/unstract/platform_service/env.py
platform-service/src/unstract/platform_service/helper/cost_calculation.py

Explicitly set completion_tokens to 0 for embedding events before calling cost_per_token, making the assumption that embeddings have no completion tokens explicit rather than relying on the counter always being zero. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

coderabbitai

🧹 Nitpick comments (1)

unstract/sdk1/src/unstract/sdk1/audit.py (1)

104-107: Consider logging the exception details for easier debugging.

The exception is caught but the actual error message is not logged, making it harder to diagnose failures. For unknown models this is fine, but other failures (e.g., litellm API changes, unexpected types) would be harder to debug.

Proposed improvement

         except Exception:
             logger.debug(
-                "Cost lookup failed for model %s, defaulting to 0", model_name
+                "Cost lookup failed for model %s, defaulting to 0", model_name,
+                exc_info=True,
             )

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In `@unstract/sdk1/src/unstract/sdk1/audit.py` around lines 104 - 107, The except
block in the cost lookup (in unstract.sdk1.audit.py, around the function
performing model cost lookup) currently swallows all exceptions and only logs
the model_name; modify the exception handler to include the actual exception
details—either by capturing the exception as e and adding it to the log message
(e.g., include str(e)) or by passing exc_info=True to logger.debug—so failures
from litellm API changes or unexpected types are recorded alongside the existing
"Cost lookup failed for model %s, defaulting to 0" message.

🤖 Prompt for all review comments with AI agents

Verify each finding against the current code and only fix it if needed.

Nitpick comments:
In `@unstract/sdk1/src/unstract/sdk1/audit.py`:
- Around line 104-107: The except block in the cost lookup (in
unstract.sdk1.audit.py, around the function performing model cost lookup)
currently swallows all exceptions and only logs the model_name; modify the
exception handler to include the actual exception details—either by capturing
the exception as e and adding it to the log message (e.g., include str(e)) or by
passing exc_info=True to logger.debug—so failures from litellm API changes or
unexpected types are recorded alongside the existing "Cost lookup failed for
model %s, defaulting to 0" message.

ℹ️ Review info

⚙️ Run configuration

Configuration used: Organization UI

Review profile: CHILL

Plan: Pro

Run ID: a41f4ae9-3710-44c0-8d58-4b068323c771

📥 Commits

Reviewing files that changed from the base of the PR and between 7df2f85 and 1a57c38.

📒 Files selected for processing (1)

unstract/sdk1/src/unstract/sdk1/audit.py

pk-zipstack

LGTM

github-actions · 2026-04-17T05:50:48Z

Test Results

Summary

✅ Runner Tests: 11 passed, 0 failed (11 total)
✅ SDK1 Tests: 196 passed, 0 failed (196 total)

Runner Tests - Full Report

filepath	function	$$\textcolor{#23d18b}{\tt{passed}}$$	SUBTOTAL
$$\textcolor{#23d18b}{\tt{runner/src/unstract/runner/clients/test\_docker.py}}$$	$$\textcolor{#23d18b}{\tt{test\_logs}}$$	$$\textcolor{#23d18b}{\tt{1}}$$	$$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{runner/src/unstract/runner/clients/test\_docker.py}}$$	$$\textcolor{#23d18b}{\tt{test\_cleanup}}$$	$$\textcolor{#23d18b}{\tt{1}}$$	$$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{runner/src/unstract/runner/clients/test\_docker.py}}$$	$$\textcolor{#23d18b}{\tt{test\_cleanup\_skip}}$$	$$\textcolor{#23d18b}{\tt{1}}$$	$$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{runner/src/unstract/runner/clients/test\_docker.py}}$$	$$\textcolor{#23d18b}{\tt{test\_client\_init}}$$	$$\textcolor{#23d18b}{\tt{1}}$$	$$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{runner/src/unstract/runner/clients/test\_docker.py}}$$	$$\textcolor{#23d18b}{\tt{test\_get\_image\_exists}}$$	$$\textcolor{#23d18b}{\tt{1}}$$	$$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{runner/src/unstract/runner/clients/test\_docker.py}}$$	$$\textcolor{#23d18b}{\tt{test\_get\_image}}$$	$$\textcolor{#23d18b}{\tt{1}}$$	$$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{runner/src/unstract/runner/clients/test\_docker.py}}$$	$$\textcolor{#23d18b}{\tt{test\_get\_container\_run\_config}}$$	$$\textcolor{#23d18b}{\tt{1}}$$	$$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{runner/src/unstract/runner/clients/test\_docker.py}}$$	$$\textcolor{#23d18b}{\tt{test\_get\_container\_run\_config\_without\_mount}}$$	$$\textcolor{#23d18b}{\tt{1}}$$	$$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{runner/src/unstract/runner/clients/test\_docker.py}}$$	$$\textcolor{#23d18b}{\tt{test\_run\_container}}$$	$$\textcolor{#23d18b}{\tt{1}}$$	$$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{runner/src/unstract/runner/clients/test\_docker.py}}$$	$$\textcolor{#23d18b}{\tt{test\_get\_image\_for\_sidecar}}$$	$$\textcolor{#23d18b}{\tt{1}}$$	$$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{runner/src/unstract/runner/clients/test\_docker.py}}$$	$$\textcolor{#23d18b}{\tt{test\_sidecar\_container}}$$	$$\textcolor{#23d18b}{\tt{1}}$$	$$\textcolor{#23d18b}{\tt{1}}$$
$$\textcolor{#23d18b}{\tt{TOTAL}}$$		$$\textcolor{#23d18b}{\tt{11}}$$	$$\textcolor{#23d18b}{\tt{11}}$$

SDK1 Tests - Full Report

sonarqubecloud · 2026-04-17T05:50:57Z

Quality Gate passed

Issues
0 New issues
0 Accepted issues

Measures
0 Security Hotspots
0.0% Coverage on New Code
0.0% Duplication on New Code

See analysis details on SonarQube Cloud

greptile-apps bot reviewed Apr 7, 2026

View reviewed changes

Comment thread unstract/sdk1/src/unstract/sdk1/audit.py

chandrasekharan-zipstack self-assigned this Apr 8, 2026

chandrasekharan-zipstack marked this pull request as ready for review April 8, 2026 05:29

chandrasekharan-zipstack requested review from Deepak-Kesavan and pk-zipstack April 8, 2026 05:29

coderabbitai bot reviewed Apr 8, 2026

View reviewed changes

pk-zipstack approved these changes Apr 8, 2026

View reviewed changes

Deepak-Kesavan approved these changes Apr 9, 2026

View reviewed changes

chandrasekharan-zipstack changed the title ~~[REFACTOR] Replace CostCalculationHelper with litellm.cost_per_token~~ UN-3415 [REFACTOR] Replace CostCalculationHelper with litellm.cost_per_token Apr 17, 2026

Merge branch 'main' into refactor/litellm-cost-calculation

91df0ff

muhammad-ali-e merged commit bf6db0e into main Apr 17, 2026
8 checks passed

muhammad-ali-e deleted the refactor/litellm-cost-calculation branch April 17, 2026 05:55

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

UN-3415 [REFACTOR] Replace CostCalculationHelper with litellm.cost_per_token#1906

UN-3415 [REFACTOR] Replace CostCalculationHelper with litellm.cost_per_token#1906
muhammad-ali-e merged 3 commits intomainfrom
refactor/litellm-cost-calculation

chandrasekharan-zipstack commented Apr 7, 2026

Uh oh!

coderabbitai bot commented Apr 7, 2026 •

edited

Loading

Uh oh!

greptile-apps bot commented Apr 7, 2026 •

edited

Loading

Greptile Summary

Confidence Score: 5/5

Important Files Changed

Sequence Diagram

Uh oh!

Uh oh!

coderabbitai bot left a comment

Uh oh!

coderabbitai bot left a comment

Uh oh!

pk-zipstack left a comment

Uh oh!

github-actions bot commented Apr 17, 2026

Uh oh!

sonarqubecloud bot commented Apr 17, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

chandrasekharan-zipstack commented Apr 7, 2026

What

Why

How

Can this PR break any existing features. If yes, please list possible items. If no, please explain why. (PS: Admins do not merge the PR without this section filled)

Database Migrations

Env Config

Relevant Docs

Related Issues or PRs

Dependencies Versions

Notes on Testing

Screenshots

Checklist

Uh oh!

coderabbitai bot commented Apr 7, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Walkthrough

Changes

Sequence Diagram

Estimated code review effort

Uh oh!

greptile-apps bot commented Apr 7, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Greptile Summary

Confidence Score: 5/5

Important Files Changed

Sequence Diagram

Uh oh!

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

pk-zipstack left a comment

Choose a reason for hiding this comment

Uh oh!

github-actions bot commented Apr 17, 2026

Test Results

Uh oh!

sonarqubecloud bot commented Apr 17, 2026

Quality Gate passed

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

coderabbitai bot commented Apr 7, 2026 •

edited

Loading

greptile-apps bot commented Apr 7, 2026 •

edited

Loading