chore(llma): Update LLM costs #60221

Closed
scheduled-actions-posthog[bot] wants to merge 0 commits into master from chore/llma-update-ai-costs
@scheduled-actions-posthog

This is an automated PR to update LLM pricing models.

Reviewer checklist

Please verify that there aren't any odd changes in the pricing by checking against the official pricing pages:

@greptile-apps

greptile-apps Bot commented May 27, 2026

Prompt To Fix All With AI
Fix the following 1 code review issue. Work through them one at a time, proposing concise fixes.

---

### Issue 1 of 1
nodejs/src/ingestion/ai/costs/providers/llm-costs.json:6951-6962
**qwen3.7-max prices halved but current sources show no price change**

The `default` and `alibaba` provider costs for `qwen/qwen3.7-max` were cut in half (`prompt_token`: `0.0000025` → `0.00000125`, `completion_token`: `0.0000075` → `0.00000375`). However, OpenRouter and Together.ai both currently list this model at **$2.50/M input, $7.50/M output**, which matches the *previous* values. If this halving is incorrect, cost tracking for qwen3.7-max will undercount by 50%, causing users to be charged half of what they actually owe.
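The arithmetic behind this comparison can be sketched as follows. This is a minimal illustration, not part of the PR: the per-token values are copied from the diff in this review, while the helper names `per_million` and `request_cost` are hypothetical and chosen only for the example.

```python
# Per-token USD prices for qwen/qwen3.7-max, copied from the diff in this PR.
OLD = {"prompt_token": 0.0000025, "completion_token": 0.0000075}
NEW = {"prompt_token": 0.00000125, "completion_token": 0.00000375}


def per_million(price_per_token: float) -> float:
    """Convert a per-token USD price to USD per one million tokens.

    Hypothetical helper; rounding hides floating-point noise.
    """
    return round(price_per_token * 1_000_000, 6)


def request_cost(prices: dict, prompt_tokens: int, completion_tokens: int) -> float:
    """Cost of one request under a given price entry (hypothetical helper)."""
    return (
        prices["prompt_token"] * prompt_tokens
        + prices["completion_token"] * completion_tokens
    )


if __name__ == "__main__":
    for label, prices in (("old", OLD), ("new", NEW)):
        print(
            label,
            per_million(prices["prompt_token"]),
            per_million(prices["completion_token"]),
        )
```

The old entry converts to 2.5 and 7.5 USD per million tokens, matching the listed $2.50/M and $7.50/M, while the new entry converts to 1.25 and 3.75. For a request with 1,000,000 prompt tokens and 1,000,000 completion tokens, the old entry yields roughly $10.00 and the new one roughly $5.00, which is the 50% undercount described above.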

Reviews (1): Last reviewed commit: "chore(llma): update LLM costs"

Comment on lines 6951 to 6962
  "default": {
-   "prompt_token": 0.0000025,
-   "completion_token": 0.0000075,
-   "cache_read_token": 5e-7,
-   "cache_write_token": 0.000003125
+   "prompt_token": 0.00000125,
+   "completion_token": 0.00000375,
+   "cache_read_token": 2.5e-7,
+   "cache_write_token": 0.0000015625
  },
  "alibaba": {
-   "prompt_token": 0.0000025,
-   "completion_token": 0.0000075,
-   "cache_read_token": 5e-7,
-   "cache_write_token": 0.000003125
+   "prompt_token": 0.00000125,
+   "completion_token": 0.00000375,
+   "cache_read_token": 2.5e-7,
+   "cache_write_token": 0.0000015625
  }
P1 qwen3.7-max prices halved but current sources show no price change

The default and alibaba provider costs for qwen/qwen3.7-max were cut in half (prompt_token: 0.0000025 → 0.00000125, completion_token: 0.0000075 → 0.00000375). However, OpenRouter and Together.ai both currently list this model at $2.50/M input, $7.50/M output, which matches the previous values. If this halving is incorrect, cost tracking for qwen3.7-max will undercount by 50%, causing users to be charged half of what they actually owe.
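One way to catch this kind of change before review is a relative-change guard over the old and new price entries. The sketch below is an assumption, not something the PR or the repository is known to contain: the function name `flag_large_changes` and the 25% threshold are hypothetical, and the price values are copied from the diff above.

```python
# Old and new per-token USD prices for the "default" provider entry,
# copied from the diff in this review comment.
OLD_ENTRY = {
    "prompt_token": 0.0000025,
    "completion_token": 0.0000075,
    "cache_read_token": 5e-7,
    "cache_write_token": 0.000003125,
}
NEW_ENTRY = {
    "prompt_token": 0.00000125,
    "completion_token": 0.00000375,
    "cache_read_token": 2.5e-7,
    "cache_write_token": 0.0000015625,
}


def flag_large_changes(old: dict, new: dict, max_relative_change: float = 0.25) -> dict:
    """Return {field: relative_change} for prices that moved more than the threshold.

    Hypothetical guard: the 0.25 default is an arbitrary example value.
    Fields missing from `new`, or with an old price of zero, are skipped.
    """
    flagged = {}
    for field, old_price in old.items():
        new_price = new.get(field)
        if new_price is None or old_price == 0:
            continue
        change = (new_price - old_price) / old_price
        if abs(change) > max_relative_change:
            flagged[field] = round(change, 4)
    return flagged


if __name__ == "__main__":
    # Every field in this diff dropped by half, so all four are flagged.
    print(flag_large_changes(OLD_ENTRY, NEW_ENTRY))
```

A flagged field is only a prompt for a manual check against the provider's pricing page; a genuine price cut would trip the same guard.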

@Radu-Raicea Radu-Raicea enabled auto-merge (squash) May 27, 2026 14:18
auto-merge was automatically disabled May 27, 2026 20:39

Pull request was closed

@scheduled-actions-posthog scheduled-actions-posthog Bot force-pushed the chore/llma-update-ai-costs branch from 7fa3d4e to 89a7939 May 27, 2026 20:39
@scheduled-actions-posthog

Superseded by #60355
