Conversation

@sebastiand-cerebras
Contributor

@sebastiand-cerebras sebastiand-cerebras commented Dec 4, 2025

This PR adds a specific configuration for the Cerebras provider to optimize rate limit handling and integration tracking.

Key changes:

  • Conservative Token Limit: Sets maxCompletionTokens to 16k. The Cerebras rate limiter estimates token consumption by reserving the full max_completion_tokens quota upfront. Using a conservative default prevents premature rate limiting, ensuring smoother operation even when actual generation is small.
  • Integration Header: Adds the `X-Cerebras-3rd-Party-Integration: opencode` header.
  • Configuration: Sets `autoload: false`.
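As a rough illustration, a provider entry covering the three bullets above might look like the sketch below. The field names (`options.headers`, `limit.output`, `autoload`) are assumptions for illustration only; the actual schema used by the opencode provider registry may differ.

```json
{
  "provider": {
    "cerebras": {
      "autoload": false,
      "options": {
        "headers": {
          "X-Cerebras-3rd-Party-Integration": "opencode"
        }
      },
      "models": {
        "gpt-oss-120b": {
          "limit": { "output": 16384 }
        }
      }
    }
  }
}
```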

Testing:
Verified functionality with the following models: gpt-oss-120b, qwen-235, zai-glm4.6.

@rekram1-node
Collaborator

Wouldn’t this kinda neuter a lot of models?

Can you explain why you need this? Models like gpt-oss have a 32k max completion output, and opencode should be respecting that…

What kind of plan are you on where you get rate-limited?

@sebastiand-cerebras
Contributor Author

Cerebras handles rate limiting differently from most providers. It estimates token usage upfront using the `max_completion_tokens` value, so if a client always sends 32k, each request is counted as if it might produce 32k tokens, even when the actual completion is much smaller. On Cerebras Code plans this causes users to hit rate limits very quickly in agentic coding workflows that make many short calls. A more conservative default like 8,192 tokens gives a much smoother experience without materially limiting typical code completions.
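The effect of upfront reservation can be sketched with simple arithmetic. This is a hypothetical model of the behavior described above, not Cerebras's actual limiter; the 1M-token-per-window budget is an assumed figure for illustration.

```python
def requests_before_limit(budget_tokens: int, max_completion_tokens: int) -> int:
    """Number of requests a limiter admits per window when it reserves
    the full max_completion_tokens quota for each request upfront,
    regardless of how many tokens the completion actually produces."""
    return budget_tokens // max_completion_tokens

# Assumed per-window token budget (illustrative only).
budget = 1_000_000

# With a 32k cap, the budget is exhausted after far fewer requests
# than with an 8k cap, even if every completion is short.
print(requests_before_limit(budget, 32_768))  # 30 requests per window
print(requests_before_limit(budget, 8_192))   # 122 requests per window
```

Under these assumptions, lowering the advertised cap from 32k to 8k roughly quadruples the number of short agentic calls that fit in one rate-limit window.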

