Skip to content

Raise hosted context window 64K -> 128K#30

Merged
0xadvait merged 1 commit into
mainfrom
context-window-128k
Jun 24, 2026
Merged

Raise hosted context window 64K -> 128K#30
0xadvait merged 1 commit into
mainfrom
context-window-128k

Conversation

@0xadvait

Copy link
Copy Markdown
Collaborator

The status bar showed 61.2K/65.5K because the config hard-set context_length: 65536 (the model block + the opengradient provider) -- a flat 64K default unrelated to the model. So the harness compacted at 64K even though the hosted models handle much more.

DeepSeek V4 Pro and Hermes 4 405B/70B are all >=128K native, so this bumps both hosted entries to 131072 (128K). Left the local ones (ollama_num_ctx, ollama-local) alone -- the abliterated local models have small windows.

Conservative on purpose: set to the safe native floor so we never send past the gateway's cap. If the OG gateway serves DeepSeek's full window (up to 1M), this can go higher.

The banner's '65.5K' came from a flat context_length: 65536 in the config (the
model block + the opengradient provider), not from the model -- so the harness
compacted at 64K even though every hosted model handles far more. DeepSeek V4
Pro and Hermes 4 405B/70B are all >=128K native, so bump both hosted entries to
131072. Conservative on purpose (won't over-send past the gateway); can go
higher if the gateway serves the full window. Local (ollama) windows unchanged.
@0xadvait 0xadvait merged commit 6e7bc5d into main Jun 24, 2026
@0xadvait 0xadvait deleted the context-window-128k branch June 24, 2026 21:08
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant