Raise hosted context window to 1M (DeepSeek V4 Pro)#37

Merged
0xadvait merged 1 commit into main from context-1m on Jun 26, 2026

@0xadvait

Collaborator

The chat-app, which uses the same TEE gateway, advertises deepseek-v4-pro and hermes-4-405b at a 1024K context window, so 1M is genuinely served. The previous 128K was a conservative floor, set when I couldn't confirm the gateway's cap. This PR bumps the two hosted context_length entries to 1048576 (1024 × 1024); local (ollama) windows are unchanged. The engine resolves context per model and uses this value as the fallback, so the change is safe for both deepseek-v4-pro and the hermes-4-405b fallback.
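
For readers unfamiliar with the engine, here is a minimal Python sketch of what "per-model resolution with a fallback" could look like. It is illustrative only and is not the engine's actual code: `resolve_context_length`, `HOSTED_CONTEXT_LENGTH`, `PREVIOUS_FLOOR`, and the `advertised` mapping are hypothetical names, and the local window in the usage lines is a made-up value. Only the numbers 1048576 and 128K and the two model names come from this PR.

```python
# Hypothetical sketch: names below are assumptions, not the engine's real API.

HOSTED_CONTEXT_LENGTH = 1024 * 1024   # 1048576, the hosted value set in this PR
PREVIOUS_FLOOR = 128 * 1024           # 131072, the earlier conservative 128K floor


def resolve_context_length(model, advertised, fallback=HOSTED_CONTEXT_LENGTH):
    """Return the window advertised for `model`, or `fallback` if none is known.

    `advertised` maps model names to context lengths; missing or
    non-positive entries are treated as unknown.
    """
    value = advertised.get(model)
    if value is None or value <= 0:
        return fallback
    return value


# Usage: a model with no per-model entry gets the hosted fallback,
# while a model with its own entry keeps that entry.
advertised = {"hermes-4-405b": 1048576, "some-local-model": 8192}  # 8192 is made up
deepseek_window = resolve_context_length("deepseek-v4-pro", advertised)
hermes_window = resolve_context_length("hermes-4-405b", advertised)
local_window = resolve_context_length("some-local-model", advertised)
```

Under these assumptions the fallback only applies when no per-model value exists, which is why raising it leaves models with their own entries (such as local ones) untouched.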

The chat-app (same TEE gateway) advertises both deepseek-v4-pro and
hermes-4-405b at a 1024K context window, so 1M is actually served. The earlier
128K was a conservative floor to avoid over-sending past an unknown cap; with
the cap confirmed, set the hosted context_length to 1048576. Local windows
unchanged.
@0xadvait 0xadvait merged commit 6494da8 into main Jun 26, 2026
@0xadvait 0xadvait deleted the context-1m branch June 26, 2026 14:19