fix(vlm): omit unset default max_tokens by qin-ctx · Pull Request #2949 · volcengine/OpenViking

qin-ctx · 2026-07-02T08:03:54Z

Description

Remove the remaining hardcoded VLM max_tokens=32768 fallbacks outside the OpenAI backend. When vlm.max_tokens is not configured, these backends now omit the token-limit parameter and let the selected model/provider apply its own default.

Related Issue

Fixes #2751

Follow-up to #2946, which already removed the OpenAI backend fallback.

Type of Change

Bug fix (non-breaking change that fixes an issue)
New feature (non-breaking change that adds functionality)
Breaking change (fix or feature that would cause existing functionality to not work as expected)
Documentation update
Refactoring (no functional changes)
Performance improvement
Test update

Changes Made

Stop sending max_tokens when vlm.max_tokens is unset in LiteLLM and VolcEngine VLM backends.
Remove the Kimi-specific default max_tokens constant and fallback.
Remove the stale Kimi unit-test assertion for the old default.

Testing

I have added tests that prove my fix is effective or that my feature works
New and existing unit tests pass locally with my changes
I have tested this on the following platforms:
- Linux
- macOS
- Windows

Not run locally; this is a small request-parameter change and the earlier uv run validation was interrupted before completion.

Checklist

My code follows the project's coding style
I have performed a self-review of my code
I have commented my code, particularly in hard-to-understand areas
I have made corresponding changes to the documentation
My changes generate no new warnings
Any dependent changes have been merged and published

Screenshots (if applicable)

N/A

Additional Notes

Leaving the parameter unset avoids hardcoding one completion-token budget for models and providers with different limits.

github-project-automation Bot added this to OpenViking project Jul 2, 2026

github-project-automation Bot moved this to Backlog in OpenViking project Jul 2, 2026

qin-ctx added 2 commits July 2, 2026 16:05

fix(vlm): omit unset LiteLLM max_tokens

1759c32

fix(vlm): remove provider max_tokens fallbacks

2b1dacb

qin-ctx force-pushed the fix/vlm-omit-default-max-tokens branch from 2872791 to 2b1dacb Compare July 2, 2026 08:05

chenjw approved these changes Jul 2, 2026

View reviewed changes

chenjw merged commit 2bf9e49 into main Jul 2, 2026
3 checks passed

github-project-automation Bot moved this from Backlog to Done in OpenViking project Jul 2, 2026

chenjw deleted the fix/vlm-omit-default-max-tokens branch July 2, 2026 09:07

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

fix(vlm): omit unset default max_tokens#2949

fix(vlm): omit unset default max_tokens#2949
chenjw merged 2 commits into
mainfrom
fix/vlm-omit-default-max-tokens

qin-ctx commented Jul 2, 2026 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Conversation

qin-ctx commented Jul 2, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Related Issue

Type of Change

Changes Made

Testing

Checklist

Screenshots (if applicable)

Additional Notes

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

qin-ctx commented Jul 2, 2026 •

edited

Loading