Skip to content

fix: honor visual infer batch size#1

Open
sufubao wants to merge 1 commit into
mainfrom
fix/vit-max-batch-size
Open

fix: honor visual infer batch size#1
sufubao wants to merge 1 commit into
mainfrom
fix/vit-max-batch-size

Conversation

@sufubao
Copy link
Copy Markdown
Owner

@sufubao sufubao commented May 6, 2026

Summary

  • Fix ViT worker max_batch_size calculation so --visual_infer_batch_size is honored per visual DP rank
  • Update the nearby kvargs example comment to match the runtime calculation

Problem

The previous code used min(self.infer_batch_size // self.vit_dp, 1), which clamps every valid configuration to 1 because startup validation already requires infer_batch_size // vit_dp >= 1.

Verification

  • python -m py_compile lightllm/server/visualserver/manager.py lightllm/server/visualserver/visual_only_manager.py lightllm/server/visualserver/model_infer/model_rpc.py
  • git diff --check

sufubao added a commit that referenced this pull request Jun 8, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant