
fix: use _max_concurrent_semantic for Semantic queue worker instead of hardcoded 1 #877

Closed
r266-tech wants to merge 1 commit into volcengine:main from r266-tech:fix/semantic-queue-concurrency

Conversation

@r266-tech
Contributor

Summary

Fixes #873

The `_max_concurrent_semantic` variable was stored in `QueueManager.__init__` but never used in `_start_queue_worker`. As a result, the Semantic queue worker always ran with `max_concurrent=1`, ignoring the configured `vlm.max_concurrent` value for queue-level task concurrency.

Change

Before:

```python
max_concurrent = self._max_concurrent_embedding if queue.name == self.EMBEDDING else 1
```

After:

```python
max_concurrent = self._max_concurrent_embedding if queue.name == self.EMBEDDING else self._max_concurrent_semantic
```
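For context, here is a minimal sketch of how a queue worker might bound in-flight tasks with the selected limit. The worker loop, defaults, and queue names below are illustrative assumptions, not OpenViking's actual `_start_queue_worker` implementation:

```python
import asyncio

EMBEDDING = "embedding"  # assumed value of self.EMBEDDING

class QueueManager:
    def __init__(self, max_concurrent_embedding=10, max_concurrent_semantic=100):
        # Both limits are stored in __init__; the bug was that the
        # semantic one was never read when starting the worker.
        self._max_concurrent_embedding = max_concurrent_embedding
        self._max_concurrent_semantic = max_concurrent_semantic

    def _select_max_concurrent(self, queue_name):
        # The fixed line: fall back to the configured semantic limit, not 1.
        return (self._max_concurrent_embedding
                if queue_name == EMBEDDING
                else self._max_concurrent_semantic)

    async def run_worker(self, queue_name, tasks):
        # Bound concurrent task execution with a semaphore sized by the limit.
        sem = asyncio.Semaphore(self._select_max_concurrent(queue_name))

        async def run(task):
            async with sem:
                return await task()

        return await asyncio.gather(*(run(t) for t in tasks))
```

With the old hardcoded `1`, the semaphore for the Semantic queue would admit only one task at a time regardless of configuration.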

Impact

  • Semantic queue now respects the max_concurrent_semantic parameter (default: 100)
  • Users can configure Semantic queue concurrency through ov.conf via vlm.max_concurrent
  • Enables parallel processing of multiple pending Semantic tasks
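For illustration, the configuration mentioned above might look like the following in ov.conf. The section and key names are inferred from the PR description's `vlm.max_concurrent`, not verified against OpenViking's actual config schema:

```
[vlm]
max_concurrent = 100
```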

@github-actions

Failed to generate code suggestions for PR

The _max_concurrent_semantic variable was stored in QueueManager.__init__
but never used in _start_queue_worker. The Semantic queue worker always
had max_concurrent=1, ignoring the configured vlm.max_concurrent value.

Fixes volcengine#873
@r266-tech force-pushed the fix/semantic-queue-concurrency branch from b155230 to 2c3c22d on March 23, 2026 00:44
@CLAassistant

CLA assistant check
Thank you for your submission! We really appreciate it. Like many open source projects, we ask that you sign our Contributor License Agreement before we can accept your contribution.

zeattacker pushed a commit to zeattacker/OpenViking that referenced this pull request Mar 24, 2026
Per-message: skip expensive LLM overview generation when ≤5 files
changed and cached overview exists. Rebuild overview from summaries
without LLM call (0 LLM calls vs 3 batches × 10+ min each).

Daily at 04:00 WIB (21:00 UTC): full LLM regen for directories with
file count delta ≥ 5 since last run, keeping overviews coherent.

Also applies PR volcengine#877 fix: use _max_concurrent_semantic for queue
worker concurrency instead of hardcoded 1. Default set to 2 to
match llama.cpp parallel slots.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
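The caching rule in the commit message above (skip the LLM when ≤5 files changed and a cached overview exists, otherwise regenerate) can be sketched as follows. `should_regenerate_with_llm` and `rebuild_overview` are hypothetical helper names, not taken from the zeattacker fork:

```python
def should_regenerate_with_llm(changed_files, cached_overview, threshold=5):
    """Decide whether a directory overview needs a fresh LLM pass.

    Hypothetical helper reflecting the commit description: with a cached
    overview and a small change set, rebuild cheaply from per-file
    summaries instead of making any LLM calls.
    """
    if cached_overview is None:
        return True  # nothing cached: must generate
    return len(changed_files) > threshold  # small deltas reuse the cache

def rebuild_overview(summaries):
    # Cheap, LLM-free rebuild: join per-file summaries in sorted order.
    return "\n".join(f"- {path}: {text}" for path, text in sorted(summaries.items()))
```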
@zhoujh01
Collaborator

Thank you for your code. I just merged in the same fix (https://github.com/volcengine/OpenViking/pull/905), so I'm closing this PR for now.

@zhoujh01 zhoujh01 closed this Mar 24, 2026
@github-project-automation github-project-automation bot moved this from Backlog to Done in OpenViking project Mar 24, 2026


Development

Successfully merging this pull request may close these issues.

[Bug]: _max_concurrent_semantic variable stored but not used in Semantic queue worker

4 participants