Chain source syncs sequentially#242

Merged
maperu merged 1 commit into master from maperu/chainSourceSyncs
Apr 1, 2026


maperu commented Apr 1, 2026

Since dd.45 fixed COA source syncs, lemur now actually completes syncs instead of failing immediately. With 20+ COA sources fetching certs from vault, the 2 celery workers get tied up during the bi-hourly sync_all_sources run. Lightweight minutely tasks like metrics reporting can't execute and get revoked by celery, and the revocation persists across pod restarts. As a result, the task monitors show "No Data" and declare incidents.

This PR chains the source syncs so they run one at a time, keeping the queue available for other tasks.

Companion PR for the worker concurrency bump: https://github.com/DataDog/k8s-resources/pull/145229

Source syncs are heavy operations. Dispatching all 20+ at once floods
the celery queue. Chaining them runs one at a time so the queue stays
available for other tasks.
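
To illustrate why chaining helps, here is a rough sketch that simulates a FIFO task queue and measures how long a lightweight task waits for a worker under each dispatch strategy. It is not the code from this PR and does not use Celery (where this kind of sequencing is typically expressed with `chain` and immutable `.si()` signatures). The function name `light_task_wait`, the 60-second sync duration, and the arrival time of the light task are all hypothetical values chosen for illustration.

```python
import heapq
from collections import deque
from itertools import count


def light_task_wait(num_syncs, sync_secs, workers, chained, light_at=1.0):
    """Simulate a FIFO queue; return how long the light task waits for a worker.

    All numbers are illustrative assumptions, not measurements from lemur.
    """
    seq = count()  # tie-breaker so same-time events keep insertion order
    events = []    # heap of (time, seq, kind, task); task = (name, index)

    def schedule(t, kind, task):
        heapq.heappush(events, (t, next(seq), kind, task))

    if chained:
        # Chained: only the first sync is enqueued up front.
        schedule(0.0, "enqueue", ("sync", 0))
    else:
        # Fan-out: every sync is enqueued at once.
        for i in range(num_syncs):
            schedule(0.0, "enqueue", ("sync", i))
    schedule(light_at, "enqueue", ("light", None))

    queue = deque()
    idle = workers
    while events:
        now, _, kind, task = heapq.heappop(events)
        if kind == "enqueue":
            queue.append(task)
        else:  # "finish": a worker is free again
            idle += 1
            name, i = task
            if chained and i + 1 < num_syncs:
                # The next link of the chain is enqueued only now.
                queue.append(("sync", i + 1))
        # Hand queued tasks to idle workers in FIFO order.
        while idle and queue:
            name, i = queue.popleft()
            if name == "light":
                return now - light_at
            idle -= 1
            schedule(now + sync_secs, "finish", (name, i))
    raise RuntimeError("light task never ran")


fan_out = light_task_wait(20, 60.0, workers=2, chained=False)
one_by_one = light_task_wait(20, 60.0, workers=2, chained=True)
print(f"fan-out wait: {fan_out}s, chained wait: {one_by_one}s")
```

Under these assumptions, the fan-out case keeps both workers busy until every sync has drained, while the chained case occupies at most one worker at a time, so the second worker can pick up the light task immediately. The trade-off is that the full sync run takes longer end to end, since the syncs no longer overlap.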
maperu force-pushed the maperu/chainSourceSyncs branch from 1e06f89 to 79fe283 April 1, 2026 15:39
maperu marked this pull request as ready for review April 1, 2026 15:40
maperu requested review from a team as code owners April 1, 2026 15:40
3 participants