Fix credential refresh race during worker activation by EDsCODE · Pull Request #567 · PostHog/duckgres

EDsCODE · 2026-05-18T21:18:07Z

Summary

Skip reserved/activating workers in the credential refresh due-worker query
Prevent the refresh scheduler from bumping owner_epoch while first ActivateTenant is still in flight
Add regression coverage for reserved/activating rows with NULL or past-due credential expiry

Context

In the 24-client same-org burst QA, several clients failed with:

same-tenant takeover requires newer owner epoch 1 (current N)

During burst activation, worker rows become org-bound before the first ActivateTenant RPC completes and before s3_credentials_expires_at is stamped. The refresh scheduler treated those NULL-expiry reserved/activating rows as immediately due, bumped owner_epoch, and could race the original activation.

Tests

go test ./tests/configstore -run 'TestListWorkersDueForCredentialRefresh|TestMarkCredentialsRefreshed'
go test -tags kubernetes ./controlplane -run 'TestCredentialRefreshScheduler|TestK8sPoolActivateReservedWorker|TestSharedWorkerActivator'

Fix credential refresh race during worker activation

fb072b3

EDsCODE merged commit 31c36e9 into main May 18, 2026
22 checks passed

EDsCODE deleted the eric/skip-refresh-pending-workers branch May 18, 2026 21:25

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix credential refresh race during worker activation#567

Fix credential refresh race during worker activation#567
EDsCODE merged 1 commit into
mainfrom
eric/skip-refresh-pending-workers

EDsCODE commented May 18, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

EDsCODE commented May 18, 2026

Summary

Context

Tests

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant