Optimize write checkpoint lookups when users have none #355

rkistner · 2025-09-04T11:55:23Z

#276 included some significant optimizations to how write checkpoint lookups are performed. In general, the strategy is:

1. We watch the checkpoint_events capped collection for new checkpoints.
2. Each time we get a new checkpoint, we look up the buckets, parameter lookups, and write checkpoints that changed since the last checkpoint.
3. For each user's stream, we use those changes as a filter to check whether we need to query further data for that user.
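As a rough illustration of the filtering step, the sketch below checks one stream against the set of changes collected for a new checkpoint. All names here (`CheckpointChanges`, `StreamState`, `needsQuery`) are hypothetical and chosen for illustration; they are not the service's actual types.

```typescript
// Hypothetical shapes for illustration; not the actual PowerSync service types.
interface CheckpointChanges {
  changedBuckets: Set<string>;
  changedParameterLookups: Set<string>;
  // Users whose write checkpoint changed since the last checkpoint.
  changedWriteCheckpointUsers: Set<string>;
}

interface StreamState {
  userId: string;
  buckets: Set<string>;
  parameterLookups: Set<string>;
}

// True if the two sets share at least one element.
function intersects(a: Set<string>, b: Set<string>): boolean {
  let found = false;
  a.forEach((item) => {
    if (b.has(item)) {
      found = true;
    }
  });
  return found;
}

// Decide, per stream, which follow-up queries are needed for this checkpoint.
// Anything that reports false can be skipped without touching storage.
function needsQuery(stream: StreamState, changes: CheckpointChanges) {
  return {
    bucketData: intersects(stream.buckets, changes.changedBuckets),
    parameters: intersects(stream.parameterLookups, changes.changedParameterLookups),
    writeCheckpoint: changes.changedWriteCheckpointUsers.has(stream.userId)
  };
}
```

The point of the filter is that the changes are looked up once per checkpoint and shared across all streams, so a stream with no overlap does no per-user queries at all.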

When a new stream is opened, we need to get the initial write checkpoint for the user, before we can use the above approach to efficiently get changes. The implementation in #276 mostly handled that.

However, there was one missed case: if the user does not have any write checkpoint, the lookup was retried on every new checkpoint. With thousands of concurrent connections and no write checkpoints, we can end up doing tens of thousands of write checkpoint lookups per second. Even if the write checkpoint collection is empty, this still adds multiple megabytes/s of traffic between the instance and the storage database, just for sending the queries and receiving the empty results.

This fixes the issue by adding a boolean to keep track of whether or not we have done the initial query, instead of using a null check on the write checkpoint.

This change has no effect on users that do have a write checkpoint.
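To make the difference concrete, here is a minimal sketch contrasting the two approaches. It is a simplified model, not the code from this PR: `FakeStorage`, `NullCheckStream`, `FlagStream`, and `runCheckpoints` are hypothetical names, and the storage lookup is replaced by an in-memory map that counts calls.

```typescript
// Illustrative only: all names are hypothetical, not the service's actual code.

// Stand-in for the storage database; counts lookups so strategies can be compared.
class FakeStorage {
  lookups = 0;
  private stored: Map<string, number>;

  constructor(stored: Map<string, number>) {
    this.stored = stored;
  }

  lastWriteCheckpoint(userId: string): number | null {
    this.lookups++;
    const value = this.stored.get(userId);
    return value === undefined ? null : value;
  }
}

interface Stream {
  onCheckpoint(changedUsers: Set<string>): number | null;
}

// Old behavior: null means both "not queried yet" and "queried, none found",
// so a user without a write checkpoint is re-queried on every checkpoint.
class NullCheckStream implements Stream {
  private writeCheckpoint: number | null = null;
  private storage: FakeStorage;
  private userId: string;

  constructor(storage: FakeStorage, userId: string) {
    this.storage = storage;
    this.userId = userId;
  }

  onCheckpoint(changedUsers: Set<string>): number | null {
    if (this.writeCheckpoint === null || changedUsers.has(this.userId)) {
      this.writeCheckpoint = this.storage.lastWriteCheckpoint(this.userId);
    }
    return this.writeCheckpoint;
  }
}

// New behavior: a separate boolean records that the initial query has run,
// so an empty result is remembered like any other result.
class FlagStream implements Stream {
  private writeCheckpoint: number | null = null;
  private initialQueryDone = false;
  private storage: FakeStorage;
  private userId: string;

  constructor(storage: FakeStorage, userId: string) {
    this.storage = storage;
    this.userId = userId;
  }

  onCheckpoint(changedUsers: Set<string>): number | null {
    if (!this.initialQueryDone || changedUsers.has(this.userId)) {
      this.writeCheckpoint = this.storage.lastWriteCheckpoint(this.userId);
      this.initialQueryDone = true;
    }
    return this.writeCheckpoint;
  }
}

// Simulate `count` checkpoints in which no write checkpoint changed.
function runCheckpoints(stream: Stream, count: number): void {
  for (let i = 0; i < count; i++) {
    stream.onCheckpoint(new Set<string>());
  }
}
```

In this model, running 100 checkpoints against an empty store costs 100 lookups with the null check and 1 lookup with the flag. For a user who does have a write checkpoint, both variants query once initially and then only when that user appears in the changed set.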

changeset-bot · 2025-09-04T11:55:27Z

🦋 Changeset detected

Latest commit: 3269f76

The changes in this PR will be included in the next version bump.

This PR includes changesets to release 11 packages

Name	Type
@powersync/service-module-mongodb-storage	Patch
@powersync/service-core	Patch
@powersync/service-image	Patch
@powersync/service-schema	Patch
@powersync/service-module-mongodb	Patch
@powersync/service-module-mysql	Patch
@powersync/service-module-postgres	Patch
@powersync/service-core-tests	Patch
@powersync/service-module-core	Patch
@powersync/service-module-postgres-storage	Patch
test-client	Patch


simolus3

I'm happy with these changes 👍

rkistner added 3 commits September 4, 2025 13:39

Minor cleanup.

14d22c9

Only query write checkpoint once initially.

d448a5f

Changset.

3269f76

rkistner requested a review from simolus3 September 4, 2025 11:55

simolus3 approved these changes Sep 4, 2025

rkistner merged commit a2b8bb0 into main Sep 4, 2025
21 checks passed

rkistner deleted the optimize-write-checkpoint-lookups branch September 4, 2025 12:19
