Skip to content

Backfill use BatchedWriter#1187

Merged
michaeldjeffrey merged 4 commits into
mainfrom
mj/backfill-use-batched-write
May 7, 2026
Merged

Backfill use BatchedWriter#1187
michaeldjeffrey merged 4 commits into
mainfrom
mj/backfill-use-batched-write

Conversation

@michaeldjeffrey
Copy link
Copy Markdown
Contributor

Writing a single snapshot per backfilled file causes things to slow down quite dramatically.
Using the batched writer, we can queue writes, then flush them similar to file-store. With the same guarantees that we won't miss files if things fail because they're being persisted to disk.

This keeps backfill quite speedy, with intermittent pauses for flushing to iceberg.

@michaeldjeffrey michaeldjeffrey requested a review from bbalser May 6, 2026 19:20
@michaeldjeffrey michaeldjeffrey force-pushed the mj/backfill-use-batched-write branch from 896bccb to 710feff Compare May 6, 2026 19:28
@michaeldjeffrey michaeldjeffrey force-pushed the mj/backfill-use-batched-write branch from 710feff to 029cfe9 Compare May 6, 2026 19:51
@michaeldjeffrey michaeldjeffrey merged commit ade63ae into main May 7, 2026
29 checks passed
@michaeldjeffrey michaeldjeffrey deleted the mj/backfill-use-batched-write branch May 7, 2026 19:43
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants