fix: improve failover resilience and observability by psteinroe · Pull Request #49 · psteinroe/postgres-stream

psteinroe · 2026-04-05T10:28:40Z

- Graceful mid-replay error handling: Sink errors during failover replay now abort the replay cleanly instead of propagating, keeping the stream in failover so the replay is retried on the next batch.
- Skip table sync copies: Changed `TableSyncCopyConfig` to `SkipAllTables` because `write_table_rows` is a no-op, which avoids unnecessary work during initial sync.
- Improved failure-path logging: Upgraded info → warn with structured fields (`checkpoint_event_id`, `error`) for all failover/failure paths, and added a startup warning when the stream begins in failover mode.

Port incident fixes from getmateo fork: gracefully handle sink errors during failover replay, skip unnecessary table sync copies, and upgrade failure-path logging from info to warn with structured fields.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

- Reverts the error-swallowing behavior introduced in #49 for sink failures during failover replay
- Returning `Ok(())` on sink error would let the stream complete recovery and mark itself `Healthy`, even though not all events were replayed, causing data loss with flaky destinations
- Propagating the error with `?` lets the destination's retry logic handle transient failures

Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
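The difference between swallowing and propagating a sink error can be sketched as follows. This is a minimal illustration, not the actual postgres-stream code: `StreamState`, `SinkError`, `FlakySink`, `replay_swallowing`, `replay_propagating`, and `recover` are all hypothetical names, and the caller is assumed to mark the stream `Healthy` whenever replay returns `Ok(())`.

```rust
// Hypothetical types for illustration only; not postgres-stream's real API.
#[derive(Debug, Clone, Copy, PartialEq)]
enum StreamState {
    Healthy,
    Failover,
}

#[derive(Debug, PartialEq)]
struct SinkError(String);

/// Stand-in destination that rejects one specific event id.
struct FlakySink {
    fail_on: u64,
    delivered: Vec<u64>,
}

impl FlakySink {
    fn publish(&mut self, event_id: u64) -> Result<(), SinkError> {
        if event_id == self.fail_on {
            return Err(SinkError(format!("sink rejected event {}", event_id)));
        }
        self.delivered.push(event_id);
        Ok(())
    }
}

/// Swallowing variant: stops at the first sink error but still reports success.
fn replay_swallowing(sink: &mut FlakySink, events: &[u64]) -> Result<(), SinkError> {
    for &id in events {
        if sink.publish(id).is_err() {
            return Ok(()); // the failure is hidden from the caller
        }
    }
    Ok(())
}

/// Propagating variant: `?` returns the first sink error to the caller.
fn replay_propagating(sink: &mut FlakySink, events: &[u64]) -> Result<(), SinkError> {
    for &id in events {
        sink.publish(id)?;
    }
    Ok(())
}

/// Assumed caller behavior: the stream becomes Healthy only if replay succeeded.
fn recover(replay_result: &Result<(), SinkError>) -> StreamState {
    match replay_result {
        Ok(()) => StreamState::Healthy,
        Err(_) => StreamState::Failover,
    }
}
```

Under these assumptions, replaying events `[1, 2, 3]` against a sink that fails on event 2 delivers only event 1 in both variants. The swallowing variant ends up `Healthy` with events 2 and 3 never delivered, while the propagating variant stays in `Failover` so the events can be retried.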

psteinroe merged commit 40a0115 into main Apr 5, 2026
6 checks passed

psteinroe mentioned this pull request Apr 5, 2026

fix: propagate sink errors during failover replay #50

Merged

psteinroe merged 1 commit into main from fix/incident-failover-improvements
