fix(podlogs): Follow mode exit bug and test improvements by mnencia · Pull Request #9599 · cloudnative-pg/cloudnative-pg

mnencia · 2025-12-29T16:34:12Z

The test "should catch extra logs if given the follow option" was flaky in CI because it tested implementation details (counting loop iterations with tight timing) rather than actual behavior.

Redesigned the test to verify what Follow=true actually does: it keeps streaming until the context is cancelled. The test now uses Eventually/Consistently patterns that handle timing variations gracefully, making it robust across different environments.

The improved test exposed a bug in the production code: the streaming function would exit when all current log streams completed, even when Follow=true was set. This caused premature exit instead of continuing to poll for new or restarted pods. Fixed by changing the exit condition to only apply when Follow=false.

github-actions · 2025-12-29T16:34:21Z

❗ By default, the pull request is configured to backport to all release branches.

To stop backporting this pr, remove the label: backport-requested ◀️ or add the label 'do not backport'
To stop backporting this pr to a certain release branch, remove the specific branch label: release-x.y

mnencia · 2025-12-29T16:51:56Z

/ok-to-merge test only change

mnencia · 2026-01-10T22:35:15Z

One of the executions of the improved test failed in CI: https://github.com/cloudnative-pg/cloudnative-pg/actions/runs/20791171388/job/59713483662?pr=9599

The failure exposed a bug in cluster_writer.go: the streaming function would exit when streamSet.isZero() returned true, even with Follow=true enabled. This violated the Follow semantics which should keep polling for new/restarted pods.

The race occurred because log streams could complete before the main loop checked for active streams, causing premature exit. Fixed by adding !csr.Options.Follow to the exit condition, ensuring Follow mode continues looping as intended.

mnencia · 2026-01-10T22:35:50Z

/test

github-actions · 2026-01-10T22:35:59Z

@mnencia, here's the link to the E2E on CNPG workflow run: https://github.com/cloudnative-pg/cloudnative-pg/actions/runs/20885430224

The Follow=true test was flaky in CI because it tested implementation details (counting loop iterations with tight timing) rather than actual behavior. Redesigned the test to verify what Follow=true actually does: it keeps streaming until the context is cancelled, rather than exiting after one read. The test now checks that streaming starts, continues running without exiting on its own, and only stops when we cancel the context. This approach uses Eventually/Consistently patterns that handle timing variations gracefully, making it robust across different environments. Signed-off-by: Marco Nenciarini <marco.nenciarini@enterprisedb.com>

Fixed a bug where Follow=true would exit prematurely when all current log streams completed, instead of continuing to poll for new or restarted pods. Changed the exit condition on zero active streams to only apply when Follow=false, allowing Follow mode to keep looping as intended. Signed-off-by: Marco Nenciarini <marco.nenciarini@enterprisedb.com>

The test "should catch extra logs if given the follow option" was flaky in CI because it tested implementation details (counting loop iterations with tight timing) rather than actual behavior. Redesigned the test to verify what Follow=true actually does: it keeps streaming until the context is cancelled. The test now uses Eventually/Consistently patterns that handle timing variations gracefully, making it robust across different environments. The improved test exposed a bug in the production code: the streaming function would exit when all current log streams completed, even when Follow=true was set. This caused premature exit instead of continuing to poll for new or restarted pods. Fixed by changing the exit condition to only apply when Follow=false. Signed-off-by: Marco Nenciarini <marco.nenciarini@enterprisedb.com> (cherry picked from commit 76817ec)

Signed-off-by: Marco Nenciarini <marco.nenciarini@enterprisedb.com>

mnencia requested a review from a team as a code owner December 29, 2025 16:34

dosubot bot added the size:M This PR changes 30-99 lines, ignoring generated files. label Dec 29, 2025

cnpg-bot added backport-requested ◀️ This pull request should be backported to all supported releases release-1.25 release-1.27 release-1.28 labels Dec 29, 2025

NiccoloFei force-pushed the dev/fix-flaky-podlogs-test branch from d6cc96d to 027c158 Compare December 29, 2025 16:36

mnencia added the no-issue label Dec 29, 2025

mnencia added the ok to merge 👌 This PR can be merged label Dec 29, 2025

armru approved these changes Dec 30, 2025

View reviewed changes

dosubot bot added the lgtm This PR has been approved by a maintainer label Dec 30, 2025

mnencia force-pushed the dev/fix-flaky-podlogs-test branch from 027c158 to 2bda92a Compare December 30, 2025 12:17

mnencia force-pushed the dev/fix-flaky-podlogs-test branch from 2bda92a to 73a08a5 Compare January 7, 2026 17:55

mnencia changed the title ~~test(podlogs): make Follow=true test robust and behavior-focused~~ fix(podlogs): Follow mode exit bug and test improvements Jan 10, 2026

mnencia force-pushed the dev/fix-flaky-podlogs-test branch from 3203816 to a48a08c Compare January 10, 2026 22:35

mnencia force-pushed the dev/fix-flaky-podlogs-test branch from a48a08c to 22baf8e Compare January 10, 2026 22:42

leonardoce approved these changes Jan 12, 2026

View reviewed changes

mnencia added 2 commits January 12, 2026 17:03

leonardoce force-pushed the dev/fix-flaky-podlogs-test branch from 22baf8e to 53265c6 Compare January 12, 2026 16:03

leonardoce merged commit 76817ec into main Jan 12, 2026
34 checks passed

leonardoce deleted the dev/fix-flaky-podlogs-test branch January 12, 2026 16:19

mnencia added a commit that referenced this pull request Feb 5, 2026

chore: move #9599 to cnpg plugin section

dba87bd

Signed-off-by: Marco Nenciarini <marco.nenciarini@enterprisedb.com>

mnencia added a commit that referenced this pull request Feb 5, 2026

chore: move #9599 to cnpg plugin section

cd336a3

Signed-off-by: Marco Nenciarini <marco.nenciarini@enterprisedb.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix(podlogs): Follow mode exit bug and test improvements#9599

fix(podlogs): Follow mode exit bug and test improvements#9599
leonardoce merged 2 commits intomainfrom
dev/fix-flaky-podlogs-test

mnencia commented Dec 29, 2025 •

edited

Loading

Uh oh!

github-actions bot commented Dec 29, 2025

Uh oh!

mnencia commented Dec 29, 2025 •

edited

Loading

Uh oh!

mnencia commented Jan 10, 2026 •

edited

Loading

Uh oh!

mnencia commented Jan 10, 2026

Uh oh!

github-actions bot commented Jan 10, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

mnencia commented Dec 29, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

github-actions bot commented Dec 29, 2025

Uh oh!

mnencia commented Dec 29, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

mnencia commented Jan 10, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

mnencia commented Jan 10, 2026

Uh oh!

github-actions bot commented Jan 10, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

mnencia commented Dec 29, 2025 •

edited

Loading

mnencia commented Dec 29, 2025 •

edited

Loading

mnencia commented Jan 10, 2026 •

edited

Loading