Skip to content

Conversation

@xinyuiscool
Copy link
Contributor

@xinyuiscool xinyuiscool commented Aug 21, 2024

Previous pr #1705 adds the support of preventing watermark stall during lagging upstream tasks. In testing we found a bug introduced by the logic that the aggregated watermark is not advancing if the smallest watermark have been updated, but the updated tasks within the idle timeout doesn't meet quorum. The fix is to keep the previous logic if the aggregated watermark has advanced, otherwise invoking the logic for computing quorum.

Also changed the quorum count to be a metric instead of logging, due to the potential large number of logs.

Copy link

@becketqin becketqin left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM.

@xinyuiscool xinyuiscool merged commit 3cb57c2 into apache:master Aug 21, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants