KAFKA-13499: Avoid restoring outdated records by gabriellefu · Pull Request #22115 · apache/kafka

gabriellefu · 2026-04-22T03:41:18Z

Expose the retention period through storeMetadata.
In prepareChangelogs(), when no checkpoint exists, seek to a specific
timestamp instead of always seeking to the beginning, to avoid restoring
outdated records.
Change from the :
Instead of the wall clock, use the latest timestamp in the changelog as
the current time, and seek to the timestamp
latest_changelog_timestamp - retention_period.
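
To make the arithmetic concrete, here is a minimal sketch of how the restore start timestamp could be derived. The class and method names are hypothetical and are not taken from the PR; the retention guard mirrors the condition in the reviewed diff, and clamping at zero is an assumption of this sketch. In a real restore consumer the result could then be handed to Consumer#offsetsForTimes, but whether the PR does exactly that is not shown in this thread.

```java
import java.util.OptionalLong;

// Hypothetical helper, for illustration only (not the code from the PR).
final class RestoreStartSketch {

    // Returns the timestamp to seek to, or empty when a timestamp-based seek
    // does not apply and the caller should keep its existing behaviour.
    static OptionalLong restoreStartTimestamp(final long latestChangelogTimestamp,
                                              final long retentionPeriodMs) {
        // No bounded retention: nothing in the changelog can be skipped.
        if (retentionPeriodMs <= 0 || retentionPeriodMs == Long.MAX_VALUE) {
            return OptionalLong.empty();
        }
        // Assumption: a negative timestamp means "no latest record known".
        if (latestChangelogTimestamp < 0) {
            return OptionalLong.empty();
        }
        // Use the changelog's own latest timestamp as "now" rather than the
        // wall clock, and never go below zero.
        return OptionalLong.of(Math.max(0L, latestChangelogTimestamp - retentionPeriodMs));
    }
}
```

With a latest changelog timestamp of 1000 and a retention period of 300, the sketch yields 700; records older than that fall outside the retention window and would not need to be restored.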

Reviewers: TengYao Chi <frankvicky@apache.org>, Bill Bejeck
<bbejeck@apache.org>

gabriellefu · 2026-04-22T04:00:17Z

The previously failing smoke test now passes.

frankvicky

@gabriellefu Thanks for the PR.
Please run ./gradlew clean spotlessApply to fix the CI failure.

bbejeck

Thanks @gabriellefu I made a pass

bbejeck · 2026-04-30T18:44:12Z

-
-                newPartitionsWithoutStartOffset.add(partition);
+                final long retentionPeriod = storeMetadata.retentionPeriod();
+                if (retentionPeriod > 0 && retentionPeriod != Long.MAX_VALUE) {


@gabriellefu I was playing around some more and I think I found something else: new standby tasks won't have a valid endOffset, so they need to be filtered out. Otherwise, because the restore consumer uses auto.offset.reset=none, every batched partition falls back to seek-to-beginning.

So we can update the if block to this:
if (retentionPeriod > 0 && retentionPeriod != Long.MAX_VALUE && endOffset != null && endOffset > 0)

I used restoreConsumer.endOffsets() now, which should solve the standby task problem.
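
The guard discussed in this comment can be sketched as a small decision function. This is an illustration under stated assumptions, not the merged code: the class, enum, and method names are hypothetical, and the sketch assumes that any partition failing the guard keeps the pre-existing seek-to-beginning path. The endOffset argument stands for the value a call such as restoreConsumer.endOffsets() would report for the partition, or null when none is known.

```java
// Hypothetical helper, for illustration only (not the code from the PR).
final class SeekDecisionSketch {

    enum Strategy { SEEK_TO_TIMESTAMP, SEEK_TO_BEGINNING }

    static Strategy choose(final long retentionPeriod, final Long endOffset) {
        // Only stores with a bounded retention period can skip old records.
        final boolean boundedRetention =
            retentionPeriod > 0 && retentionPeriod != Long.MAX_VALUE;
        // New standby tasks may not have a valid end offset yet; an empty
        // changelog (end offset 0) has no timestamp to seek by either.
        final boolean hasRecords = endOffset != null && endOffset > 0;

        return boundedRetention && hasRecords
            ? Strategy.SEEK_TO_TIMESTAMP
            : Strategy.SEEK_TO_BEGINNING;
    }
}
```

Keeping the decision per partition means one partition without a usable end offset does not push the other partitions in the batch back to a full restore.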

bbejeck · 2026-05-04T16:37:41Z

System tests pass for this PR

bbejeck

Thanks @gabriellefu ! LGTM

bbejeck · 2026-05-04T17:14:55Z

Merged #22115 into trunk

gabriellefu added 11 commits April 21, 2026 17:36

restore window code

04b9fc7

revert some space

87ed81f

revert StoreChangelogReader.java

b4a9ff0

revert ProcessorStateManager.java

8d7418f

streams/src/main/java/org/apache/kafka/streams/state/internals/Abstra…

4b16700

…ctRocksDBSegmentedBytesStore.java revert

streams/src/test/java/org/apache/kafka/streams/processor/internals/St…

feff5e6

…oreChangelogReaderTest.java

adding the interface

76d3d94

adding WithRetentionPeriod

d7c0b72

fix the bug cause by the gap between wall clock and streams clock

72413c0

fix the bug cause by the gap between wall clock and streams clock

4c5feae

use the latest timestamp as the time to restore

4d61fb0

github-actions Bot added triage PRs from the community streams clients labels Apr 22, 2026

gabriellefu added 3 commits April 22, 2026 00:12

cleanup some code

ee9aee2

one space

55535e3

simply the way of getting the max_timestamp

045b0f2

frankvicky added the ci-approved label Apr 22, 2026

frankvicky reviewed Apr 22, 2026

format

c0bf24e

github-actions Bot removed the triage PRs from the community label Apr 23, 2026

bbejeck requested changes Apr 28, 2026

Comment thread streams/src/main/java/org/apache/kafka/streams/processor/internals/StoreChangelogReader.java Outdated

fix new standby task npe

5dc78fa

gabriellefu requested a review from bbejeck April 29, 2026 14:49

bbejeck reviewed Apr 30, 2026

filter out standby task

3c1f8d1

gabriellefu force-pushed the restoring_window branch 2 times, most recently from 200526e to 3c1f8d1 May 1, 2026 19:58

restoreConsumer.endOffsets() + poll for standby

519a2d4

gabriellefu added 5 commits May 1, 2026 19:29

use seek to end to get the last offset

02a61e6

use restoreConsumer.endOffsets(partitions)

1e8d5c4

add seek to beginning

2ba2da2

change the name to make it less confusing

0b0a866

fix the bug

c2ec3be

bbejeck approved these changes May 4, 2026

bbejeck merged commit 94b6886 into apache:trunk May 4, 2026
22 checks passed

bbejeck merged 23 commits into apache:trunk from gabriellefu:restoring_window
