
k/group: recover leader epoch on leader change #17260

Merged

Conversation

nvartolomei
Contributor

@nvartolomei commented Mar 22, 2024

From my understanding, this can cause a problem only when write caching is enabled.

It could also apply to ACKS=1 in an edge case, but I haven't thought that through. We make data available only after it has been replicated to a majority, so it is very unlikely for a truncation to happen after that point and trigger KIP-320.

This was discovered while testing the write caching feature. After a leadership change or node restart we would reply with the default field value -2147483648, which breaks the KIP-320 logic.

check_leader_epoch in redpanda treats negative epoch values as "not set" and, I believe, franz-go behaves the same.

As a result, KIP-320 fencing is not applied and the client ends up with an OFFSET_OUT_OF_RANGE error.
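
For context, the sketch below is a minimal, illustrative C++ approximation (not Redpanda's actual code; `normalize_epoch` and `should_fence_fetch` are made-up names) of how a check_leader_epoch-style rule that treats negative epochs as "not set" ends up skipping KIP-320 fencing whenever the broker replies with the int32 minimum default:

```cpp
#include <cstdint>
#include <iostream>
#include <optional>

// Illustrative sentinel: the int32 minimum (-2147483648) that the broker
// was returning for the committed leader epoch after a leadership change.
constexpr int32_t unset_epoch_sentinel = INT32_MIN;

// Hypothetical helper mirroring a check_leader_epoch-style rule:
// negative epochs are treated as "not set".
std::optional<int32_t> normalize_epoch(int32_t epoch) {
    if (epoch < 0) {
        return std::nullopt; // "not set": epoch checks are skipped
    }
    return epoch;
}

// Simplified fencing decision (a real implementation distinguishes
// fenced vs. unknown epochs). With no usable epoch the broker cannot
// detect a stale fetch position, so a truncated log surfaces to the
// client as OFFSET_OUT_OF_RANGE instead of a KIP-320 truncation event.
bool should_fence_fetch(int32_t client_epoch, int32_t current_epoch) {
    auto epoch = normalize_epoch(client_epoch);
    if (!epoch) {
        return false;
    }
    return *epoch != current_epoch;
}

int main() {
    std::cout << std::boolalpha
              // Committed at epoch 2, but recovery handed back the
              // sentinel, so fencing never fires:
              << should_fence_fetch(unset_epoch_sentinel, 3) << '\n' // false
              << should_fence_fetch(2, 3) << '\n';                   // true
}
```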

Backports Required

  • none - not a bug fix
  • none - this is a backport
  • none - issue does not exist in previous branches
  • none - papercut/not impactful enough to backport
  • v23.3.x
  • v23.2.x

Release Notes

  • none

@vbotbuildovich
Collaborator

new failures in https://buildkite.com/redpanda/redpanda/builds/46616#018e668c-9526-49f5-97e6-de141da9ca44:

"rptest.tests.partition_move_interruption_test.PartitionMoveInterruption.test_cancelling_partition_move.replication_factor=3.unclean_abort=True.recovery=restart_recovery.compacted=False"

@nvartolomei
Contributor Author

/dt
skip-redpanda-build
dt-repeat=10
tests/rptest/tests/partition_move_interruption_test.py::PartitionMoveInterruption.test_cancelling_partition_move

@nvartolomei
Contributor Author

Failing test with the verifiable consumer, without write caching... https://ci-artifacts.dev.vectorized.cloud/redpanda/46616/018e668c-9526-49f5-97e6-de141da9ca44/vbuild/ducktape/results/final/report.html

[2024-03-22 14:47:42,280] INFO [Consumer clientId=consumer-test_group-1, groupId=test_group] Fetch position FetchPosition{offset=6230, offsetEpoch=Optional[2], currentLeader=LeaderAndEpoch{leader=Optional[docker-rp-2:9092 (id: 4 rack: null)], epoch=absent}} is out of range for partition topic-peirabobwf-11, resetting offset (org.apache.kafka.clients.consumer.internals.Fetcher)
[2024-03-22 14:47:42,805] INFO [Consumer clientId=consumer-test_group-1, groupId=test_group] Resetting offset for partition topic-peirabobwf-11 to position FetchPosition{offset=0, offsetEpoch=Optional.empty, currentLeader=LeaderAndEpoch{leader=Optional[docker-rp-2:9092 (id: 4 rack: null)], epoch=absent}}. (org.apache.kafka.clients.consumer.internals.SubscriptionState)
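
The log above shows the Java consumer falling back to an offset reset because its fetch position has no usable leader epoch. As a rough sketch of the idea behind this change (the `committed_offset` struct and `recover_offsets` helper are invented for illustration and are not Redpanda's actual structures), group recovery should carry the committed leader epoch through to the rebuilt state instead of letting it collapse to the unset default:

```cpp
#include <cstdint>
#include <iostream>
#include <string>
#include <unordered_map>
#include <utility>
#include <vector>

// Illustrative committed-offset record; real group metadata carries more
// fields (metadata string, commit timestamp, ...).
struct committed_offset {
    int64_t offset;
    int32_t leader_epoch;
};

// Hypothetical recovery step: rebuild in-memory group state from replayed
// offset-commit records, carrying the committed leader epoch through
// instead of letting it fall back to the -2147483648 "not set" default.
std::unordered_map<std::string, committed_offset> recover_offsets(
  const std::vector<std::pair<std::string, committed_offset>>& replayed) {
    std::unordered_map<std::string, committed_offset> state;
    for (const auto& [partition, record] : replayed) {
        state[partition] = record; // keep record.leader_epoch as committed
    }
    return state;
}

int main() {
    // A commit for the partition in the log above: offset 6230, epoch 2.
    std::vector<std::pair<std::string, committed_offset>> replayed{
      {"topic-peirabobwf-11", committed_offset{6230, 2}}};

    auto state = recover_offsets(replayed);
    const auto& c = state["topic-peirabobwf-11"];
    std::cout << "offset=" << c.offset
              << " leader_epoch=" << c.leader_epoch << '\n';
    // Expected: offset=6230 leader_epoch=2, not the unset default.
}
```

With the epoch preserved, a KIP-320-aware client can validate its position via OffsetsForLeaderEpoch after a leader change instead of resetting according to its auto.offset.reset policy.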

Member

@dotnwat left a comment


Nice. When you request reviewers, I think we'll want to get a few groups: bharath/michal and probably someone from the enterprise team.

@savex
Contributor

savex commented Mar 25, 2024

/ci-repeat 10
skip-redpanda-build
dt-repeat=2
tests/rptest/tests/partition_move_interruption_test.py::PartitionMoveInterruption.test_cancelling_partition_move

nvartolomei added a commit to nvartolomei/redpanda that referenced this pull request Mar 25, 2024
VerifiableConsumer is KIP-320 compliant now, at least to the degree the
Java Kafka client is.

This will facilitate testing of the redpanda implementation of KIP-320
in group recovery (redpanda-data#17260)
and data loss handling when write caching is enabled.
@nvartolomei force-pushed the nv/kafka-group-recover-leader-epoch branch from 8e82687 to 0ed5c6b on March 25, 2024 17:22
VerifiableConsumer is KIP-320 compliant now, at least to the degree the
Java Kafka client is.

This will facilitate testing of the redpanda implementation of KIP-320
in group recovery (redpanda-data#17260)
and data loss handling when write caching is enabled.
This adds a test for consumer group commits. The test also exposes a bug
in which the leader epoch is reset after a cluster restart; this is
fixed in a subsequent commit.
This was discovered while testing the write caching feature. After a
leadership change or node restart we would reply with the default field
value `-2147483648`, which breaks the KIP-320 logic.

`check_leader_epoch` in redpanda treats negative epoch values as "not
set" and, I believe, franz-go behaves the same.

As a result, KIP-320 fencing is not applied and the client ends up
with an `OFFSET_OUT_OF_RANGE` error.
Contributor

@graphcareful left a comment


LGTM, nice!

@nvartolomei merged commit 42bafee into redpanda-data:dev on Mar 26, 2024
21 checks passed
@vbotbuildovich
Collaborator

/backport v23.3.x

@vbotbuildovich
Collaborator

/backport v23.2.x

vbotbuildovich pushed a commit to vbotbuildovich/redpanda that referenced this pull request Mar 26, 2024
VerifiableConsumer is KIP-320 compliant now, at least to the degree the
Java Kafka client is.

This will facilitate testing of the redpanda implementation of KIP-320
in group recovery (redpanda-data#17260)
and data loss handling when write caching is enabled.

(cherry picked from commit 9316309)
vbotbuildovich pushed a commit to vbotbuildovich/redpanda that referenced this pull request Mar 26, 2024
VerifiableConsumer is KIP-320 compliant now, at least to the degree the
Java Kafka client is.

This will facilitate testing of the redpanda implementation of KIP-320
in group recovery (redpanda-data#17260)
and data loss handling when write caching is enabled.

(cherry picked from commit 9316309)