Fix dataloss issue in restarting-streams-at-rebalancing mode #473

vigoo · 2022-05-31T08:23:20Z

Fixes #469

svroonland

Looks good! I wonder if we could simplify a bit by not buffering at all but immediately pushing the 'to be buffered' records on the partition stream with this mechanism.

svroonland · 2022-06-01T07:32:54Z

src/main/scala/zio/kafka/consumer/internal/PartitionStreamControl.scala

+import zio.kafka.consumer.internal.Runloop.ByteArrayCommittableRecord
+import zio.stream.Take
+
+case class PartitionStreamControl(


Should it be private?

vigoo · 2022-06-06T08:49:05Z

Looks good! I wonder if we could simplify a bit by not buffering at all but immediately pushing the 'to be buffered' records on the partition stream with this mechanism.

Sounds like a good idea but:

it will also affect the "normal mode" - need to think about whether it remains correct or not.
I'm not sure it would not make the processing unordered

vigoo · 2022-06-07T10:35:52Z

Note: I'm still trying to validate this in production and seeing some problems (not sure yet if related)

vigoo · 2022-06-08T13:51:28Z

The problem was that merge introduced a 1-element buffer for each partition (at least the ZIO 1 version) and it was taking too much memory in our application. As the whole drainQueue was only used at the "end" of the partition stream anyway, I changed the merge to concat which fixed our memory issue.

With this change from my end the change is ready to merge

vigoo · 2022-06-22T10:37:42Z

Note: one more fix is required for this, I'm validating it in prod now.

vigoo · 2022-07-06T10:01:08Z

I think the fix is now complete, I plan to publish a detailed blog post about it that will help understand.

vigoo · 2022-07-15T13:41:26Z

Detailed explanation: https://ziverge.com/blog/zio-kafka-with-transactions-debugging-story

Fix dataloss issue in restarting-streams-at-rebalancing mode

15fd5aa

vigoo requested a review from iravid as a code owner May 31, 2022 08:23

svroonland previously approved these changes Jun 1, 2022

View reviewed changes

Update PartitionStreamControl.scala

9a27458

svroonland dismissed their stale review via 9a27458 June 7, 2022 07:42

svroonland and others added 3 commits June 7, 2022 12:37

Update PartitionStreamControl.scala

7c3b116

Use concat instead of merge

a2c26d6

Merge branch 'master' into restarting-mode-dataloss-fix

2675a27

One more fix

efca247

vigoo added 4 commits July 6, 2022 11:47

Refactor introducing BufferedRecords type

16f0df9

Fix handling of requested records after rebalance

04f11de

Fix case when revoke and assign are happening in different polls

0f599ca

Merge branch 'master' into restarting-mode-dataloss-fix

072c6cb

vigoo mentioned this pull request Aug 22, 2022

Fix dataloss issue in restarting-streams-at-rebalancing mode #500

Merged

vigoo added 2 commits August 22, 2022 16:24

Merge branch 'master' into restarting-mode-dataloss-fix

3fd9edb

Merge branch 'master' into restarting-mode-dataloss-fix

c61b217

vigoo merged commit 1e17e32 into zio:master Sep 23, 2022

vigoo deleted the restarting-mode-dataloss-fix branch September 23, 2022 19:03

guizmaii mentioned this pull request Mar 12, 2023

Performance improvements #697

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix dataloss issue in restarting-streams-at-rebalancing mode #473

Fix dataloss issue in restarting-streams-at-rebalancing mode #473

vigoo commented May 31, 2022

svroonland left a comment

svroonland Jun 1, 2022

vigoo commented Jun 6, 2022

vigoo commented Jun 7, 2022

vigoo commented Jun 8, 2022

vigoo commented Jun 22, 2022

vigoo commented Jul 6, 2022

vigoo commented Jul 15, 2022

Fix dataloss issue in restarting-streams-at-rebalancing mode #473

Fix dataloss issue in restarting-streams-at-rebalancing mode #473

Conversation

vigoo commented May 31, 2022

svroonland left a comment

Choose a reason for hiding this comment

svroonland Jun 1, 2022

Choose a reason for hiding this comment

vigoo commented Jun 6, 2022

vigoo commented Jun 7, 2022

vigoo commented Jun 8, 2022

vigoo commented Jun 22, 2022

vigoo commented Jul 6, 2022

vigoo commented Jul 15, 2022