Verify checkpoint saving strategy #72

fbeltrao · 2019-04-11T21:43:00Z

Checkpoint saving current is done using Consumer.Commit which blocks the thread. An alternative is to use StoreOffset that will save the checkpoint asynchronously in librdkafka.

Commit is more accurate while StoreOffset offers a better throughput.

Would love your feedback @jeffhollan, @anirudhgarg and @ryancrawcour

jeffhollan · 2019-04-11T22:13:39Z

I imagine the risk of StoreOffset is that if something happens on the async thread and the commit doesn't checkpoint you may end up in a world where you replay a few batches of messages you thought were being checkpoint but the checkpoint was never active? In general we are pretty clear you need to be at-least-once expected, so I think I'm ok with the StoreOffset given the better throughput.

The other alternative is we make it a configurable choice in the host.json config so they can choose the method they want and we default to one.

ryancrawcour · 2019-04-12T00:59:38Z

ou need to be at-least-once expected, so I think I'm ok with the StoreOffset given the better throughput.

that's my feeling too

ryancrawcour · 2019-04-12T01:00:29Z

we make it a configurable choice

that's not a bad idea. how complex would this make the code @fbeltrao ?

fbeltrao · 2019-04-12T11:03:07Z

The code to support both is not complex. I am questioning the value.

What Jeff says is right, consumers should be implemented to process messages at least once anyway.

I can run a few tests to see if the messages processing repetition is noticeable in our e2e tests if we optimize for throughput.

ryancrawcour · 2019-04-12T16:07:51Z

I'd say let's pick one. Then if customers ask for the other one, or complain about the one we picked, we can then always come back and revisit this.

I am happy with "you're expected to ensure your consumer is idempotent b/c at-least-once semantics" and go for the higher throughput.

fbeltrao added the question Further information is requested label Apr 11, 2019

fbeltrao added this to the P1 milestone Apr 11, 2019

fbeltrao self-assigned this Apr 12, 2019

fbeltrao mentioned this issue Apr 12, 2019

Producer and consumer optimizations #73

Merged

ryancrawcour closed this as completed in #73 Apr 12, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Verify checkpoint saving strategy #72

Verify checkpoint saving strategy #72

fbeltrao commented Apr 11, 2019

jeffhollan commented Apr 11, 2019

ryancrawcour commented Apr 12, 2019

ryancrawcour commented Apr 12, 2019

fbeltrao commented Apr 12, 2019 •

edited

Loading

ryancrawcour commented Apr 12, 2019 •

edited

Loading

Verify checkpoint saving strategy #72

Verify checkpoint saving strategy #72

Comments

fbeltrao commented Apr 11, 2019

jeffhollan commented Apr 11, 2019

ryancrawcour commented Apr 12, 2019

ryancrawcour commented Apr 12, 2019

fbeltrao commented Apr 12, 2019 • edited Loading

ryancrawcour commented Apr 12, 2019 • edited Loading

fbeltrao commented Apr 12, 2019 •

edited

Loading

ryancrawcour commented Apr 12, 2019 •

edited

Loading