
[FLINK-4723] [kafka-connector] Unify committed offsets to Kafka to be the next record to process #2580

Closed
wants to merge 5 commits

Conversation

@tzulitai (Contributor) commented on Oct 3, 2016

The description within the JIRA ticket (FLINK-4723) explains the reasoning for this change.

With this change, the offsets committed to Kafka by both the 0.8 and 0.9 consumers are larger by 1 than the internally checkpointed offsets. The change is made at the FlinkKafkaConsumerBase level, so that the offsets handed to the version-specific implementations through the abstract commitSpecificOffsetsToKafka() method are already incremented and represent the next record to process. This way, the version-specific implementations simply commit the given offsets without needing to manipulate them.
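For illustration, a minimal sketch of this base-level contract, with simplified, hypothetical names (commitSpecificOffsetsToKafka comes from the description above; the partition key type and the onCheckpointComplete hook are stand-ins, not Flink's actual code):

```java
import java.util.HashMap;
import java.util.Map;

// Sketch of the base-level adjustment described above. A plain String stands
// in for Flink's KafkaTopicPartition to keep the example self-contained.
abstract class OffsetCommitSketch {

    /** Version-specific committers (0.8 / 0.9) commit exactly the values they are given. */
    protected abstract void commitSpecificOffsetsToKafka(Map<String, Long> offsets) throws Exception;

    /** Called when a checkpoint completes; the map holds the last processed offset per partition. */
    void onCheckpointComplete(Map<String, Long> checkpointedOffsets) throws Exception {
        Map<String, Long> toCommit = new HashMap<>(checkpointedOffsets.size());
        for (Map.Entry<String, Long> e : checkpointedOffsets.entrySet()) {
            // committed offset = last processed offset + 1, i.e. the next record to process
            toCommit.put(e.getKey(), e.getValue() + 1);
        }
        commitSpecificOffsetsToKafka(toCommit);
    }
}
```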

To test the behaviour on both connector versions, this PR also includes a major refactoring of the IT tests: the offset-committing-related IT tests are moved up into FlinkKafkaConsumerTestBase, so that both the 0.8 and 0.9 consumers run the offset committing / initial offset startup tests (previously only the 0.8 consumer had these tests).

R: @rmetzger what do you think of this?

@tzulitai (Contributor, Author) commented on Oct 3, 2016

Seems like one of the new IT tests is a bit unstable, fixing it ...

@StephanEwen (Contributor) commented:

Looks quite good. I would suggest one change, though:

Can we avoid copying the offsets in the checkpoint into a new map (with increment by one) and passing that to the ZooKeeper offset committer or the Kafka offset committer? I am just not a big fan of copying things back and forth (especially in prepareSnapshot(), which we want to keep as lightweight as possible). Instead, can we have the contract that the offset committers always commit "+1" from the value they get (pretty much as it was in the 0.9 committer after FLINK-4618)?
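A minimal sketch of the suggested alternative, with hypothetical names (not Flink's actual classes): the checkpointed map is passed through untouched, and the increment moves into each committer:

```java
import java.util.Map;

// Sketch of the suggested contract: no copy is made on the checkpointing
// path; each committer applies the +1 at commit time.
interface OffsetCommitter {
    /** Receives the last processed offsets and must commit (offset + 1) for each entry. */
    void commitOffsets(Map<String, Long> lastProcessedOffsets) throws Exception;
}

class ZookeeperCommitterSketch implements OffsetCommitter {
    @Override
    public void commitOffsets(Map<String, Long> lastProcessedOffsets) throws Exception {
        for (Map.Entry<String, Long> e : lastProcessedOffsets.entrySet()) {
            // the increment happens here, inside the committer, not in a copied map
            writeOffset(e.getKey(), e.getValue() + 1);
        }
    }

    private void writeOffset(String partition, long nextOffsetToProcess) {
        // placeholder for the actual ZooKeeper write
    }
}
```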

Concerning the tests, is the stability issue fixed there?
What I frequently do is push the same commit to 10 different newly created branches to keep Travis busy overnight with 10 test runs, and then check whether any stability issue shows up.

@tzulitai (Contributor, Author) commented on Oct 5, 2016

Thanks for the review @StephanEwen.

Concerning changing the contract for commitSpecificOffsetsToKafka:
Makes sense, I don't really like excessive copying either. With proper tests on both 0.8 and 0.9, I think it's reasonable to change this. I'll update the PR, and probably also rename commitSpecificOffsetsToKafka to reflect the contract behaviour.
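For illustration, the renamed hook could carry the new contract in its Javadoc; the method name below is a hypothetical choice, not necessarily what the PR settled on:

```java
import java.util.Map;

// Hypothetical shape of the renamed hook; the rename itself is what is
// proposed above, but the name and Javadoc here are illustrative only.
abstract class RenamedHookSketch {
    /**
     * Commits offsets back to Kafka/ZooKeeper. The given offsets are the
     * internally checkpointed (last processed) offsets per partition;
     * implementations must commit {@code offset + 1}, i.e. the offset of
     * the next record to process.
     */
    protected abstract void commitInternalOffsetsToKafka(Map<String, Long> offsets) throws Exception;
}
```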

Thanks for the tip on test stability, I'll do that ;)

@tzulitai force-pushed the FLINK-4723 branch 2 times, most recently from 00ce52b to a8267dd on October 7, 2016
@rmetzger (Contributor) left a review:

I like the change overall; I had one question regarding a test case. Once that and the merge conflicts are resolved, the change is good to merge.

The review thread attaches to this test Javadoc (diff context, truncated as shown on the page):

    }

    /**
     * This test first writes a total of 200 records to a test topic, reads the first 100 so that some offsets are

@rmetzger (Contributor) commented on Oct 10, 2016:

300, 150 (the record counts in this Javadoc should read 300 and 150)

Diff context:

    Long o1 = kafkaOffsetHandler.getCommittedOffset(topicName, 0);
    Long o2 = kafkaOffsetHandler.getCommittedOffset(topicName, 1);
    Long o3 = kafkaOffsetHandler.getCommittedOffset(topicName, 2);

    LOG.info("Got final committed offsets from Kafka o1={}, o2={}, o3={}", o1, o2, o3);
@rmetzger (Contributor) commented:

I wonder whether it makes sense to check that at least one of o1, o2 and o3 is not 300. If they are all 300, the test below would have nothing left to read.

@tzulitai (Contributor, Author) commented on Oct 13, 2016:

It should be impossible for them to be 300, because we stop the first consuming job once it hits the 150th record.
However, I think it is reasonable to check that at least one of o1, o2, o3 is not null before proceeding with the next consuming job. We'd want at least some start offsets in place to test the start-from-committed-offsets behaviour. What do you think?
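A minimal sketch of that check, assuming JUnit (as used in Flink's IT tests) and the o1/o2/o3 values from the diff context above; it is a fragment meant to sit before the second consuming job is launched:

```java
import static org.junit.Assert.assertTrue;

// Sketch of the proposed sanity check: require at least one committed offset
// before starting the job that reads from committed offsets.
assertTrue("at least one partition should have a committed offset before the next consuming job",
        o1 != null || o2 != null || o3 != null);
```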

@rmetzger (Contributor) replied:

That sounds reasonable.

@tzulitai (Contributor, Author) commented on Oct 13, 2016

Thanks for the review @rmetzger. I've created several local branches to test the stability of the new IT tests on Travis, as @StephanEwen suggested, and they seem to be fine.

I'll rebase this, address the last few comments, and give the changes a final test run before merging.

@tzulitai force-pushed the FLINK-4723 branch 2 times, most recently from 865143c to f7b4589 on October 17, 2016
@tzulitai (Contributor, Author) commented:

Merging this once tests turn green.

@tzulitai (Contributor, Author) commented:

Merging this to master now ...

@asfgit asfgit closed this in f46ca39 Oct 18, 2016
liuyuzhong pushed a commit to liuyuzhong/flink that referenced this pull request Dec 5, 2016