[Bug]: Kafka consumer offsets are not being committed to Kafka in Beam v2.52.0 on Flink Runner #30539
Closed
1 of 16 tasks
Labels
Milestone
What happened?
Beam v2.52.0
Flink v1.12.4
I have a Beam pipeline that reads from Kafka, transforms data and writes to Kafka. It runs on Flink with periodic checkpointing enabled. KafkaIO is set up as follows:
After moving to Beam v2.52.0, I don't see Kafka consumer offsets being committed to Kafka as part of checkpoint finalisation anymore. It worked as expected in Beam v2.51.0 and prior.
After looking at the code, I noticed that #28614 replaced
UnboundedSourceWrapper
withFlinkUnboundedSource/FlinkUnboundedSourceReader
.UnboundedSourceWrapper
implements Flink'sCheckpointListener
and overridesnotifyOnCheckpointComplete
method, which finalises Kafka checkpoint marks.However,
FlinkUnboundedSourceReader
->FlinkSourceReaderBase
implement FlinkSourceReader
which provides a default no-op implementation fornotifyOnCheckpointComplete
.FlinkUnboundedSourceReader
->FlinkSourceReaderBase
do not explicitly override the default implementation meaning that checkpoint finalisation in KafkaIO does not happen and Kafka consumer offsets are not committed back to Kafka. Is this accurate?Thank you!
Issue Priority
Priority: 2 (default / most bugs should be filed as P2)
Issue Components
The text was updated successfully, but these errors were encountered: