Checkpointing using the __consumer_offsets topic. #680

@chinmaychandak

I see that whenever a consumer consumes messages from a topic and commits its position, the latest offsets and metadata for its group are written to the __consumer_offsets topic.

Is there an efficient way today to retrieve the partitions (and therefore the committed offsets) for a specific group.id and topic from the __consumer_offsets topic? Any pointers would be appreciated.
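For illustration, here is a rough sketch of one way this lookup might be done without reading __consumer_offsets directly: ask the broker for the group's committed offsets through the consumer client. It assumes the confluent-kafka Python client, whose `Consumer.list_topics()` and `Consumer.committed()` calls are used below. The helper name `committed_offsets` and its `make_tp` parameter are made up for this example, and the broker address, group and topic names in the comments are placeholders.

```python
from typing import Callable, Dict


def committed_offsets(consumer, topic: str, make_tp: Callable,
                      timeout: float = 10.0) -> Dict[int, int]:
    """Return {partition: committed offset} for the consumer's group on `topic`.

    `consumer` is expected to behave like confluent_kafka.Consumer and
    `make_tp` like confluent_kafka.TopicPartition (both are assumptions).
    """
    # Cluster metadata lists the partitions that exist for the topic.
    metadata = consumer.list_topics(topic, timeout=timeout)
    partition_ids = sorted(metadata.topics[topic].partitions)

    # Ask the broker for the last committed offset of each partition
    # for the group.id this consumer was configured with.
    requested = [make_tp(topic, p) for p in partition_ids]
    committed = consumer.committed(requested, timeout=timeout)

    # A negative offset (for example OFFSET_INVALID, -1001) means the group
    # has not committed anything for that partition yet, so it is skipped.
    return {tp.partition: tp.offset for tp in committed if tp.offset >= 0}


# Hypothetical usage against a real broker (not executed here):
#   from confluent_kafka import Consumer, TopicPartition
#   consumer = Consumer({"bootstrap.servers": "localhost:9092",
#                        "group.id": "my-group"})
#   print(committed_offsets(consumer, "my-topic", TopicPartition))
```

Passing the `TopicPartition` class in as an argument is only a convenience so the helper can be exercised with stand-in objects; whether this is efficient enough for a streamz source would need to be measured.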

I am thinking of implementing Spark Streaming-style checkpointing in streamz. Since checkpointing requires writing checkpoint text files (each a partition:offset dictionary) after the messages have been processed, I would like to reuse the information that already exists in __consumer_offsets.
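To make the checkpoint idea concrete, here is a minimal sketch of writing and reading a partition:offset dictionary as a text file. It is not how Spark Streaming or streamz does it. The JSON layout and the function names `write_checkpoint` and `read_checkpoint` are assumptions for this example. The file is written to a temporary name and then renamed, so a reader should never see a half-written checkpoint.

```python
import json
import os
import tempfile
from typing import Dict


def write_checkpoint(path: str, offsets: Dict[int, int]) -> None:
    """Write {partition: offset} to `path` as JSON text."""
    directory = os.path.dirname(os.path.abspath(path))
    os.makedirs(directory, exist_ok=True)

    # Write to a temporary file in the same directory first.
    fd, tmp_path = tempfile.mkstemp(dir=directory, suffix=".tmp")
    try:
        with os.fdopen(fd, "w") as f:
            # JSON object keys must be strings, so partitions are stringified.
            json.dump({str(p): o for p, o in offsets.items()}, f)
            f.flush()
            os.fsync(f.fileno())
        # Rename over the target so readers see the old or the new file.
        os.replace(tmp_path, path)
    except BaseException:
        if os.path.exists(tmp_path):
            os.remove(tmp_path)
        raise


def read_checkpoint(path: str) -> Dict[int, int]:
    """Read a checkpoint back; an absent file means no checkpoint yet."""
    if not os.path.exists(path):
        return {}
    with open(path) as f:
        return {int(p): o for p, o in json.load(f).items()}
```

The offsets passed to `write_checkpoint` could come from the group's committed offsets once a batch has been processed, which is the reuse described above.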

Labels: enhancement (Requesting a feature change)
