[Question] Specified group generation id is not valid #1009

vicmerlis · 2021-01-28T13:54:34Z

Describe the bug
Consumers (sometimes) encounters the following exception: Specified group generation id is not valid. The consumers that encounters that error become a kind of Zombie. They are still connected as consumers to a partition but not consuming messages.

To Reproduce
Can't reproduce

Expected behavior
The consumer should reconnect to the consumer grouop.

Observed behavior
Logs:

[ConsumerGroup] Consumer has joined the group
The group is rebalancing, so a rejoin is needed
Specified group generation id is not valid

Environment:

OS: [alpine]
KafkaJS version [1.15.0]
Kafka version [2.4.1]
NodeJS version [14]

Additional context
It's probably not related to KafkaJs, but there is a mention of that error error.js and maybe you have any idea why it's happening?

The text was updated successfully, but these errors were encountered:

tulios · 2021-01-28T14:25:22Z

That's a server error, how are you generating your group ids?

vicmerlis · 2021-01-28T15:15:32Z

the groupId looks like: ${Environment}-staticString

this._Kafka = new KafkaJs({brokers: [host], ...clientOptions});
const groupId = options.groupId;
const currentOptions = Object.assign({}, options, {topic: topic});
const consumer = this._Kafka.consumer(currentOptions);
await consumer.connect();
await consumer.subscribe(currentOptions);
consumer.run({
    eachMessage: async ({topic, partition, message}) => {
        return this._handleConsumerMessages({topic, message, groupId, partition});
    }, ...currentOptions
});

Nevon · 2021-01-28T16:26:39Z

The groupGenerationId is something we get from the broker (generation_id) in the JoinGroup response, and it's just a number that increments with each generation in the group.

You'll get this error when you try to commit after having been kicked out of the consumer group. This could for example happen if you spend too long in between heartbeats (because you're processing a single message for too long, for example). What should happen is that you should re-join the group and get the new generation id to use.

It would be helpful if you could run with DEBUG log level so that we can see what requests are being made when this happens.

The consumers that encounters that error become a kind of Zombie. They are still connected as consumers to a partition but not consuming messages.

This is a shot in the dark, but do you maybe have more consumer instances than you do partitions? If so, some of your consumers will not be assigned any partitions, and thus won't be doing any work.

vicmerlis · 2021-01-28T17:22:37Z

I'll enable debug log level and will post once the error occurs.

Regarding more consumers than partitions - we are running on ECS with autoscaling, max tasks=60 (each task = 1 consumer). The topic configured with 60 partitions. I'm pretty sure that we didn't reached to the max number of tasks, but i'll check it also once the error occurs.

mremick · 2021-01-28T23:19:14Z

Hello, I'm having a similar issue. Could increasing the heartbeatInterval and sessionTimeout potentially fix the issue? I think I'm taking too long while processing when making a network request with high latency.

Nevon · 2021-01-29T06:52:47Z

Could increasing the heartbeatInterval and sessionTimeout potentially fix the issue?

Yes. You have to tweak those to fit your application behavior.

mremick · 2021-01-29T07:01:07Z

Thanks. It fixed my issue.

This error is indicating that the consumer is trying to commit offsets, but the consumer group has changed to a new generation. Retrying within the existing session will indeed not work, but rejoining the group and re-trying should be successful. Fixes tulios#1009

guiestimoneon · 2022-12-23T18:00:22Z

Hello guys

I am having this issue when I scale my application horizontally. The pod is processing normally and out of nowhere I get this error:

I suspect a rebalance has occurred and the pod still tries to commit a message.
I've tried the above solutions but to no avail.

This error is indicating that the consumer is trying to commit offsets, but the consumer group has changed to a new generation. Retrying within the existing session will indeed not work, but rejoining the group and re-trying should be successful. Fixes tulios#1009

tulios added the question label Jan 28, 2021

tulios closed this as completed Feb 15, 2021

hbazan-pp mentioned this issue Oct 17, 2022

"Specified group generation id is not valid" after broker maintenance, consumer stops receiving events #1466

Closed

jakewins mentioned this issue Nov 2, 2022

Consider ILLEGAL_GENERATION error as rebalancing error #1474

Merged

theoengland mentioned this issue Nov 24, 2022

Confluent.Kafka.KafkaException: Broker: Specified group generation id is not valid quixio/quix-streams#8

Closed

patrykwegrzyn mentioned this issue Jan 26, 2023

Consider ILLEGAL_GENERATION error as rebalancing error patrykwegrzyn/kafkajs#1

Merged

matray mentioned this issue Apr 27, 2023

Hotfix andrewreineke/kafkajs#1

Open

peter-quix mentioned this issue Nov 16, 2023

Confluent.Kafka.KafkaException: Broker: Specified group generation id is not valid quixio/quix-streams-dotnet#7

Open

kiessan mentioned this issue Jun 26, 2024

Error "Specified group generation id is not valid" during consumption mateusjunges/laravel-kafka#295

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Question] Specified group generation id is not valid #1009

[Question] Specified group generation id is not valid #1009

vicmerlis commented Jan 28, 2021

tulios commented Jan 28, 2021

vicmerlis commented Jan 28, 2021

Nevon commented Jan 28, 2021

vicmerlis commented Jan 28, 2021

mremick commented Jan 28, 2021

Nevon commented Jan 29, 2021

mremick commented Jan 29, 2021

guiestimoneon commented Dec 23, 2022

[Question] Specified group generation id is not valid #1009

[Question] Specified group generation id is not valid #1009

Comments

vicmerlis commented Jan 28, 2021

tulios commented Jan 28, 2021

vicmerlis commented Jan 28, 2021

Nevon commented Jan 28, 2021

vicmerlis commented Jan 28, 2021

mremick commented Jan 28, 2021

Nevon commented Jan 29, 2021

mremick commented Jan 29, 2021

guiestimoneon commented Dec 23, 2022