
Destroying queue references makes polling unusable #1792

Closed
accelerated opened this issue May 3, 2018 · 11 comments
@accelerated

This could be a bug, or perhaps I'm doing something wrong; please read on.

Current workflow for the auto-rebalance consumer (a minimal code sketch follows the list):

  1. Forward the main queue to the consumer queue with rd_kafka_poll_set_consumer
  2. Get references to both the consumer queue and to all assigned partition queues with rd_kafka_queue_get_consumer and rd_kafka_queue_get_partition respectively.
  3. On the partition queues, stop forwarding to the consumer queue with rd_kafka_queue_forward(..., NULL)
  4. Poll both the consumer queue and the individual partition queues for events via rd_kafka_consume_queue.
  5. When complete, forward the partition queues back to the consumer queue with rd_kafka_queue_forward.
  6. Delete all queue references by calling rd_kafka_queue_destroy.
  7. Repeat 2-6 if needed.
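
Roughly, that flow looks like this (a minimal sketch; `rk` is assumed to be an already-created and subscribed consumer handle, and the topic name, partition and timeouts are placeholders):

```c
#include <librdkafka/rdkafka.h>

/* Sketch of steps 1-6 for a single partition queue.
 * "mytopic"/partition 0 and the 100 ms timeouts are placeholders. */
static void poll_partition_queue_once(rd_kafka_t *rk) {
        /* 1. Redirect the main poll queue to the consumer queue. */
        rd_kafka_poll_set_consumer(rk);

        /* 2. Take references to the consumer queue and a partition queue. */
        rd_kafka_queue_t *cons_q = rd_kafka_queue_get_consumer(rk);
        rd_kafka_queue_t *part_q = rd_kafka_queue_get_partition(rk, "mytopic", 0);

        /* 3. Stop forwarding the partition queue to the consumer queue. */
        rd_kafka_queue_forward(part_q, NULL);

        /* 4. Poll both queues. */
        rd_kafka_message_t *msg;
        if ((msg = rd_kafka_consume_queue(cons_q, 100)))
                rd_kafka_message_destroy(msg);
        if ((msg = rd_kafka_consume_queue(part_q, 100)))
                rd_kafka_message_destroy(msg);

        /* 5. Re-instate forwarding to the consumer queue. */
        rd_kafka_queue_forward(part_q, cons_q);

        /* 6. Drop the queue references. Repeating from step 2 after this
         *    is where things stop working. */
        rd_kafka_queue_destroy(part_q);
        rd_kafka_queue_destroy(cons_q);
}
```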

Issue I'm seeing:

Steps 1-6 work well. I can get messages and poll on all queues, including the consumer queue, and all events get serviced, which is fine. The problem arises when I try to repeat steps 2-6: at that point nothing works anymore. I get new references to the queues, but the queues seem to be disabled somehow and the rdkafka background thread is not fetching anything from the broker. The call to rd_kafka_cgrp_partitions_fetch_start happens when assignments are done, but if the queue is not working, I never get these events! I tracked the problem to rd_kafka_queue_destroy, which not only decrements the reference count but also deletes the queue entirely. Normally the consumer object should hold at least one reference to the consumer queue and to each partition queue, but that does not seem to be the case, so the destroy is final.
I tried calling rd_kafka_consume_start or rd_kafka_consume_queue_start on the new queues, but this doesn't do anything. Furthermore, after calling rd_kafka_consume_start, if I poll the main consumer queue I hit an assert. I even tried calling rd_kafka_consume_stop before the destroy and then rd_kafka_consume_start again on the new references, but that doesn't work either.
Currently the only way I can actually repeat steps 2-6 successfully is to never call rd_kafka_queue_destroy. I also hit a deadlock at some point, but unfortunately I could not reproduce it (scary!).
As a side note, it would be nice to expose rd_kafka_q_purge0 in the public API. Right now, when I'm done polling an individual partition queue and re-instate forwarding to the consumer queue, all unprocessed messages in the local partition queue are copied over to the forwarded queue, which is not always desirable. I would prefer to have control and flush the local queues only if I want to.
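
Something like this is what I have in mind, with part_q and cons_q as in the sketch above (rd_kafka_queue_purge below is hypothetical; it does not exist in the public API, it's just the kind of wrapper around rd_kafka_q_purge0 I'm asking for):

```c
/* Hypothetical usage: purge the local partition queue first, then restore
 * forwarding, so unprocessed messages are NOT copied to the consumer queue.
 * rd_kafka_queue_purge() is NOT a real librdkafka function. */
rd_kafka_queue_purge(part_q);
rd_kafka_queue_forward(part_q, cons_q);
```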

Any ideas on what I'm doing wrong, or do you think there's actually a bug?

@edenhill
Contributor

edenhill commented May 3, 2018

Ha, I just found the exact same issue yesterday in code that hasn't been touched in years. What are the odds? :)

The problem is that rd_kafka_queue_destroy() always treats the application as the owner of the queue, which is correct for rd_kafka_queue_new() queues, but not for references to existing internal queues such as those from rd_kafka_queue_get_partition() et al. This causes the queue to be disabled, which leads to exactly what you are seeing: silence.
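
To illustrate the two ownership cases (the topic/partition values are placeholders):

```c
/* Application-created queue: the application owns it, so it is correct for
 * rd_kafka_queue_destroy() to disable and free it. */
rd_kafka_queue_t *own_q = rd_kafka_queue_new(rk);
rd_kafka_queue_destroy(own_q);

/* Reference to an existing internal queue: destroy() should only drop the
 * reference. Before the fix it also disabled the underlying queue, which is
 * why the partition went silent on the next iteration. */
rd_kafka_queue_t *part_q = rd_kafka_queue_get_partition(rk, "mytopic", 0);
rd_kafka_queue_destroy(part_q);
```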

I'll post a fix soon.

As for queue_purge(): I'll look into it, it makes sense.

@accelerated
Author

accelerated commented May 3, 2018

Wow, yeah, talk about odds! I guess most people don't use these more advanced APIs. One more thing I would like to understand: when a new assignment (or un-assignment) is made, are the internal partition queues purged and destroyed under the hood, with new ones created after the new assignment? Or do you just purge the messages but keep the queues around?
PS: that makes me wonder about the deadlock I ran into... I wonder if you saw that as well?

@edenhill
Contributor

edenhill commented May 3, 2018

An op (rd_kafka_op_t), which is what a queue is made up of, has an optional version barrier.
Fetched messages (RD_KAFKA_OP_FETCH) have a version barrier based on the last fetcher state update for the given partition. When the fetcher state is updated, due to a rebalance, stop, pause, or whatever, the messages in the queue are either purged directly based on this version, or purged lazily as the queue is read through an op version filter. Effectively this means that messages fetched for a previous fetcher state / version barrier will not be seen by the application.

Asynchronicity in all its glory.

https://github.com/edenhill/librdkafka/blob/master/src/rdkafka_queue.c#L342

https://github.com/edenhill/librdkafka/blob/master/src/rdkafka_queue.c#L286
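
Conceptually it works something like this (a simplified sketch, not the actual librdkafka internals; the names are invented for illustration):

```c
#include <stdlib.h>

/* Each op records the fetcher-state version it was produced under.
 * Ops older than the current barrier are dropped lazily as the queue is read. */
typedef struct op_s {
        struct op_s *next;
        int version;            /* version barrier at enqueue time */
        /* ... payload, e.g. a fetched message ... */
} op_t;

static op_t *queue_pop_filtered(op_t **head, int current_version) {
        while (*head) {
                op_t *op = *head;
                *head = op->next;
                if (op->version >= current_version)
                        return op;      /* still valid: deliver to the application */
                free(op);               /* outdated: purge silently */
        }
        return NULL;
}
```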

@accelerated
Author

Ok, thanks. Please let me know when you post the fix; a check-in in cppkafka is pending on it as well, which I managed to make work with the above workaround, i.e. not calling destroy. Also, on a totally unrelated thread: when you get a chance, it would be great if you could review the latest PR (on the scope of setting options) which I made a while back (based on our conversations). That one is also pending a fix in cppkafka :).

@mfontanini
Contributor

@edenhill just to be sure: would it cause issues if cppkafka used the queue handles and never destroyed them? Will that memory be released once the rdkafka handle gets destroyed?

Also @accelerated we can't really rely on this change, as older users of rdkafka would otherwise be stuck with the broken behavior. At best the code could choose to return a non-owning or an owning Queue object depending on the rdkafka version being used.

@accelerated
Author

accelerated commented May 3, 2018

@mfontanini I would wait until the fix lands in librdkafka and then I will provide "full ownership" of handles. But in terms of backwards compatibility... we can just say that this round-robin adapter only works with a certain version of librdkafka. We can take this discussion to the cppkafka thread; I'm fine either way.

@accelerated
Author

@edenhill Hi, what is the ETA for 0.11.5? Any ideas?

@edenhill
Contributor

@accelerated we're aiming for 2-3 weeks

@accelerated
Author

Thanks

@agis

agis commented Jul 19, 2018

This can probably be closed (fixed in 0.11.5)?

@edenhill
Contributor

Yes! thanks
