[bugfix] Fix regression with the upgrade to 2.8.x: Count corrently the number of partitions in ProduceRequest #1645

eolivelli · 2022-12-23T08:30:22Z

Motivation

With the upgrade to Kafka 2.8.x we introduced a regression in the handleProduceRequest method, we are not counting
correctly the number of partitions.
If the client sends data for more than one partition for the same topic, then the result is a REQUEST_TIMED_OUT because we are calling "completeOne.run()" once per partition, but we are using the number of topics as the expected number of "completeOne" calls.

Modifications

Count correctly the number of partitions.

Verifying this change

This change is a trivial rework / code cleanup without any test coverage.

Documentation

Check the box below.

Need to update docs?

doc-required

(If you need help on updating docs, create a doc issue)
no-need-doc

(Please explain why)
doc

(If this PR contains doc changes)

…e number of partitions in ProduceRequest

BewareMyPower

Could you add a test to avoid the regression?

codecov · 2022-12-23T09:51:46Z

Codecov Report

Merging #1645 (8e7509d) into master (1e5433d) will increase coverage by 0.00%.
The diff coverage is 0.00%.

@@            Coverage Diff            @@
##             master    #1645   +/-   ##
=========================================
  Coverage     15.77%   15.78%           
- Complexity      612      613    +1     
=========================================
  Files           164      164           
  Lines         12249    12254    +5     
  Branches       1124     1124           
=========================================
+ Hits           1932     1934    +2     
- Misses        10158    10162    +4     
+ Partials        159      158    -1

Impacted Files	Coverage Δ
...ative/pulsar/handlers/kop/KafkaRequestHandler.java	`1.09% <0.00%> (-0.01%)`	⬇️
...pulsar/handlers/kop/utils/timer/TimerTaskList.java	`78.31% <0.00%> (+2.40%)`	⬆️

eolivelli · 2022-12-23T12:49:08Z

Sure, I will.
I wanted to post this PR as soon as possible, because in my fork after the upgrade we found those TIME OUT errors that were without an explanation and it took days to find out the problem.

eolivelli · 2022-12-23T12:49:46Z

I also have a couple of other follow up PRs regarding the upgrade of the kafka clients, in order to improve performances.
I will send new PRs new week

michaeljmarshall

Nice catch @eolivelli.

michaeljmarshall · 2022-12-23T22:25:43Z

kafka-impl/src/main/java/io/streamnative/pulsar/handlers/kop/KafkaRequestHandler.java

+        final int numPartitions = produceRequest
+                .data()
+                .topicData()
+                .stream()


Streams can be less efficient than iterating directly and adding to a mutable variable. Given that this code is on the hot path, can we use a for loop?

While a generally agree on this point, the new Java model for the wire protocol is using this pattern with multiple layers (topics > partitions).
Doing 2 'for' loops will require in any case a similar amount of resources, especially if the data structures are small. The gain probably is negligible.

Please note that we could create some utility method and refactor the codebase. I didn't do it because I wasn't sure that we would be axtuslly able to make the code more readable or efficient.

The code in this form is pretty compact and readable.
I am leaning towards keeping it like this

BewareMyPower

I just took a look again. It only affects the time to send produce response and it might be a little hard to add the test. I will merge this fix first.

…e number of partitions in ProduceRequest (#1645) ### Motivation With the upgrade to Kafka 2.8.x we introduced a regression in the handleProduceRequest method, we are not counting correctly the number of partitions. If the client sends data for more than one partition for the same topic, then the result is a REQUEST_TIMED_OUT because we are calling "completeOne.run()" once per partition, but we are using the number of topics as the expected number of "completeOne" calls. ### Modifications Count correctly the number of partitions. (cherry picked from commit 9d0f57b)

[bugfix] Fix regression with the upgrade to 2.8.x: Count corrently th…

8e7509d

…e number of partitions in ProduceRequest

eolivelli requested review from BewareMyPower and Demogorgon314 as code owners December 23, 2022 08:30

github-actions bot assigned eolivelli Dec 23, 2022

github-actions bot added the no-need-doc This pr does not need any document label Dec 23, 2022

BewareMyPower approved these changes Dec 23, 2022

View reviewed changes

BewareMyPower suggested changes Dec 23, 2022

View reviewed changes

michaeljmarshall reviewed Dec 23, 2022

View reviewed changes

BewareMyPower approved these changes Dec 26, 2022

View reviewed changes

BewareMyPower merged commit 9d0f57b into streamnative:master Dec 26, 2022

BewareMyPower added the release/2.11 label Apr 25, 2023

BewareMyPower added the cherry-picked/branch-2.11 label Apr 25, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[bugfix] Fix regression with the upgrade to 2.8.x: Count corrently the number of partitions in ProduceRequest #1645

[bugfix] Fix regression with the upgrade to 2.8.x: Count corrently the number of partitions in ProduceRequest #1645

eolivelli commented Dec 23, 2022

BewareMyPower left a comment

codecov bot commented Dec 23, 2022

eolivelli commented Dec 23, 2022

eolivelli commented Dec 23, 2022

michaeljmarshall left a comment

michaeljmarshall Dec 23, 2022

eolivelli Dec 24, 2022

BewareMyPower left a comment

[bugfix] Fix regression with the upgrade to 2.8.x: Count corrently the number of partitions in ProduceRequest #1645

[bugfix] Fix regression with the upgrade to 2.8.x: Count corrently the number of partitions in ProduceRequest #1645

Conversation

eolivelli commented Dec 23, 2022

Motivation

Modifications

Verifying this change

Documentation

BewareMyPower left a comment

Choose a reason for hiding this comment

codecov bot commented Dec 23, 2022

Codecov Report

eolivelli commented Dec 23, 2022

eolivelli commented Dec 23, 2022

michaeljmarshall left a comment

Choose a reason for hiding this comment

michaeljmarshall Dec 23, 2022

Choose a reason for hiding this comment

eolivelli Dec 24, 2022

Choose a reason for hiding this comment

BewareMyPower left a comment

Choose a reason for hiding this comment