
Refactor the producer, part 2 #550

Merged: eapache merged 1 commit into master from producer-refactor-2 on Oct 15, 2015
Conversation

eapache (Contributor) commented on Oct 5, 2015

Second stand-alone chunk extracted from #544 (first chunk: #549). This uses the
`produceSet` struct in the aggregator as well, and moves the `wouldOverflow` and
`readyToFlush` methods onto the `produceSet` (a rough sketch of the idea follows the list below).

Knock-on effects:

 - now that we do per-partition size tracking in the aggregator, we can do much
   more precise overflow checking (see the compressed-message-batch-size-limit
   case in `wouldOverflow`, which has changed), which will be more efficient in
   high-volume scenarios
 - since the `produceSet` encodes immediately, messages which fail to encode are
   now rejected from the aggregator and don't count towards the batch size
 - we still have to iterate over the messages in the flusher in order to reject
   those which need retrying due to the state machine; for simplicity I still add
   them to a second `produceSet`, which means all messages get encoded twice; this
   is a definite major performance regression which will go away again in part 3
   of this refactor
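
For context, here is a minimal, self-contained sketch of the `produceSet` idea described above. It is not Sarama's actual implementation: the `message`, `partitionKey`, `limit`, and `add` names are assumptions for illustration only; only the `wouldOverflow`/`readyToFlush` responsibilities come from the description.

```go
package main

import "fmt"

// partitionKey identifies a topic/partition pair for per-partition tracking.
type partitionKey struct {
	topic     string
	partition int32
}

// message is a stand-in for a producer message; encoded holds the bytes
// produced when the message is added to the set (nil = encode failure).
type message struct {
	topic     string
	partition int32
	encoded   []byte
}

// produceSet groups pending messages by partition and tracks their encoded size.
type produceSet struct {
	msgs  map[partitionKey][]*message
	sizes map[partitionKey]int
	limit int // hypothetical per-partition byte limit
}

func newProduceSet(limit int) *produceSet {
	return &produceSet{
		msgs:  make(map[partitionKey][]*message),
		sizes: make(map[partitionKey]int),
		limit: limit,
	}
}

// add stores an already-encoded message; messages that failed to encode are
// rejected up front and never count towards the batch size.
func (ps *produceSet) add(msg *message) error {
	if msg.encoded == nil {
		return fmt.Errorf("message failed to encode; rejected from aggregator")
	}
	key := partitionKey{msg.topic, msg.partition}
	ps.msgs[key] = append(ps.msgs[key], msg)
	ps.sizes[key] += len(msg.encoded)
	return nil
}

// wouldOverflow reports whether adding msg would push its partition past the
// byte limit; per-partition tracking is what makes this check more precise.
func (ps *produceSet) wouldOverflow(msg *message) bool {
	key := partitionKey{msg.topic, msg.partition}
	return ps.sizes[key]+len(msg.encoded) > ps.limit
}

// readyToFlush reports whether there is anything buffered to hand to the flusher.
func (ps *produceSet) readyToFlush() bool {
	return len(ps.msgs) > 0
}

func main() {
	ps := newProduceSet(1024)
	good := &message{topic: "t", partition: 0, encoded: []byte("hello")}
	bad := &message{topic: "t", partition: 0, encoded: nil} // simulated encode failure

	fmt.Println(ps.add(good))           // <nil>
	fmt.Println(ps.add(bad))            // rejected, doesn't count towards the batch
	fmt.Println(ps.wouldOverflow(good)) // false: well under the limit
	fmt.Println(ps.readyToFlush())      // true
}
```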

@wvanbergen

eapache changed the title from WIP to Refactor the producer, part 2 on Oct 9, 2015
eapache mentioned this pull request on Oct 14, 2015
@@ -320,7 +320,7 @@ func TestAsyncProducerEncoderFailures(t *testing.T) {
 	leader.Returns(prodSuccess)

 	config := NewConfig()
-	config.Producer.Flush.Messages = 3
+	config.Producer.Flush.Messages = 1
wvanbergen (Contributor) commented:

This was needed because the 2 messages that fail to encode are not counted towards the batch anymore, I assume?

eapache (Contributor, Author) replied:

bingo
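
To make the reasoning behind the test change concrete, here is a small stand-alone illustration, not the real test: `flakyEncoder` is a hypothetical stand-in for the failing encoder used in `TestAsyncProducerEncoderFailures`. Since only messages that encode successfully count towards `Flush.Messages`, a batch of three messages with two encode failures now needs a threshold of 1 to flush.

```go
package main

import (
	"errors"
	"fmt"
)

// flakyEncoder is an assumed stand-in for the test's failing encoder.
type flakyEncoder struct{ fail bool }

func (f flakyEncoder) Encode() ([]byte, error) {
	if f.fail {
		return nil, errors.New("encode failure")
	}
	return []byte("payload"), nil
}

func main() {
	flushMessages := 1 // was 3 before this PR
	batched := 0

	// Three messages: two fail to encode, one succeeds.
	for _, enc := range []flakyEncoder{{fail: true}, {fail: true}, {fail: false}} {
		if _, err := enc.Encode(); err != nil {
			continue // rejected from the aggregator, doesn't count towards the batch
		}
		batched++
	}

	fmt.Printf("batched=%d, flush triggers: %v\n", batched, batched >= flushMessages)
	// Previously all 3 messages counted towards Flush.Messages = 3; now only the
	// 1 that encodes counts, so the test needs Flush.Messages = 1 to flush.
}
```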

wvanbergen (Contributor) commented:

👍

eapache added a commit that referenced this pull request on Oct 15, 2015
eapache merged commit 4a6acf4 into master on Oct 15, 2015
eapache deleted the producer-refactor-2 branch on October 15, 2015 at 02:52