Share buffer pool across all partitions #310
Conversation
Cool, the change LGTM +1. My only concern is whether the use of sync.Pool here will cause GC pressure, since a sync.Pool cannot be given a size limit and is only bounded by the GC threshold.
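To illustrate the concern, here is a generic sketch (not code from this PR): a sync.Pool takes no size parameter, and anything sitting in it may be released at the next garbage collection.

```go
package main

import (
	"bytes"
	"fmt"
	"sync"
)

// A sync.Pool has no capacity knob: you only supply New, which is called when
// Get finds the pool empty (for example after a GC cycle has dropped the
// pooled buffers).
var bufferPool = sync.Pool{
	New: func() interface{} { return new(bytes.Buffer) },
}

func main() {
	buf := bufferPool.Get().(*bytes.Buffer)
	buf.Reset()
	buf.WriteString("payload")
	fmt.Println(buf.Len())
	// Put makes the buffer reusable, but nothing limits how many buffers the
	// pool can accumulate between collections.
	bufferPool.Put(buf)
}
```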
```diff
@@ -62,6 +63,8 @@ func newProducerCommand() *cobra.Command {
 		"Publish rate. Set to 0 to go unthrottled")
 	flags.IntVarP(&produceArgs.BatchingTimeMillis, "batching-time", "b", 1,
 		"Batching grouping time in millis")
+	flags.IntVarP(&produceArgs.BatchingMaxSize, "batching-max-size", "", 128,
+		"Max size of a batch (in KB)")
```
In pulsar-perf produce, --batch-max-messages has a short form and the default value shown below; can we consider keeping the two consistent?
-bm, --batch-max-messages
Maximum number of messages per batch
Default: 1000
These are 2 different settings: one is the maximum number of messages in a batch, the other is the maximum size of a batch.
Using the pool will not cause GC pressure by itself: it is actually there to avoid GC pressure. The pool having no maximum size is not a big problem either. The maximum amount of memory is still determined by the "pending" messages, whose payloads are buffered until they are acknowledged by the broker. This change uses a single pool so that each partition's pool no longer holds a few buffers that cannot be immediately reused elsewhere. After this change, the memory usage is the same as without the pooling.
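As a rough sketch of the idea (assumed structure, not the actual client code): every partition producer draws from one process-wide pool instead of keeping its own, so buffers left behind by an idle partition are immediately reusable by the others.

```go
package main

import (
	"bytes"
	"sync"
)

// A single process-wide pool shared by every partition producer.
var sharedBufferPool = sync.Pool{
	New: func() interface{} { return new(bytes.Buffer) },
}

type partitionProducer struct {
	partition int
}

// send serializes a message using a pooled buffer and returns the buffer to
// the shared pool once the data has been handed off (e.g. copied onto the
// wire or into the pending queue).
func (p *partitionProducer) send(payload []byte) {
	buf := sharedBufferPool.Get().(*bytes.Buffer)
	buf.Reset()
	buf.Write(payload)
	// ... write buf to the connection / pending queue ...
	sharedBufferPool.Put(buf)
}

func main() {
	producers := []*partitionProducer{{partition: 0}, {partition: 1}}
	for _, p := range producers {
		p.send([]byte("hello"))
	}
}
```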
LGTM +1
Motivation
When a producer is publishing on many partitions, maintaining a per-partition buffer pool can add significant memory overhead, while using a single shared buffer pool has no significant performance impact.