Fix overflowing batch size #1310

Merged
merged 2 commits into open-telemetry:master on Jul 14, 2020

Conversation

@pavolloffay (Member) commented Jul 9, 2020

Signed-off-by: Pavol Loffay ploffay@redhat.com

Description:

This PR adds a configuration property enforce_batch_size to the batch processor that ensures the batch size does not overflow the configured maximum batch size. The default value is false, so it does not change the default behavior.

If the incoming batch is bigger than the space remaining in the cached traces, it is split: one part fills the remaining space up to the maximum batch size, and the spans that would overflow the buffer are sent again as trace data over the channel.
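For illustration only, a minimal Go sketch of the splitting idea described above; the Span type, splitOverflow helper, and sizes are hypothetical stand-ins, not the collector's actual pdata-based implementation:

```go
package main

import "fmt"

// Span is a hypothetical placeholder for a span in a batch; the real processor
// works on the collector's internal trace data, not this type.
type Span struct{ Name string }

// splitOverflow splits the incoming spans so that at most `remaining` of them
// fit into the current batch; the rest are returned as overflow to be re-queued.
func splitOverflow(incoming []Span, remaining int) (fits, overflow []Span) {
	if remaining <= 0 {
		return nil, incoming
	}
	if len(incoming) <= remaining {
		return incoming, nil
	}
	return incoming[:remaining], incoming[remaining:]
}

func main() {
	const sendBatchSize = 3

	batch := []Span{{"a"}} // one span already cached
	incoming := []Span{{"b"}, {"c"}, {"d"}, {"e"}}

	remaining := sendBatchSize - len(batch)
	fits, overflow := splitOverflow(incoming, remaining)
	batch = append(batch, fits...)

	fmt.Printf("batched %d spans, re-queued %d spans\n", len(batch), len(overflow))
}
```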

Link to tracking Issue:

Resolves #1140
Related to #1020

Testing:

Documentation:

enforce_batch_size has been added to the README.

codecov bot commented Jul 9, 2020

Codecov Report

Merging #1310 into master will increase coverage by 0.03%.
The diff coverage is 97.95%.


@@            Coverage Diff             @@
##           master    #1310      +/-   ##
==========================================
+ Coverage   90.01%   90.05%   +0.03%     
==========================================
  Files         216      217       +1     
  Lines       15200    15244      +44     
==========================================
+ Hits        13683    13728      +45     
  Misses       1101     1101              
+ Partials      416      415       -1     
Impacted Files Coverage Δ
processor/batchprocessor/splittraces.go 97.05% <97.05%> (ø)
processor/batchprocessor/batch_processor.go 98.01% <100.00%> (+0.21%) ⬆️
translator/internaldata/resource_to_oc.go 86.04% <0.00%> (+2.32%) ⬆️


Legend:
Δ = absolute <relative> (impact), ø = not affected, ? = missing data

@dmitryax (Member) commented Jul 10, 2020

The original idea of the batcher is to batch traces together and to do it as performantly as possible. That's why it doesn't enforce any hard limit on the batch size. It might not be clearly described in the docs, but we have this:

- `send_batch_size` (default = 8192): Number of spans or metrics after which a batch will be sent.

I don't think we should add the complexity of enforcing a hard limit by default. That behavior might not be what users want, and it can degrade performance.

What do you think about keeping the existing behavior for the send_batch_size parameter (maybe making it clearer in the docs that it's not a hard limit) and adding another config parameter that would enforce the hard limit and cause traces to be split?
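For illustration, a hedged sketch of what such a configuration could look like as a Go struct; SendBatchSize mirrors the existing option, while the hard-limit field uses the enforce_batch_size name from this PR's description and may differ from the final change:

```go
package batchprocessor

// Config is a simplified sketch of the configuration discussed above, not the
// collector's actual Config struct; the hard-limit field name is an assumption.
type Config struct {
	// SendBatchSize is the number of spans or metrics after which a batch is
	// sent. As discussed, it stays a soft limit: a batch may exceed it.
	SendBatchSize uint32 `mapstructure:"send_batch_size"`

	// EnforceBatchSize (hypothetical name) would make SendBatchSize a hard
	// limit by splitting oversized batches before they are sent.
	EnforceBatchSize bool `mapstructure:"enforce_batch_size"`
}
```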

@pavolloffay (Member Author)

> What do you think about keeping the existing behavior for the send_batch_size parameter (maybe making it clearer in the docs that it's not a hard limit) and adding another config parameter that would enforce the hard limit and cause traces to be split?

It looks good to me

@pavolloffay (Member Author)

@dmitryax thanks for the feedback. I have made the changes and made the splitting configurable.

Collector automation moved this from In progress to Review in progress Jul 12, 2020
@@ -141,6 +143,16 @@ func (bp *batchTraceProcessor) startProcessingCycle() {
	for {
		select {
		case td := <-bp.newTraceItem:
			if bp.enforceBatchSize {
@bogdandrutu (Member) commented Jul 12, 2020

What about a simpler approach: add items to the batch as normal, then do a simple split by max size. It is maybe a bit more overhead, but the logic feels simpler.

@pavolloffay (Member Author)

That might be even better from the perf standpoint. The buckets of max size would probably also have to be split, because other items can arrive in the meantime (unless we push them directly).

@pavolloffay
Copy link
Member Author

@bogdandrutu I have rebased the PR and simplified it to do only a single split per consume.
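A rough sketch of that simplified flow, assuming a single split per consume as described above; the Span type and consume helper are placeholders rather than the actual batchTraceProcessor code:

```go
package main

import "fmt"

// Span is a hypothetical placeholder; the real processor operates on the
// collector's internal trace data.
type Span struct{ Name string }

// consume appends incoming spans to the pending batch and, when the batch
// exceeds maxSize, performs a single split per consume: the first maxSize
// spans are sent and the remainder stays pending for the next cycle.
func consume(pending, incoming []Span, maxSize int, send func([]Span)) []Span {
	pending = append(pending, incoming...)
	if len(pending) > maxSize {
		send(pending[:maxSize])
		pending = pending[maxSize:]
	}
	return pending
}

func main() {
	send := func(batch []Span) { fmt.Println("sent batch of", len(batch), "spans") }

	var pending []Span
	pending = consume(pending, []Span{{"a"}, {"b"}}, 3, send)        // no split yet
	pending = consume(pending, []Span{{"c"}, {"d"}, {"e"}}, 3, send) // one split, 3 spans sent
	fmt.Println(len(pending), "spans still pending")
}
```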

@bogdandrutu bogdandrutu self-requested a review July 13, 2020 22:23
Signed-off-by: Pavol Loffay <ploffay@redhat.com>
Signed-off-by: Pavol Loffay <ploffay@redhat.com>
Collector automation moved this from Review in progress to Reviewer approved Jul 14, 2020
@bogdandrutu bogdandrutu merged commit 5c7db8c into open-telemetry:master Jul 14, 2020
Collector automation moved this from Reviewer approved to Done Jul 14, 2020
MovieStoreGuy pushed a commit to atlassian-forks/opentelemetry-collector that referenced this pull request Nov 11, 2021
* Update Span End method documentation

Updates to the Span after End is called result in potentially
inconsistent views of the Span between the code defining it and the
ultimate receiver of the Span data. This corrects the documented
language of the API to prevent this from happening.

* Add changes to changelog
Successfully merging this pull request may close these issues.

Batch processor returns a bigger batch than configured