changefeedccl: add changefeed.emitted_batch_sizes
metric
#115537
Merged
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
This change introduces a new metric which is a histogram for emitted batch sizes. This metric will help debug any issues with batching when they arise.
This change also introduces a new histogram window type:
DataCount16MBuckets
. It has 24 buckets from 1 - 16M, exponentially distributed. Right now, the largest batch seen by changefeeds is 670k rows (parquet with 16MB file sizes). There was no previously existing histogram bucket which would be appropriate for measuring batches of this scale. It's possible to configure larger file sizes (ex. 128MB has been used by customers before). 16M is a generous upper bound to how large batches in changefeeds may be.Release note (ops change): This change introduces the
changefeed.emitted_batch_sizes
histogram metric which measures the batch sizes used when emitting data to sinks. This metric supports metrics labels.Closes: #114141