Optimize getContext and getContextAndTags #253
Conversation
It is not necessary to do multiple allocations and copying, single pass is enough.
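The single-pass idea can be sketched as follows: compute the exact final length first, then write everything into one pre-sized buffer. This is a minimal illustration, not the client's actual code; the separator constants and the `name:tags` layout are assumptions for the example.

```go
package main

import (
	"fmt"
	"strings"
)

// Hypothetical separators; the real client defines its own constants.
const (
	nameSeparatorSymbol = ":"
	tagSeparatorSymbol  = ","
)

// getContext joins a metric name and its tags into a single context key
// using one pre-sized allocation instead of repeated appends and copies.
func getContext(name string, tags []string) string {
	if len(tags) == 0 {
		return name + nameSeparatorSymbol
	}
	// Compute the exact final length up front so the builder
	// allocates its backing buffer exactly once.
	n := len(name) + len(nameSeparatorSymbol) + len(tagSeparatorSymbol)*(len(tags)-1)
	for _, t := range tags {
		n += len(t)
	}
	var b strings.Builder
	b.Grow(n)
	b.WriteString(name)
	b.WriteString(nameSeparatorSymbol)
	for i, t := range tags {
		if i > 0 {
			b.WriteString(tagSeparatorSymbol)
		}
		b.WriteString(t)
	}
	return b.String()
}

func main() {
	fmt.Println(getContext("http.requests", []string{"env:prod", "region:eu"}))
}
```

Because `b.Grow(n)` reserves the full capacity in advance, every `WriteString` appends into the same backing array and no intermediate slices are copied.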
Thanks a lot for this PR @martin-sucha!
I added a comment about an edge case. Besides this, it looks ready to be merged.
With zero tags, the code did two allocations instead of one: len(tagSeparatorSymbol)*(len(tags)-1) evaluated to a negative number, so the computed capacity was too small and the buffer had to grow once more. Thanks @hush-hush for pointing out this edge case in code review.
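The pitfall is easy to reproduce in isolation. The sketch below uses a hypothetical separator constant to show how the naive capacity formula goes negative for an empty tag slice, and how an early length check avoids it (the fix applied in this PR, per the surrounding discussion):

```go
package main

import "fmt"

// Hypothetical separator; the real client defines its own constant.
const tagSeparatorSymbol = ","

// naiveExtra is the unguarded capacity formula: with zero tags it
// returns a negative number, so a buffer sized with it is too small
// and a later append forces a second allocation.
func naiveExtra(tags []string) int {
	return len(tagSeparatorSymbol) * (len(tags) - 1)
}

// safeExtra guards the zero-tag case before applying the formula.
func safeExtra(tags []string) int {
	if len(tags) == 0 {
		return 0
	}
	return len(tagSeparatorSymbol) * (len(tags) - 1)
}

func main() {
	fmt.Println(naiveExtra(nil)) // -1: capacity underestimate
	fmt.Println(safeExtra(nil))  // 0: correct
}
```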
One last nit-pick
There is at least one tag because we checked for len(tags) == 0 in the beginning.
Thanks for the PR!
I'll merge and will release it with the next version of the client (hopefully next week).
I'm curious about your use case and what brought you to optimizing this part of the client. Would you mind sharing how many points per second you're sending, what type of metrics ...
We switched away from datadog-go several years ago to another statsd library because datadog-go did not have aggregation support at the time and was spending too much time sending packets. Now that datadog-go has aggregation support, I was checking whether we can switch back, since the other library does not support distribution metrics, which I'd like to use in some places.

As part of that experiment, I profiled both versions. As you can see in the image from the profiler in the original post, datadog-go accounted for about 2.2% of CPU time in the staging environment, and getContext/getContextAndTags was the majority of that. At the same time it was obvious from the flamegraph that the function could be optimized pretty easily. Now the profiler shows about 1.6% CPU for datadog-go. I also tried the Prometheus Go client (with counter vectors only) for comparison, and that is around 1.4% CPU, so it is much closer now.
In the staging environment where I tested this, there were about 42k metrics per second in one pod before aggregation (as shown by