Cache the result of toString in BigInteger #1228

aarya123 · 2020-02-18T23:10:50Z

I'm currently testing out the new Datadog profiling feature and am seeing a high number of MutableBigIntegers being allocated. Because these allocations come almost exclusively from toString(), I've created a simple wrapper that caches the result, which will (hopefully) reduce the overhead of logging the same trace id multiple times.
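
For illustration, a wrapper of the kind described above might look roughly like the following. This is a minimal sketch, not the code from this PR: the class name `CachedToStringBigInteger`, its single string constructor, and the demo class are hypothetical.

```java
import java.math.BigInteger;

// Hypothetical sketch: a BigInteger subclass that memoizes its decimal string.
class CachedToStringBigInteger extends BigInteger {
  private String cachedString; // null until toString() is first called

  CachedToStringBigInteger(String decimal) {
    super(decimal);
  }

  @Override
  public String toString() {
    String s = cachedString;
    if (s == null) {
      s = super.toString(); // the expensive conversion happens here
      cachedString = s;
    }
    return s;
  }
}

class CachedToStringDemo {
  public static void main(String[] args) {
    BigInteger traceId = new CachedToStringBigInteger("1234567890123456789");
    String first = traceId.toString();
    String second = traceId.toString();
    // The second call returns the cached instance instead of converting again.
    System.out.println(first == second);
  }
}
```

Since the subclass is still a `BigInteger`, existing callers that only need the numeric value should not have to change.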

devinsba · 2020-02-19T11:39:54Z

Hi, thanks for the contribution!

Did you consider, instead of creating a new type, adding the caching to DDSpanContext directly?

Here:

dd-trace-java/dd-trace-ot/src/main/java/datadog/opentracing/DDSpanContext.java

Line 136 in fc6c327

public String toTraceId() {

And here:

dd-trace-java/dd-trace-ot/src/main/java/datadog/opentracing/DDSpanContext.java

Line 149 in fc6c327

public String toSpanId() {

aarya123 · 2020-02-19T17:52:59Z

Unfortunately, I don't think that would be a thorough enough fix. At the end of the day, the problem may lie not only with users of the tracing API library but with the implementors of integrations as well! There are many places where a span is cast to a DDSpan and getTraceId() is then called, which isn't inherently wrong, since in some contexts you do need the BigInteger for things like sampling. But there are quite a few places that call toString() on that object, whether implicitly or explicitly.
The type that the TraceId and SpanId use is an implementation detail, and it seems more encompassing to me to make the change there, rather than in the interfaces that OpenTracing itself exposes to users, so that both contributors and users of this agent are protected.
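
To illustrate the point about implicit and explicit toString() calls, here is a sketch that assumes a hypothetical `CountingCachedId` subclass (the `conversions` counter exists only to make the effect visible and is not part of this PR). Because the caching lives in the ID type itself, string concatenation and `String.valueOf` benefit along with direct calls, while the value stays usable as a `BigInteger` for numeric work.

```java
import java.math.BigInteger;

// Hypothetical ID type for illustration; not the class added in this PR.
class CountingCachedId extends BigInteger {
  private String cachedString;
  int conversions; // illustration only: counts real BigInteger-to-String conversions

  CountingCachedId(String decimal) {
    super(decimal);
  }

  @Override
  public String toString() {
    String s = cachedString;
    if (s == null) {
      conversions++;
      s = super.toString();
      cachedString = s;
    }
    return s;
  }
}

class ImplicitToStringDemo {
  public static void main(String[] args) {
    CountingCachedId traceId = new CountingCachedId("1234567890123456789");

    String explicit = traceId.toString();        // explicit call
    String implicit = "trace_id=" + traceId;     // concatenation calls toString()
    String viaValueOf = String.valueOf(traceId); // String.valueOf(Object) calls toString()

    // Numeric use (for example, something sampling-like) is unaffected.
    BigInteger bucket = traceId.mod(BigInteger.valueOf(100));

    System.out.println(implicit);
    System.out.println("conversions: " + traceId.conversions);
    System.out.println("bucket: " + bucket);
  }
}
```

All three string paths go through the same overridden method, so only the first one pays for the conversion.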

tylerbenson · 2020-02-20T16:26:59Z

Caused by: org.gradle.api.GradleException: Rule violated for class datadog.opentracing.StringCachingBigInteger: instructions covered ratio is 0.4, but expected minimum is 0.6

Looks like JaCoCo has an issue with the new class's test coverage. You can add it to the ignore list here: https://github.com/DataDog/dd-trace-java/blob/master/dd-trace-ot/dd-trace-ot.gradle#L13

tylerbenson · 2020-02-20T16:28:53Z

dd-trace-ot/src/main/java/datadog/opentracing/StringCachingBigInteger.java

+ */
+public class StringCachingBigInteger extends BigInteger {
+
+  private String cachedString;


Technically this isn't threadsafe, but in practice I don't think it matters.

dougqh · 2020-02-20

Yes, in this case, it will work, since the calculation is idempotent.
Each thread will eventually make the null -> non-null transition.
This is the same basic strategy employed by String for the hashCode calculation.

The downside is a potential for a bit of extra allocation compared to the "ideal", but that's a reasonable trade-off.
This will already save a great deal on allocation and keeping the coordination overhead down is also important.

It could be made volatile, but we'd still have the same fundamental race -- so I think this is good as is.

aarya123 · 2020-02-20 (edited)

"keeping the coordination overhead down is also important."

Strongly agreed on this point. In a massively parallel/concurrent environment, the overhead of locking this value would not be worth it.
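
The benign race discussed in this thread can be sketched as follows. This is an illustration under stated assumptions rather than the merged code: `RacyCachedId` and `RacyCacheDemo` are hypothetical names. The field is read once into a local, and because `String` is immutable with final fields, a thread that sees a non-null reference should see a fully constructed value. Threads that lose the race simply compute an equal string again, which is the "bit of extra allocation" mentioned above.

```java
import java.math.BigInteger;
import java.util.Set;
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.CountDownLatch;

// Hypothetical class for illustration; not the code merged in this PR.
class RacyCachedId extends BigInteger {
  // Deliberately neither volatile nor guarded by a lock.
  private String cachedString;

  RacyCachedId(String decimal) {
    super(decimal);
  }

  @Override
  public String toString() {
    String s = cachedString; // read the field once into a local
    if (s == null) {
      s = super.toString();  // idempotent: every thread computes an equal value
      cachedString = s;      // racing writers store equal strings, so any winner is fine
    }
    return s;                // return the local, never a second read of the field
  }
}

class RacyCacheDemo {
  // Calls toString() from several threads at once and collects the distinct results.
  static Set<String> collectConcurrently(BigInteger id, int threads) {
    Set<String> seen = ConcurrentHashMap.newKeySet();
    CountDownLatch start = new CountDownLatch(1);
    Thread[] workers = new Thread[threads];
    for (int i = 0; i < threads; i++) {
      workers[i] = new Thread(() -> {
        try {
          start.await(); // line the threads up to encourage contention
          seen.add(id.toString());
        } catch (InterruptedException e) {
          Thread.currentThread().interrupt();
        }
      });
      workers[i].start();
    }
    start.countDown();
    for (Thread worker : workers) {
      try {
        worker.join();
      } catch (InterruptedException e) {
        Thread.currentThread().interrupt();
        throw new IllegalStateException("interrupted while waiting for workers", e);
      }
    }
    return seen;
  }

  public static void main(String[] args) {
    Set<String> seen = collectConcurrently(new RacyCachedId("987654321987654321"), 8);
    System.out.println(seen); // one distinct value, however the race resolved
  }
}
```

Returning the local variable rather than re-reading the field is what keeps the method from ever returning null.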

Cache the result of toString in BigInteger

3c6b840

aarya123 requested a review from a team as a code owner February 18, 2020 23:10

Anubhaw Arya added 2 commits February 18, 2020 15:20

java 8

4ee4688

formatting

fc6c327

tylerbenson reviewed Feb 20, 2020

Ignores for coverage

17b4fae

tylerbenson approved these changes Feb 21, 2020

tylerbenson added the tag: community and tag: performance labels Feb 21, 2020

tylerbenson merged commit b805bf5 into DataDog:master Feb 21, 2020

randomanderson added this to the 0.44.0 milestone Feb 26, 2020

aarya123 deleted the cachingBigInteger branch February 27, 2020 21:06

aarya123 mentioned this pull request Apr 23, 2020

Http Trace and Span ID should be StringCachingBigInteger #1397

Merged

tylerbenson Feb 20, 2020

dougqh Feb 20, 2020

aarya123 Feb 20, 2020 (edited)
