Create a new SupportabilityMetrics class rather than using alpha Metrics #2353

jkwatson · 2021-02-19T23:04:31Z

No description provided.

anuraaga · 2021-02-20T01:31:16Z

...i/src/main/java/io/opentelemetry/instrumentation/api/tracer/utils/SupportabilityMetrics.java

+  private final boolean agentDebugEnabled;
+  private final Consumer<String> reporter;
+
+  private final ConcurrentMap<String, EnumMap<SpanKind, AtomicInteger>> suppressionCounters =


Let's use LongAdder instead of AtomicInteger.
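For reference, a minimal sketch of what a LongAdder-backed counter could look like. The class name `SuppressionCounter` and its methods are hypothetical and are not taken from this PR.

```java
import java.util.concurrent.atomic.LongAdder;

// Hypothetical helper for illustration only; not the class from the PR.
final class SuppressionCounter {
  // LongAdder spreads contended updates across internal cells, so heavily
  // contended increments tend to scale better than a single AtomicInteger.
  private final LongAdder count = new LongAdder();

  void record() {
    count.increment();
  }

  // Returns the current total and resets it. Increments that race with this
  // call may be counted in this report or in the next one.
  long drain() {
    return count.sumThenReset();
  }
}
```

The trade-off is that `sum()` and `sumThenReset()` are not atomic snapshots, which seems acceptable for periodically reported debug counters.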

anuraaga · 2021-02-20T01:32:09Z

...i/src/main/java/io/opentelemetry/instrumentation/api/tracer/utils/SupportabilityMetrics.java

+      return;
+    }
+    // note: there's definitely a race here, but since this is just debug information, I think
+    // we can live with the possibility that we might lose a count or two.


Is the race caused by the use of EnumMap? It looks easy enough for us to define a class with a final field per kind to avoid the worry.

No, it's due to the two-phase lookup.

Since both lookups are computeIfAbsent, it seems like it would be OK if the second map were also a concurrent map. But we can use a class instead.

We could also just create a single-level map with a String key that concatenates the bits.

I'll play around with a class that holds a LongAdder for each kind, although that feels a bit wasteful since a given instrumentation almost always creates only a single kind of span.

In the light of day, after some more thinking, I think you're right. EnumMap's lack of thread safety makes this worse than a race: the values might not even be visible across threads.

So, some possible solutions are:

1. a compound key class (or string concatenation) on a single concurrent map
2. a thread-safe class as the value of a single-level map
3. two levels of concurrent map with LongAdders as the leaves

Since this is only doing anything if you turn on agent debug, it probably doesn't matter all that much which we choose, although it would be great if we could have an efficient solution that could be always on, even in production (maybe it only dumps out the metrics on demand or something, rather than every n seconds).

I've been doing some research, mostly about LongAdder. It looks like all of the heap memory used by a LongAdder is completely lazily initialized. So, having a few extra LongAdders around that are unused is actually very cheap. I'm thinking that choosing the 2nd option is probably the simplest and most efficient, both from a correctness and a memory perspective.
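A rough sketch of the "thread-safe class as the value of a single-level map" approach described above, under stated assumptions: the `SpanKind` enum here is a local stand-in so the snippet compiles on its own, and the names `KindCounters`, `SuppressionCounts`, `recordSuppressedSpan` and `drain` are hypothetical rather than the names used in the PR.

```java
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.ConcurrentMap;
import java.util.concurrent.atomic.LongAdder;

// Local stand-in for the OpenTelemetry SpanKind enum (assumption for this sketch).
enum SpanKind { INTERNAL, SERVER, CLIENT, PRODUCER, CONSUMER }

// Thread-safe value class: one final LongAdder per span kind.
final class KindCounters {
  private final LongAdder internal = new LongAdder();
  private final LongAdder server = new LongAdder();
  private final LongAdder client = new LongAdder();
  private final LongAdder producer = new LongAdder();
  private final LongAdder consumer = new LongAdder();

  void increment(SpanKind kind) {
    adderFor(kind).increment();
  }

  long sumThenReset(SpanKind kind) {
    return adderFor(kind).sumThenReset();
  }

  private LongAdder adderFor(SpanKind kind) {
    switch (kind) {
      case INTERNAL: return internal;
      case SERVER: return server;
      case CLIENT: return client;
      case PRODUCER: return producer;
      case CONSUMER: return consumer;
      default: throw new IllegalArgumentException("unknown kind: " + kind);
    }
  }
}

// Single-level concurrent map keyed by instrumentation name.
final class SuppressionCounts {
  private final ConcurrentMap<String, KindCounters> counters = new ConcurrentHashMap<>();

  void recordSuppressedSpan(SpanKind kind, String instrumentationName) {
    // computeIfAbsent is atomic on ConcurrentHashMap, so there is only one
    // lookup phase and no unsynchronized inner map.
    counters.computeIfAbsent(instrumentationName, name -> new KindCounters()).increment(kind);
  }

  long drain(String instrumentationName, SpanKind kind) {
    KindCounters forName = counters.get(instrumentationName);
    return forName == null ? 0L : forName.sumThenReset(kind);
  }
}
```

Because every field is final and every mutation goes through a LongAdder or the ConcurrentHashMap, the sketch avoids the visibility problem of a plain EnumMap.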

I think you could also use an EnumMap<SpanKind, ConcurrentHashMap<String, LongAdder>> prepopulated with empty maps for all 5 enum values - this way the enum map would effectively be read-only and could be set in the constructor.

But to be honest I prefer your option 2 😄
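For comparison, the prepopulated EnumMap alternative mentioned in this comment might be sketched as follows. This is an illustration only: `SpanKind` is again a local stand-in enum, and `PerKindSuppressionCounts`, `record` and `sum` are hypothetical names.

```java
import java.util.Collections;
import java.util.EnumMap;
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.ConcurrentMap;
import java.util.concurrent.atomic.LongAdder;

// Local stand-in for the OpenTelemetry SpanKind enum (assumption for this sketch).
enum SpanKind { INTERNAL, SERVER, CLIENT, PRODUCER, CONSUMER }

final class PerKindSuppressionCounts {
  // Fully populated in the constructor and never modified afterwards, so the
  // EnumMap is only ever read once the object has been published.
  private final Map<SpanKind, ConcurrentMap<String, LongAdder>> countersByKind;

  PerKindSuppressionCounts() {
    EnumMap<SpanKind, ConcurrentMap<String, LongAdder>> map = new EnumMap<>(SpanKind.class);
    for (SpanKind kind : SpanKind.values()) {
      map.put(kind, new ConcurrentHashMap<>());
    }
    this.countersByKind = Collections.unmodifiableMap(map);
  }

  void record(SpanKind kind, String instrumentationName) {
    countersByKind
        .get(kind)
        .computeIfAbsent(instrumentationName, name -> new LongAdder())
        .increment();
  }

  long sum(SpanKind kind, String instrumentationName) {
    LongAdder adder = countersByKind.get(kind).get(instrumentationName);
    return adder == null ? 0L : adder.sum();
  }
}
```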

Done. Option 2 has been implemented.

mateuszrzeszutek · 2021-02-22T12:14:29Z

...i/src/main/java/io/opentelemetry/instrumentation/api/tracer/utils/SupportabilityMetrics.java

+
+  public SupportabilityMetrics start() {
+    if (agentDebugEnabled) {
+      Executors.newScheduledThreadPool(1).scheduleAtFixedRate(this::report, 5, 5, TimeUnit.SECONDS);


Shouldn't we use a daemon thread for things like that? There's a DaemonThreadFactory in javaagent-tooling; with some minor changes it could be moved to instrumentation-api and used here.

Easy enough to just inline one here that sets the daemon flag. Will do.

good catch 👍
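A minimal sketch of the inlined daemon thread factory discussed in this thread, assuming a plain `ScheduledExecutorService`. The class name `DaemonReporter`, the method `startReporting` and the thread name are hypothetical and may differ from what was merged.

```java
import java.util.concurrent.Executors;
import java.util.concurrent.ScheduledExecutorService;
import java.util.concurrent.TimeUnit;

// Hypothetical helper for illustration only.
final class DaemonReporter {
  static ScheduledExecutorService startReporting(Runnable report, long period, TimeUnit unit) {
    ScheduledExecutorService executor =
        Executors.newScheduledThreadPool(
            1,
            runnable -> {
              // The runnable must be passed to the Thread, otherwise the
              // scheduled task never runs on it.
              Thread thread = new Thread(runnable, "supportability_metrics_reporter");
              // A daemon thread does not keep the JVM alive at shutdown.
              thread.setDaemon(true);
              return thread;
            });
    executor.scheduleAtFixedRate(report, period, period, unit);
    return executor;
  }
}
```

With the default thread factory the pool thread is a non-daemon thread, so an application that otherwise finished its work could be prevented from exiting by the reporter.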



anuraaga

Cool

trask

👍

jkwatson requested review from anuraaga, iNikem, mateuszrzeszutek, pavolloffay, trask and tylerbenson as code owners February 19, 2021 23:04

anuraaga reviewed Feb 20, 2021

mateuszrzeszutek reviewed Feb 22, 2021

mateuszrzeszutek approved these changes Feb 22, 2021

jkwatson added 5 commits February 22, 2021 08:55

Create a new SupportabilityMetrics class rather than using alpha Metrics.

34d0938

remove metrics dependency; add a test for disabled supportability

950d3ce

formatting

d3b5f70

change the implementation to be thread-safe and use a daemon thread for the reporter

d1613c2

actually wire the runnable to the thread

92deaf9

jkwatson force-pushed the supportability branch from d9f20ec to 92deaf9 Compare February 22, 2021 17:42

anuraaga approved these changes Feb 23, 2021

mateuszrzeszutek approved these changes Feb 23, 2021

trask approved these changes Feb 24, 2021

trask merged commit 57185f5 into open-telemetry:main Feb 24, 2021

jkwatson deleted the supportability branch February 24, 2021 20:35
