Remove `thread.name` from metrics #14061

jhayes2-chwy · 2025-06-18T15:29:46Z

As outlined in #13407 and #14047, the thread.name attribute can create very high cardinality in many cases, and also contributes to a memory leak in the collection mechanism of those metrics. This PR is aimed at fixing both issues.

The affected metrics are:

jvm.memory.allocation
jvm.cpu.longlock
jvm.network.io
jvm.network.time

See: [open-telemetry#13407](open-telemetry#13407)

See: [open-telemetry#14047](open-telemetry#14047)

…stances See: [open-telemetry#14047](open-telemetry#14047)

linux-foundation-easycla · 2025-06-18T15:29:51Z

✅login: jhayes2-chwy / (1106d44)
✅login: jhayes2-chwy / (1106d44, a8508fe)
✅login: jhayes2-chwy / (1106d44, a8508fe, 6f666c2)
✅login: jhayes2-chwy / (1106d44, a8508fe, 6f666c2, b762c33)
✅login: jhayes2-chwy / (1106d44, a8508fe, 6f666c2, b762c33, f9b198d)
✅login: jhayes2-chwy / (1106d44, a8508fe, 6f666c2, b762c33, f9b198d, 960e102)

The committers listed above are authorized under a signed CLA.

See: [open-telemetry#14047](open-telemetry#14047)

trask · 2025-06-18T16:06:15Z

...lemetry/instrumentation/runtimemetrics/java17/internal/AbstractThreadDispatchingHandler.java

+  // Use an access-ordered LinkedHashMap so we get a bounded LRU cache
+  private final Map<String, Consumer<RecordedEvent>> perThread =
+      new LinkedHashMap<String, Consumer<RecordedEvent>>(16, 0.75F, true) {
+        @Override
+        protected boolean removeEldestEntry(Map.Entry<String, Consumer<RecordedEvent>> eldest) {
+          // Bound this map to prevent memory leaks with fast-cycling thread frameworks
+          return size() > 512;
+        }
+      };


is this map needed now that the thread name isn't being on the metrics?

Certainly not for correctness reasons. I didn't see any explicit documentation around this map, but after reading through the code my impression was this was mostly a performance optimization to reduce allocations of Consumer<RecordedEvent> instances.

Since this was previously using an unsynchronized hashmap, it does appear to me that the invocation of these consumers is all single-threaded (I haven't worked directly with JFR before, so maybe that's not true?); it smells to me like there's no contention or throughput reasons to have this cache other than to reduce allocations.

If that jives with your understanding, I could simply remove the cache. While all the little allocations of the PerThread*Handler inner classes probably aren't a problem for the use-cases I'm coming from (high-scale ecommerce), I imagine there are definitely existing OTEL use-cases where it would be, especially on older and/or smaller JVMs.

With that in mind, I'd actually prefer to jump all the way to inlining the PerThread*Handler-based logic directly into the AbstractThreadDispatchingHandler-subclasses.

With that in mind, I'd actually prefer to jump all the way to inlining the PerThread*Handler-based logic directly into the AbstractThreadDispatchingHandler-subclasses.

I've gone ahead and done this, should be easy to revert if needed.

…r it See: [open-telemetry#14047](open-telemetry#14047)

laurit · 2025-06-19T07:02:54Z

...main/java/io/opentelemetry/instrumentation/runtimemetrics/java17/internal/ThreadGrouper.java


-  // FIXME doesn't actually do any grouping, but should be safe for now
+  // FIXME only handles substrings of contiguous digits -> a single `x`, but should be good
+  // enough for now
  @Nullable
  public String groupedName(RecordedEvent ev) {


is this used anywhere now that AbstractThreadDispatchingHandler was deleted?

Good catch, addressed.

laurit · 2025-06-19T07:04:16Z

@jhayes2-chwy you need to sign the CLA in order to get the PR merged

jhayes2-chwy · 2025-06-19T13:40:56Z

@jhayes2-chwy you need to sign the CLA in order to get the PR merged

Indeed; I've been working with my company to determine if we have a Corporate CLA, so I'm still waiting on that.

See: [open-telemetry#14047](open-telemetry#14047)

jhayes2-chwy added 3 commits June 18, 2025 11:07

feat: remove thread.name from attributes

1106d44

See: [open-telemetry#13407](open-telemetry#13407)

feat: improve ThreadGrouper grouping logic

a8508fe

See: [open-telemetry#14047](open-telemetry#14047)

fix: switch to a bounded LRU cache for the Consumer<RecordedEvent> in…

6f666c2

…stances See: [open-telemetry#14047](open-telemetry#14047)

jhayes2-chwy requested a review from a team as a code owner June 18, 2025 15:29

feat: remove unnecessary synchronization

b762c33

See: [open-telemetry#14047](open-telemetry#14047)

trask reviewed Jun 18, 2025

View reviewed changes

refactor: remove the cache entirely, and inline the plumbing built fo…

f9b198d

…r it See: [open-telemetry#14047](open-telemetry#14047)

laurit reviewed Jun 19, 2025

View reviewed changes

refactor: remove ThreadGrouper as well, since it is now unused

960e102

See: [open-telemetry#14047](open-telemetry#14047)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Remove `thread.name` from metrics #14061

Remove `thread.name` from metrics #14061

jhayes2-chwy commented Jun 18, 2025

Uh oh!

linux-foundation-easycla bot commented Jun 18, 2025 •

edited

Loading

Uh oh!

trask Jun 18, 2025

Uh oh!

jhayes2-chwy Jun 18, 2025

Uh oh!

jhayes2-chwy Jun 18, 2025 •

edited

Loading

Uh oh!

jhayes2-chwy Jun 18, 2025

Uh oh!

laurit Jun 19, 2025

Uh oh!

jhayes2-chwy Jun 19, 2025 •

edited

Loading

Uh oh!

laurit commented Jun 19, 2025

Uh oh!

jhayes2-chwy commented Jun 19, 2025

Uh oh!

Uh oh!

Remove thread.name from metrics #14061

Are you sure you want to change the base?

Remove thread.name from metrics #14061

Conversation

jhayes2-chwy commented Jun 18, 2025

Uh oh!

linux-foundation-easycla bot commented Jun 18, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

trask Jun 18, 2025

Choose a reason for hiding this comment

Uh oh!

jhayes2-chwy Jun 18, 2025

Choose a reason for hiding this comment

Uh oh!

jhayes2-chwy Jun 18, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

jhayes2-chwy Jun 18, 2025

Choose a reason for hiding this comment

Uh oh!

laurit Jun 19, 2025

Choose a reason for hiding this comment

Uh oh!

jhayes2-chwy Jun 19, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

laurit commented Jun 19, 2025

Uh oh!

jhayes2-chwy commented Jun 19, 2025

Uh oh!

Uh oh!

Remove `thread.name` from metrics #14061

Remove `thread.name` from metrics #14061

linux-foundation-easycla bot commented Jun 18, 2025 •

edited

Loading

jhayes2-chwy Jun 18, 2025 •

edited

Loading

jhayes2-chwy Jun 19, 2025 •

edited

Loading