
Unsafe convergent cache for traces #1101

Closed
RaasAhsan wants to merge 4 commits into typelevel:series/2.x from RaasAhsan:feature/unsafe-trace-cache

Conversation


@RaasAhsan RaasAhsan commented Aug 19, 2020

This should be the last PR to have tracing ready for 2.2.0. The current version of tracing relies on ConcurrentHashMap to keep a cache of traces. As described in #1076, ConcurrentHashMap offers thread safety at the cost of synchronization via read barriers on the hot read path. Because the cache will eventually converge to an optimal set, it's OK if we don't have safe publication (different threads may not see the same cache), as long as the objects we do see are all safely initialized. Safe initialization means that the values an object initializes in its constructor are visible to any thread that observes a shared reference to the object. This property is implicitly guaranteed by the usual synchronization primitives that offer thread safety.

The Java memory model offers another method of achieving safe initialization that is much cheaper than synchronization: final fields. The semantics of final fields dictate that as long as `this` doesn't escape during the constructor of an object, and its final fields are set before the constructor completes, then any thread that observes a shared reference to the initialized object will observe fully initialized values for its final fields, and that guarantee extends transitively.

We leverage these semantics to build a thread-unsafe, convergent cache that achieves a higher read throughput than ConcurrentHashMap.
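To make the idea concrete, here is a minimal sketch of the technique (names like `ConvergentCache` and `Node` are invented for illustration, not the PR's actual code): entries are immutable nodes whose fields are final, the backing array is a plain non-volatile field, and writers simply overwrite slots. A racing reader may miss a recent write, but any node it does observe is fully initialized.

```java
// Hypothetical sketch of an unsafe convergent cache.
// Thread-unsafe by design: no volatile, no locks, no CAS on the read path.
final class ConvergentCache<K, V> {

    // Immutable entry: final fields give safe initialization under the JMM,
    // provided `this` does not escape the constructor.
    static final class Node<K, V> {
        final K key;
        final V value;
        Node(K key, V value) { this.key = key; this.value = value; }
    }

    private Node<K, V>[] table; // deliberately plain, not volatile
    private final int mask;

    @SuppressWarnings("unchecked")
    ConvergentCache(int size) { // size must be a power of two
        this.table = (Node<K, V>[]) new Node[size];
        this.mask = size - 1;
    }

    V get(K key) {
        // Plain read: may observe a stale or empty slot, but never a
        // partially constructed Node, thanks to final-field semantics.
        Node<K, V> n = table[key.hashCode() & mask];
        return (n != null && n.key.equals(key)) ? n.value : null;
    }

    void put(K key, V value) {
        // Plain write: last writer wins; a collision simply overwrites.
        table[key.hashCode() & mask] = new Node<>(key, value);
    }
}
```

Because readers tolerate stale views, different threads may transiently see different cache contents, but all threads converge on the same entries as writes propagate, which is exactly the trade-off the description above relies on.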

TODO:

  • Tests
  • Benchmarks
  • Detailed documentation about the memory model guarantees we are leveraging here
  • Collisions beneath mask
  • Max buffer sizes
  • Rewrite the implementation in Scala, verifying field modifiers

Closes #1076.

```java
    return this.array[hash];
}

public Buffer grow() {
}
```

@RaasAhsan (Author) commented on this diff:

TODO: pass the key value pair that caused the original collision here to insert

```java
public Node<K, V> put(K k, V v) {
    int hash = k.hashCode() & this.mask;
```

@RaasAhsan (Author) commented on this diff:

CHM and HM have another way of generating the hash, will look into that
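For reference, the bit-mixing step that HashMap applies before masking can be sketched as below (ConcurrentHashMap's `spread` does the same XOR and additionally clears the sign bit); the class name `HashMix` is invented here for illustration:

```java
public class HashMix {
    // XOR the high 16 bits into the low 16 bits so that a power-of-two
    // mask does not throw away the upper half of the hash. This mirrors
    // java.util.HashMap.hash(Object).
    static int spread(int h) {
        return h ^ (h >>> 16);
    }

    public static void main(String[] args) {
        int mask = 15; // table of size 16
        // Two hashes that differ only in the high bits collide under plain
        // masking, but spread() separates them.
        System.out.println((spread(0x10000) & mask) + " " + (spread(0x20000) & mask));
    }
}
```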

@retronym

Storing strong references to the lambda classes as the keys of the frame cache smells like a potential classloader leak when the lambda class is loaded from a classloader with a shorter lifecycle than cats-effect's classloader.

java.lang.ClassValue is designed to avoid such leaks. It's also designed to be extremely fast for lookups.
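For readers unfamiliar with it, a minimal `ClassValue` sketch looks like this (class and field names invented for illustration); the JVM computes the value at most once per class and caches it in a structure that does not keep the class, or its loader, alive:

```java
import java.util.concurrent.atomic.AtomicInteger;

public class ClassValueDemo {
    static final AtomicInteger computations = new AtomicInteger();

    // computeValue runs lazily, at most once per Class (modulo races,
    // in which case one winner is kept); subsequent get() calls hit
    // the JVM-managed cache.
    static final ClassValue<String> NAME_CACHE = new ClassValue<String>() {
        @Override
        protected String computeValue(Class<?> type) {
            computations.incrementAndGet();
            return type.getSimpleName();
        }
    };

    public static void main(String[] args) {
        String first = NAME_CACHE.get(String.class);
        String second = NAME_CACHE.get(String.class); // cached, no recompute
        System.out.println(first + " " + second + " " + computations.get());
    }
}
```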

@djspiewak
Member

> java.lang.ClassValue is designed to avoid such leaks. It's also designed to be extremely fast for lookups.

TIL!

@RaasAhsan
Author

> java.lang.ClassValue is designed to avoid such leaks. It's also designed to be extremely fast for lookups.

Didn't know that either, thanks! We've also noticed that Class.hashCode is a pretty costly operation, would be awesome if this could make that cost cheaper

@retronym

> Didn't know that either, thanks! We've also noticed that Class.hashCode is a pretty costly operation, would be awesome if this could make that cost cheaper

That doesn't sound right to me: it statically resolves to Object.hashCode, which the JIT treats as an intrinsic, and it amounts to a read of the identity hash code, which is computed once and stored in the object header. Profilers can sometimes give misleading answers here due to the safepoint-bias problem; try async-profiler and/or Java Flight Recorder with -XX:+DebugNonSafepoints. It's also a good idea to set up a microbenchmark in JMH to investigate these questions (JMH now integrates with async-profiler and JFR via -prof async and -prof jfr).

@RaasAhsan
Author

I'll take a look at those! We did a bit of digging and arrived at the same conclusion about the hashCode being cached in the object header. The aforementioned cost was relative to its absence. I have a JMH benchmark sitting around for a simpler version of what's in this PR, and keying the array by Class.hashCode as opposed to arbitrary integers lowered throughput by 50%, which I thought was odd too

@RaasAhsan
Author

I forgot to mention, the likely implementation for this cache actually won't retain any class references. We're acknowledging that tracing is an imprecise approximation, so in order to speed up reads, we'll only keep one value per array index. This means that we're effectively ignoring collisions, and to lower collision probability, we can grow the size of the array.
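A key-free, one-slot-per-index cache as described above might look like the following sketch (the name `SlotCache` is invented here; this is not the PR's code). Only the value is stored per slot, so no Class reference is retained, and a colliding write silently evicts the previous occupant, which is acceptable because tracing is an approximation anyway:

```java
// Hypothetical sketch: a best-effort cache keyed only by an int hash.
final class SlotCache<V> {
    private final Object[] slots;
    private final int mask;

    SlotCache(int size) { // size must be a power of two
        this.slots = new Object[size];
        this.mask = size - 1;
    }

    @SuppressWarnings("unchecked")
    V get(int hash) {
        // May return a value cached for a different key after a collision;
        // growing the array lowers the collision probability.
        return (V) slots[hash & mask];
    }

    void put(int hash, V value) {
        slots[hash & mask] = value; // collisions overwrite, never chain
    }
}
```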

@RaasAhsan
Author

Going to do this on CE3

@RaasAhsan RaasAhsan closed this Feb 8, 2021


Development

Successfully merging this pull request may close these issues.

Replace CHM in tracing cache with thread unsafe hash table

3 participants