Handle lazily/eagerly started coroutines differently #3

monosoul · 2023-01-08T21:27:45Z

This changes the way we capture the active scope.

For eagerly started coroutines, the scope will be captured on instantiation.
For lazily started coroutines, the scope will be captured on start.
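
To make the difference concrete, here is a minimal sketch of the two capture points. It is not code from this PR: `ActiveScopeSketch`, `TracedCoroutineSketch` and `CaptureTimingDemo` are hypothetical stand-ins, and the "scope" is just a string held in a thread-local.

```java
/** Hypothetical stand-in for the tracer's per-thread active scope; not the dd-trace-java API. */
final class ActiveScopeSketch {
  private static final ThreadLocal<String> CURRENT = new ThreadLocal<>();

  private ActiveScopeSketch() {}

  static String get() {
    return CURRENT.get();
  }

  static void set(String scopeName) {
    CURRENT.set(scopeName);
  }
}

/** Hypothetical model of a coroutine that remembers which scope it captured. */
final class TracedCoroutineSketch {
  private final boolean lazy;
  private boolean started;
  private String capturedScope;

  TracedCoroutineSketch(boolean lazy) {
    this.lazy = lazy;
    if (!lazy) {
      // Eager start: capture whatever scope is active at instantiation.
      capturedScope = ActiveScopeSketch.get();
    }
  }

  void start() {
    if (started) {
      return; // starting twice must not re-capture
    }
    started = true;
    if (lazy) {
      // Lazy start: capture whatever scope is active when start() is called.
      capturedScope = ActiveScopeSketch.get();
    }
  }

  String capturedScope() {
    return capturedScope;
  }
}

final class CaptureTimingDemo {
  /** Creates a coroutine while scope "a" is active and starts it while scope "b" is active. */
  static String capturedFor(boolean lazy) {
    ActiveScopeSketch.set("a");
    TracedCoroutineSketch coroutine = new TracedCoroutineSketch(lazy);
    ActiveScopeSketch.set("b");
    coroutine.start();
    return coroutine.capturedScope();
  }

  public static void main(String[] args) {
    System.out.println("eager captured: " + capturedFor(false));
    System.out.println("lazy captured:  " + capturedFor(true));
  }
}
```

In this model the eager coroutine ends up with scope "a" and the lazy one with scope "b", which is the behavioral difference discussed in the comments that follow.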

This doesn't yet include the changes made to ScopeState in 542e6a8, but I'll merge those in if you think this approach is fine.

@bantonsson

Signed-off-by: monosoul <Kloz.Klaud@gmail.com>

This reverts commit c31c62d.

bantonsson · 2023-01-16T15:58:12Z

I'm not completely sure whether the difference in behavior will be confusing or whether this is more correct.

monosoul · 2023-01-16T20:39:42Z

@bantonsson Tbh, since there is no guarantee that a lazily started coroutine will ever be started or cancelled, I don't think there is much we can do about it. Or rather, there might be solutions, but they are going to be complicated. I would personally rather have predictable behavior that I might not be very happy about than behavior that works in some cases and not in others. And that could be delivered sooner 🙂
In my 4 years of writing backend apps in Kotlin I have never used lazily started coroutines, tbh. Maybe this use case is more relevant for Android apps, which aren't going to use a Java agent anyway.
So I suggest going with this approach as an MVP and seeing whether users are happy with it. If they aren't, we can think of a way to make the behavior consistent across the different start types.

Upd: just read your message that it's okay to capture a span and never start the coroutine. So I guess this can be disregarded. 🙂

bantonsson

Thanks for the great work. Only some minor comments.

bantonsson · 2023-01-24T09:57:13Z

...n/java/datadog/trace/instrumentation/kotlin/coroutines/AbstractCoroutineInstrumentation.java

+    implements Instrumenter.ForTypeHierarchy {
+
+  public AbstractCoroutineInstrumentation() {
+    super("kotlin-abstract-coroutine");


Having completely different names for all the different parts of the kotlin_coroutine instrumentation is only needed if you want to enable/disable the individual parts by themselves, which I don't think we want here. Since we're not completely sure about how this works in real applications yet, we should probably name it kotlin_coroutine.experimental, and also override the defaultEnabled() method in the instrumentation classes and return false, so it is an opt-in for now.

Done: d3962bc, d3c9c57
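
For readers following along, the suggestion amounts to something like the sketch below. This is an illustration only: `InstrumentationSketch` is a hypothetical stand-in for the agent's real base class, `JobSupportSketch` is an invented second part of the integration, and `isEnabled` is an assumed way of combining a user setting with the default. Only the name `kotlin_coroutine.experimental` and the `defaultEnabled()` override come from the review comment.

```java
/** Hypothetical stand-in for the agent's instrumentation base class; not the real dd-trace-java type. */
abstract class InstrumentationSketch {
  private final String name;

  InstrumentationSketch(String name) {
    this.name = name;
  }

  String name() {
    return name;
  }

  /** Whether the instrumentation is active when the user has not configured it. */
  protected boolean defaultEnabled() {
    return true;
  }

  /** An explicit user setting wins; null means "not configured", so the default applies. */
  final boolean isEnabled(Boolean userSetting) {
    return userSetting != null ? userSetting : defaultEnabled();
  }
}

/** One part of the integration, sharing the common experimental name and opting out by default. */
final class AbstractCoroutineSketch extends InstrumentationSketch {
  AbstractCoroutineSketch() {
    super("kotlin_coroutine.experimental");
  }

  @Override
  protected boolean defaultEnabled() {
    return false; // opt-in for now
  }
}

/** A second, invented part of the integration that uses the same name and default. */
final class JobSupportSketch extends InstrumentationSketch {
  JobSupportSketch() {
    super("kotlin_coroutine.experimental");
  }

  @Override
  protected boolean defaultEnabled() {
    return false; // opt-in for now
  }
}

final class OptInDemo {
  public static void main(String[] args) {
    InstrumentationSketch part = new AbstractCoroutineSketch();
    System.out.println(part.name() + " enabled by default: " + part.isEnabled(null));
    System.out.println(part.name() + " enabled when opted in: " + part.isEnabled(true));
  }
}
```

Because both parts share one name, a single switch turns the whole integration on or off instead of each part having to be toggled separately.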

bantonsson · 2023-01-24T09:59:00Z

...rc/main/java/datadog/trace/instrumentation/kotlin/coroutines/ScopeStateCoroutineContext.java

-  @Nullable private ContinuationHandler continuationHandler;
+  @Nullable private AgentScope.Continuation continuation;
+  @Nullable private AgentScope continuationScope;
+  private Boolean isInitialized = false;


Nitpick: this will be a Boolean object instead of a primitive boolean.

Thanks for noticing, done: a0fb62f
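
As a small illustration of why the primitive is preferable for a flag like this: a boxed `Boolean` field is a reference that can be null, while a primitive `boolean` cannot. In the diff the field is initialized to `false`, so the null case does not actually occur there; the sketch below (with hypothetical class names) only shows what the wrapper type permits.

```java
final class BoxedFlagDemo {
  /** Boxed field: a reference that defaults to null when not explicitly initialized. */
  static final class BoxedState {
    Boolean isInitialized;
  }

  /** Primitive field: defaults to false and can never be null. */
  static final class PrimitiveState {
    boolean isInitialized;
  }

  /** Returns true if reading the uninitialized boxed flag as a boolean throws. */
  static boolean boxedReadThrows() {
    BoxedState state = new BoxedState();
    try {
      boolean ignored = state.isInitialized; // auto-unboxing a null Boolean
      return false;
    } catch (NullPointerException e) {
      return true;
    }
  }

  /** Returns the default value of the primitive flag. */
  static boolean primitiveDefault() {
    return new PrimitiveState().isInitialized;
  }

  public static void main(String[] args) {
    System.out.println("boxed read throws: " + boxedReadThrows());
    System.out.println("primitive default: " + primitiveDefault());
  }
}
```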

bantonsson · 2023-01-24T10:02:46Z

...rc/main/java/datadog/trace/instrumentation/kotlin/coroutines/ScopeStateCoroutineContext.java

  private final ScopeState coroutineScopeState;
-  @Nullable private ContinuationHandler continuationHandler;
+  @Nullable private AgentScope.Continuation continuation;
+  @Nullable private AgentScope continuationScope;


I think that these variables can still be accessed concurrently by multiple threads and should use AtomicReference.

Yeah, you're right, I'll change that

@bantonsson
Although, now that I've gone through the code again and through the thread context element docs (mostly the comments here), I really don't see how there could be any concurrent writes there.

The first call to updateThreadContext will always be "synchronous"; it can't happen in parallel with restoreThreadContext. We initialize those fields on the first call only; after that there are only reads.

Then there's also coroutineScopeState, but we don't access it at all on a call to restoreThreadContext; instead we write the ScopeStack to the thread-local variable using ScopeState.

The only way we could get concurrent calls to updateThreadContext/restoreThreadContext is if there's a bug somewhere else. And there was one! 🙂

When you call withTimeout, it internally creates a new coroutine of type TimeoutCoroutine, and it turns out that one is handled a bit differently from the others. First, when it is created, CoroutineContextKt.newCoroutineContext is not invoked, so we weren't creating an instance of ScopeStateCoroutineContext for it and it was inheriting that element from the parent context; this is probably what was causing concurrent access to the context element. Second, it is not guaranteed to start: in some cases it just runs the code block without dispatch, and in those cases the on-completion callback won't run. Luckily, there's another method in the JobSupport class that we can instrument to guarantee the on-completion callback runs even when TimeoutCoroutine hasn't been dispatched: onCompletionInternal.

So in 27e144e I made a few changes:

ScopeStateCoroutineContext is now created when the AbstractCoroutine constructor is invoked instead of in CoroutineContextKt.newCoroutineContext. This way every coroutine that is started gets a new instance of ScopeStateCoroutineContext and does not inherit it from the parent coroutine, so we shouldn't have any concurrent access to a ScopeStateCoroutineContext instance.

maybeCloseScopeAndCancelContinuation is now guaranteed to be invoked when a coroutine transitions to a terminal state (Cancelled or Completed).

I also removed the optional scope propagation I added in 48922a7 before I saw your comment 🙂

Nice catch with the TimeoutCoroutine and newCoroutineContext. That explains why multiple threads were accessing the same instance.
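
A rough model of the sharing problem described in this thread, for illustration only. The classes below are hypothetical and do not reflect the real kotlinx.coroutines or agent types; `factoryInvoked` is an assumed flag standing in for "newCoroutineContext was called for this coroutine".

```java
/** Hypothetical stand-in for ScopeStateCoroutineContext: mutable state meant for one coroutine. */
final class ScopeStateElementSketch {
  boolean isInitialized;
}

/** Hypothetical coroutine model that only tracks which context element it ended up with. */
final class CoroutineModel {
  final ScopeStateElementSketch element;

  private CoroutineModel(ScopeStateElementSketch element) {
    this.element = element;
  }

  static CoroutineModel root() {
    return new CoroutineModel(new ScopeStateElementSketch());
  }

  /**
   * Old hook point: a fresh element is created only when the context factory runs.
   * A child created without it (the TimeoutCoroutine case) inherits the parent's element.
   */
  static CoroutineModel viaContextFactoryHook(CoroutineModel parent, boolean factoryInvoked) {
    return new CoroutineModel(factoryInvoked ? new ScopeStateElementSketch() : parent.element);
  }

  /** New hook point: every constructor call creates a fresh element, whatever the parent has. */
  static CoroutineModel viaConstructorHook(CoroutineModel parent) {
    return new CoroutineModel(new ScopeStateElementSketch());
  }
}

final class ElementSharingDemo {
  /** True when parent and child would mutate the same element instance. */
  static boolean sharesElement(CoroutineModel parent, CoroutineModel child) {
    return parent.element == child.element;
  }

  public static void main(String[] args) {
    CoroutineModel parent = CoroutineModel.root();
    CoroutineModel timeoutLike = CoroutineModel.viaContextFactoryHook(parent, false);
    CoroutineModel fixed = CoroutineModel.viaConstructorHook(parent);
    System.out.println("old hook shares element: " + sharesElement(parent, timeoutLike));
    System.out.println("new hook shares element: " + sharesElement(parent, fixed));
  }
}
```

When parent and child hold the same instance, two threads running them can touch the same fields, which matches the concurrent access observed before the fix.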

bantonsson · 2023-01-24T10:12:53Z

...rc/main/java/datadog/trace/instrumentation/kotlin/coroutines/ScopeStateCoroutineContext.java

+    if (!isInitialized) {
+      final AgentScope activeScope = AgentTracer.get().activeScope();
+      if (activeScope != null) {
+        activeScope.setAsyncPropagation(true);


Also, we shouldn't force this on the active scope; we should capture it only if it isAsyncPropagating().

Maybe we should make it an option? I.e. those who'd like to propagate scopes to coroutines by default would just enable this option, while others could have more granular control over it?

@monosoul Automatic async propagation is already the default. If the active scope has its async propagation set to false it is (hopefully) for a good reason. The other big switch will be the kotlin_coroutine.experimental name that enables/disables the integration completely, and it becomes available as dd.integration.kotlin_coroutine.experimental.enabled and DD_INTEGRATION_KOTLIN_COROUTINE_EXPERIMENTAL_ENABLED.

Ahh, I see. Okay, cool, thanks for the explanation! 👍
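
The two policies discussed in this thread can be contrasted with a small sketch. `ScopeSketch` is a hypothetical stand-in that models only the async-propagation flag and returns a string in place of a real continuation; that the diff goes on to capture the scope after forcing the flag is an assumption, since the hunk is cut off before that point.

```java
/** Hypothetical stand-in for the scope type; models only the async-propagation flag. */
final class ScopeSketch {
  private boolean asyncPropagating;

  ScopeSketch(boolean asyncPropagating) {
    this.asyncPropagating = asyncPropagating;
  }

  boolean isAsyncPropagating() {
    return asyncPropagating;
  }

  void setAsyncPropagation(boolean value) {
    asyncPropagating = value;
  }

  /** Returns a placeholder for a captured continuation. */
  String capture() {
    return "continuation";
  }
}

final class CapturePolicyDemo {
  /** Approach questioned in review: force propagation on, then capture unconditionally. */
  static String forceAndCapture(ScopeSketch activeScope) {
    if (activeScope == null) {
      return null;
    }
    activeScope.setAsyncPropagation(true); // overrides whatever the user had chosen
    return activeScope.capture();
  }

  /** Suggested approach: leave the flag alone and capture only when it is already set. */
  static String captureIfPropagating(ScopeSketch activeScope) {
    if (activeScope != null && activeScope.isAsyncPropagating()) {
      return activeScope.capture();
    }
    return null;
  }

  public static void main(String[] args) {
    ScopeSketch optedOut = new ScopeSketch(false);
    System.out.println("respecting flag captures: " + (captureIfPropagating(optedOut) != null));
    System.out.println("forcing flag captures:    " + (forceAndCapture(optedOut) != null));
    System.out.println("flag after forcing:       " + optedOut.isAsyncPropagating());
  }
}
```

With the suggested policy, a scope whose async propagation was deliberately turned off is neither captured nor modified.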

bantonsson and others added 5 commits January 6, 2023 13:00

Add some more kotlin coroutine tests without any enclosing spans

faf1b5d

Merge ContinuationHandler and ScopeStateCoroutineContext

cbda10a

Signed-off-by: monosoul <Kloz.Klaud@gmail.com>

AgentSpan: add isFinished() method

c31c62d

Signed-off-by: monosoul <Kloz.Klaud@gmail.com>

Instrument eagerly/lazily started coroutines differently

31eab43

Signed-off-by: monosoul <Kloz.Klaud@gmail.com>

Revert "AgentSpan: add isFinished() method"

9e65483

This reverts commit c31c62d.

monosoul mentioned this pull request Jan 8, 2023

Kotlin coroutine changes #2

Closed

bantonsson reviewed Jan 24, 2023

monosoul added 6 commits January 25, 2023 22:58

Change instrumentation name to kotlin_coroutine.experimental

d3962bc

Signed-off-by: monosoul <Kloz.Klaud@gmail.com>

Disable coroutines instrumentation by default

d3c9c57

Signed-off-by: monosoul <Kloz.Klaud@gmail.com>

Use primitive boolean

a0fb62f

Signed-off-by: monosoul <Kloz.Klaud@gmail.com>

Make auto scope propagation configurable

48922a7

Signed-off-by: monosoul <Kloz.Klaud@gmail.com>

Make sure maybeCloseScopeAndCancelContinuation() is always invoked when coroutine completes

27e144e

Signed-off-by: monosoul <Kloz.Klaud@gmail.com>

Add Javadoc to the methods

ba6eb02

Signed-off-by: monosoul <Kloz.Klaud@gmail.com>

monosoul requested a review from bantonsson January 26, 2023 12:14

bantonsson approved these changes Jan 26, 2023

monosoul merged commit 2f062e6 into feature/kotlin-coroutines-instrumentation Jan 26, 2023

monosoul deleted the feature/kotlin-coroutines-instrumentation-2 branch December 18, 2023 12:41
