Add low-cost instrumented version of RuntimeAsyncTask::DispatchContinuations reclaiming lost performance. by lateralusX · Pull Request #126091 · dotnet/runtime

lateralusX · 2026-03-25T13:51:05Z

#123727 introduced a regression of ~7% adding additional instrumentation for Debugger/TPL into RuntimeAsyncTask::DispatchContinuations. Going forward, there will be even more instrumentation needed when implementing async profiling and that would increase the overhead even more, so we need a way to isolate the instrumented vs none instrumented version of this method and regain some of the lost performance.

This PR introduces DispatchContinuations<TRuntimeAsyncTaskInstrumentation>() where TRuntimeAsyncTaskInstrumentation : struct, IRuntimeAsyncTaskInstrumentation using generic value type specialization using an interface for the instrumentation locations needed in the method creating two different versions of codegen. When instrumentation is disabled, most of the instrumentations are empty with minimal overhead on the execution of the method.

I took this path to get away from duplicating the complete DispatchContinuations method into a regular and instrumented version, reducing the maintenance of ~100 lines of duplicated high performing unsafe code. It can be argued that duplicating the DispatchContinuations is a small price to pay giving a little clearer implementation. If that is something we all agree on and accept, I'm happy to pursue that path as well, but wanted to start with the zero cost abstraction, no duplication path.

To be able to "upgrade" from none instrumented to instrumented version of DispatchContinuations, there are checks at method entry and after each completed continuation detecting if method should switch to instrumented version. Both checks are small and fast, but the check on method entry needs to do one more compare detecting if debugger flag has been toggled. The post continuation completion check is just a reload of a flag in a static variable and checked if it's not 0.

This change will make sure the RuntimeAsyncTask::DispatchContinuations is protected from future performance regressions when more instrumentation gets added into the method. Since it shares the majority of the implementation of the loop itself, there is no code duplication. Another approach will be to just duplicate the method, but that would lead to maintaining two copies of the method.

Running the same benchmark, #123727 (comment) now shows the following numbers on old vs new implementation:

Metric	Old	New	Diff
Total bytes (`S.P.C`)	16 740 KB	16 684 KB	-56 KB
JIT Size (`RuntimeAsyncTask::DispatchContinuations`)	1778 B	1431 B	-347 B
Benchmark	337ms	362ms	-25ms (~ -7%)

Measurements done on Windows x64.

S.P.C is 56 KB smaller with this PR, so the extra instrumented codegen version of RuntimeAsyncTask::DispatchContinuations is not included in R2R image. The methods triggering the generic instantiation of the instrumented method have been explicitly marked BypassReadyToRun.

JIT Size is 347 bytes smaller on the default none instrumented version of RuntimeAsyncTask::DispatchContinuations, all previous instrumentation has been moved out into the instrumentation interface, completely eliminated in default method.

Benchmark shows that this PR recover most of the performance previously lost in #123727.

Code paths triggering the use of instrumented specialization, DispatchContinuations<EnableRuntimeAsyncTaskInstrumentation> are protected by a IsSupported flag. On JIT this is always true and will be folded away, but on Native AOT it will use feature flags ( Debugger.IsSupported || EventSource.IsSupported), that are false by default, meaning that none of the instrumented versions of RuntimeAsyncTask::DispatchContinuations will be included, and together with the JIT size savings to RuntimeAsyncTask::DispatchContinuations, Native AOT apps are expected to be slightly smaller.

Most changes in this PR are around setting up the interface used as instrumentation points and extract out existing instrumentation into the instrumentation implementation of the interface. PR also optimize some of the debugger instrumentations previously implemented reducing locking in scenarios where continuation chains are handled.

PR adds a number of new tests validating that the current debugger and TPL instrumentation is still working.

PR also adds preparation for async profiler instrumentation in the RuntimeAsyncTaskInstrumentation type.

Add instrumentation probes to RuntimeAsyncTask DispatchContinuations loop. This is an extremely hot loop, in the centre of dispatching async continuations. dotnet#123727 introduced a regression of ~7% when adding additional instrumentation for debugger/tpl into the loop. This commit uses generic value type specialization to setup an interface for the probes that JIT can use to create two versions of codegen for this hot method, most of the probes will be transformed to noop when profiling/debugging is disabled, introduce minimal overhead to critical hot code path. Dispatch loop checks on entry if instrumentation is enabled, if so it will switch to instrumented version of the function. It also checks on each completion of a continuation if instrumentation flags changed and if that is the case it will again switch to the instrumented version. In total it performs small set of instructions to "upgrade" the method on entry, and also a fast check against a static on each loop to support late attach scenarios. This change will make sure the dispatch continuations loop is protected from future performance regressions when more instrumentation gets added.

dotnet-policy-service · 2026-03-25T13:52:25Z

Tagging subscribers to this area: @dotnet/area-system-runtime
See info in area-owners.md if you want to be subscribed.

Copilot

Pull request overview

This PR refactors runtime-async continuation dispatch to support a low-overhead “uninstrumented” fast path while still enabling Debugger/TPL (and future profiler) instrumentation via a separate, JIT-specialized codegen path, with updated flag plumbing and added tests to validate behavior and cleanup.

Changes:

Introduces a generic, instrumentation-specialized DispatchContinuations<TRuntimeAsyncTaskInstrumentation>() path and centralized runtime-async instrumentation flag management.
Refactors Task’s runtime-async timestamp bookkeeping APIs to better support continuation chains and exception/unwind cleanup.
Adds/expands RuntimeAsync tests for timestamp cleanup, debugger detach behavior, continuation timestamp visibility, and TPL EventSource events; wires TPL EventSource enable/disable to update instrumentation flags.

Reviewed changes

Copilot reviewed 4 out of 4 changed files in this pull request and generated 4 comments.

File	Description
src/libraries/System.Runtime/tests/System.Threading.Tasks.Tests/System.Runtime.CompilerServices/RuntimeAsyncTests.cs	Adds coverage for runtime-async instrumentation behavior (timestamps, detach, unwind/cancel, and TPL events).
src/libraries/System.Private.CoreLib/src/System/Threading/Tasks/TplEventSource.cs	Updates runtime-async instrumentation flags when TPL EventSource commands change enabled keywords.
src/libraries/System.Private.CoreLib/src/System/Threading/Tasks/Task.cs	Refactors runtime-async timestamp dictionaries and adds helpers for chain timestamp propagation and cleanup.
src/coreclr/System.Private.CoreLib/src/System/Runtime/CompilerServices/AsyncHelpers.CoreCLR.cs	Adds the generic instrumentation abstraction and splits dispatch/finalize into specialized uninstrumented vs instrumented implementations.

...time/tests/System.Threading.Tasks.Tests/System.Runtime.CompilerServices/RuntimeAsyncTests.cs

src/libraries/System.Private.CoreLib/src/System/Threading/Tasks/Task.cs

src/coreclr/System.Private.CoreLib/src/System/Runtime/CompilerServices/AsyncHelpers.CoreCLR.cs

…s/System.Runtime.CompilerServices/RuntimeAsyncTests.cs Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

…s/Task.cs Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

…s/System.Runtime.CompilerServices/RuntimeAsyncTests.cs Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

Copilot

Pull request overview

Copilot reviewed 4 out of 4 changed files in this pull request and generated 3 comments.

...time/tests/System.Threading.Tasks.Tests/System.Runtime.CompilerServices/RuntimeAsyncTests.cs

src/libraries/System.Private.CoreLib/src/System/Threading/Tasks/Task.cs

…s/Task.cs Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

Copilot

Pull request overview

Copilot reviewed 4 out of 4 changed files in this pull request and generated 2 comments.

src/coreclr/System.Private.CoreLib/src/System/Runtime/CompilerServices/AsyncHelpers.CoreCLR.cs

lateralusX · 2026-03-26T09:04:24Z

Most failures are due to test _reflection::Async2Reflection.FromStack that currently asserts that DispatchContinuations is on stack and is currently not aware of the change to a generic method. If we stick with the generic method, then the assert in this test needs to be updated to reflect the name change.

lateralusX added 3 commits March 25, 2026 13:08

Adding additional tests.

44e595f

Restore FinalizeTaskReturningThunk pattern.

cb94fa7

lateralusX requested review from Copilot and rcj1 March 25, 2026 13:51

github-actions bot added the area-System.Runtime label Mar 25, 2026

lateralusX requested a review from jakobbotsch March 25, 2026 13:51

dotnet-policy-service bot assigned lateralusX Mar 25, 2026

Copilot started reviewing on behalf of lateralusX March 25, 2026 13:52 View session

Copilot AI reviewed Mar 25, 2026

View reviewed changes

Update src/libraries/System.Runtime/tests/System.Threading.Tasks.Test…

77b306a

…s/System.Runtime.CompilerServices/RuntimeAsyncTests.cs Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

Copilot AI review requested due to automatic review settings March 25, 2026 14:53

Update src/libraries/System.Private.CoreLib/src/System/Threading/Task…

1d4ff7a

…s/Task.cs Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

Copilot started reviewing on behalf of lateralusX March 25, 2026 14:54 View session

Update src/libraries/System.Runtime/tests/System.Threading.Tasks.Test…

b4936e6

…s/System.Runtime.CompilerServices/RuntimeAsyncTests.cs Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

Copilot AI reviewed Mar 25, 2026

View reviewed changes

Update src/libraries/System.Private.CoreLib/src/System/Threading/Task…

996b1f9

…s/Task.cs Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

Copilot AI review requested due to automatic review settings March 25, 2026 15:15

Copilot started reviewing on behalf of lateralusX March 25, 2026 15:17 View session

Copilot AI reviewed Mar 25, 2026

View reviewed changes

src/coreclr/System.Private.CoreLib/src/System/Runtime/CompilerServices/AsyncHelpers.CoreCLR.cs Show resolved Hide resolved

src/coreclr/System.Private.CoreLib/src/System/Runtime/CompilerServices/AsyncHelpers.CoreCLR.cs Show resolved Hide resolved

Update test.

88807a1

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add low-cost instrumented version of RuntimeAsyncTask::DispatchContinuations reclaiming lost performance.#126091

Add low-cost instrumented version of RuntimeAsyncTask::DispatchContinuations reclaiming lost performance.#126091
lateralusX wants to merge 8 commits intodotnet:mainfrom
lateralusX:lateralusX/runtime-async-instrumentation

lateralusX commented Mar 25, 2026 •

edited

Loading

Uh oh!

dotnet-policy-service bot commented Mar 25, 2026

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

lateralusX commented Mar 26, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

lateralusX commented Mar 25, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

dotnet-policy-service bot commented Mar 25, 2026

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Uh oh!

Uh oh!

lateralusX commented Mar 26, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

lateralusX commented Mar 25, 2026 •

edited

Loading