[wasm] Interpreter automatic PGO #92981

kg · 2023-10-04T02:54:49Z

This PR adds automatic PGO for interpreter tiering. The idea is that by maintaining a list of which interpreter methods were tiered during execution, we can skip directly to generating optimized code on future runs. This will get the application into a steady high-performance state faster on future runs, while also reducing the amount of time/memory we spend on code generation (since normally we generate unoptimized code, then generate optimized code).

The infrastructure for this feature is not platform gated, but the only platform that has an implementation for loading/saving the PGO data right now is the browser, and the runtime option controlling it is only default-on for the browser.

Web application code can either use INTERNAL.interpgo_load_data and INTERNAL.interpgo_save_data to manually control the loading/saving of the data, or can use the new withInterpreterPgo(true, autoSaveDelay) builder method to turn on automatic mode. Automatic mode will attempt to load the data during startup, and will automatically save it after a delay (at which point your application should be in something approximating a steady-state and critical code has already tiered.) The current web implementation repurposes the cache storage code from pavel's memory snapshot feature to store the pgo table in the cache next to the snapshot(s).

This PR also adds basic (controlled by a #define and off by default) timing instrumentation for generate_code, because in my testing the browser's profiler was producing wildly incorrect timing data for interpreter codegen. This should make it easy to evaluate how well the feature is working since you can just look at the output.

One interesting finding: The list of method hashes generated on a first run is bigger than the list generated on future runs - I believe this is because once a full set of methods is tiered, we no longer need the tiered version of some of them because the hottest methods have gotten inlined into their callers. So on a second run the list of hashes stabilizes into a smaller list.

In a couple local test runs of browser-bench with this active and without, the time spent in generate_code was 3600ms without interpgo and 3209ms with interpgo, so the improvement over the course of a full run of an application with lots of generated code should be worthwhile. For smaller applications it's below the noise floor, I haven't been able to measure it successfully at that scale.

ghost · 2023-10-04T02:55:02Z

Tagging subscribers to this area: @BrzVlad, @kotlarmilos
See info in area-owners.md if you want to be subscribed.

Issue Details

This PR adds automatic PGO for interpreter tiering. The idea is that by maintaining a list of which interpreter methods were tiered during execution, we can skip directly to generating optimized code on future runs. This will get the application into a steady high-performance state faster on future runs, while also reducing the amount of time/memory we spend on code generation (since normally we generate unoptimized code, then generate optimized code).

Application code can either use INTERNAL.interpgo_load_data and INTERNAL.interpgo_save_data to manually control the loading/saving of the data, or can use the new withInterpreterPgo(true, autoSaveDelay) builder method to turn on automatic mode. Automatic mode will attempt to load the data during startup, and will automatically save it after a delay (at which point your application should be in something approximating a steady-state and critical code has already tiered.)

Author:	kg
Assignees:	kg
Labels:	`area-Codegen-Interpreter-mono`
Milestone:	-

src/mono/mono/mini/interp/interpgo.c

src/mono/mono/mini/interp/interpgo.h

src/mono/mono/mini/interp/interpgo.c

vargaz · 2023-10-12T04:34:00Z

src/mono/mono/mini/interp/interpgo.c

+	size_t size = sizeof(uint32_t) + 16;
+	uint32_t *inbuf = alloca (size);
+	// method tokens are globally unique within a given assembly
+	inbuf[0] = mono_method_get_token (method);


This is not true for generic instances.

OK, so I need to hash the generic arguments? Or is it more complex than that?

some generic instances are very hot in the BCL, like the ones used in string/array searches

It can hash the full method name for example, but maybe its good enough this way.

I think it's probably fine this way, if any instance of the method got tiered we can probably tier all of them. This still won't generate code for other instances until they're actually used.

If it's a wrapper method, I think the token is either shared with the method that it is wrapping, or it is zero. maybe we don't mind the collisions?

Can we share the AOT compiler's logic for uniquely identifying methods?

The aot compiler/runtime has a mono_aot_method_hash () which returns a good hash.

mono_aot_method_hash looks comprehensive but also looks really expensive, and i'm not sure how well it will avoid collisions. do you still want me to use it?

src/mono/wasm/runtime/types/index.ts

src/mono/wasm/test-main.js

src/mono/wasm/runtime/types/index.ts

src/mono/wasm/runtime/interpgo.ts

src/mono/wasm/runtime/cwraps.ts

src/mono/sample/wasm/browser-bench/main.js

src/mono/mono/mini/interp/interpgo.c

+	size_t size = sizeof(uint32_t) + 16;
+	uint32_t *inbuf = alloca (size);
+	// method tokens are globally unique within a given assembly
+	inbuf[0] = mono_method_get_token (method);


src/mono/mono/mini/interp/interpgo.c

radical

WBT changes look good.

src/mono/wasm/Wasm.Build.Tests/Templates/InterpPgoTests.cs

src/mono/wasm/Wasm.Build.Tests/BrowserRunner.cs

src/mono/wasm/Wasm.Build.Tests/Templates/InterpPgoTests.cs

src/mono/wasm/Wasm.Build.Tests/WasmTemplateTestBase.cs

src/mono/wasm/runtime/startup.ts

pavelsavara · 2023-10-18T09:14:47Z

src/mono/wasm/Wasm.Build.Tests/Templates/InterpPgoTests.cs

+            }
+
+            {
+                _testOutput.WriteLine("/// Second run");


I'm not very familiar with playwrite, should we navigate the browser away before we load the page second time ?

Each page is basically a separate tab as I understand it

src/mono/mono/mini/interp/transform.c

Checkpoint: Interpgo bloom filter works Workaround for 'reach managed cold' hanging forever Remove logging Checkpoint Checkpoint Configurable interpreter PGO Enable interpgo for browser bench Checkpoint: Refactor interpgo to share cache code with memory snapshots Use cache storage for Interpreter PGO Code cleanup + repair merge damage Repair merge damage Fix some bugs and troubleshoot WBT failure Fix build Move prototypes I love C Fix key collision between interpgo and memory snapshot Fix storeCacheEntry always writing to the memory snapshot key Checkpoint Move more of interpgo outside of platform guards Fix build Make the inline murmurhash functions also static to fix linker errors Remove unused signature argument Cleanup Address PR feedback Rearrange code Address PR feedback Fix assert when loading empty table Checkpoint Update dotnet.d.ts Move interp codegen timing to a runtime option + make it thread safe Move interp pgo logging to a runtime option Add a WBT test that verifies basic functionality of interpreter PGO Add missing file Await interp_pgo_save_data Apply suggestions from code review Co-authored-by: Ankit Jain <radical@gmail.com> Cleanup / address PR feedback Address PR feedback Increase browser-bench waitFor timeout Fix syntax

ghost assigned kg Oct 4, 2023

dotnet-issue-labeler bot added the area-Codegen-Interpreter-mono label Oct 4, 2023

kg added the arch-wasm WebAssembly architecture label Oct 4, 2023

kg force-pushed the wasm-interpgo branch from b10a9f4 to 6134ef4 Compare October 5, 2023 17:45

This was referenced Oct 5, 2023

Tracking issue for CI build timeouts #76454

Closed

System.Diagnostics.Tests.ProcessTests.TestCheckChildProcessUserAndGroupIds failing Subset assertion #92944

Open

dotnet deleted a comment from azure-pipelines bot Oct 10, 2023

kg marked this pull request as ready for review October 11, 2023 23:02

kg requested review from lewing, pavelsavara, vargaz, lambdageek, BrzVlad, kotlarmilos and SamMonoRT as code owners October 11, 2023 23:02

vargaz reviewed Oct 12, 2023

View reviewed changes

src/mono/mono/mini/interp/interpgo.c Outdated Show resolved Hide resolved

vargaz reviewed Oct 12, 2023

View reviewed changes

src/mono/mono/mini/interp/interpgo.h Outdated Show resolved Hide resolved

vargaz reviewed Oct 12, 2023

View reviewed changes

src/mono/mono/mini/interp/interpgo.c Outdated Show resolved Hide resolved

vargaz reviewed Oct 12, 2023

View reviewed changes

src/mono/mono/mini/interp/interpgo.c Outdated Show resolved Hide resolved

vargaz reviewed Oct 12, 2023

View reviewed changes

pavelsavara reviewed Oct 12, 2023

View reviewed changes

lambdageek reviewed Oct 12, 2023

View reviewed changes

src/mono/mono/mini/interp/interpgo.c Outdated Show resolved Hide resolved

lambdageek reviewed Oct 12, 2023

View reviewed changes

src/mono/mono/mini/interp/interpgo.c Outdated Show resolved Hide resolved

src/mono/mono/mini/interp/interpgo.c Outdated Show resolved Hide resolved

src/mono/mono/mini/interp/interpgo.c Outdated Show resolved Hide resolved

build-analysis bot mentioned this pull request Oct 13, 2023

[6.0, 7.0, 8.0] Connection reset assert failure in System.Net.Sockets.Tests.SendReceive_SyncForceNonBlocking.TcpReceiveSendGetsCanceledByDispose #92423

Open

kg requested a review from radical as a code owner October 18, 2023 00:20

build-analysis bot mentioned this pull request Oct 18, 2023

NuGet failing with Response status code does not indicate success: 503 (Service Unavailable) dotnet/arcade#11723

Open

5 tasks

radical approved these changes Oct 18, 2023

View reviewed changes

build-analysis bot mentioned this pull request Oct 18, 2023

MSBuild crashing in the build #92290

Open

pavelsavara reviewed Oct 18, 2023

View reviewed changes

src/mono/wasm/runtime/startup.ts Show resolved Hide resolved

pavelsavara reviewed Oct 18, 2023

View reviewed changes

src/mono/wasm/runtime/startup.ts Outdated Show resolved Hide resolved

pavelsavara reviewed Oct 18, 2023

View reviewed changes

build-analysis bot mentioned this pull request Oct 20, 2023

[wasm] Runtime tests failing - _Vector2_3_4::Vector2_3_4Test.RunVector*Tests because of missing support for native libraries #93669

Closed

pavelsavara approved these changes Oct 26, 2023

View reviewed changes

BrzVlad reviewed Oct 30, 2023

View reviewed changes

src/mono/mono/mini/interp/transform.c Outdated Show resolved Hide resolved

kg added 3 commits October 31, 2023 15:15

Fix build

c89838d

Move codegen timing hooks

80973c4

kg force-pushed the wasm-interpgo branch from 5872351 to 80973c4 Compare October 31, 2023 22:31

BrzVlad approved these changes Nov 1, 2023

View reviewed changes

kg merged commit 8bce5a8 into dotnet:main Nov 1, 2023
108 checks passed

ghost locked as resolved and limited conversation to collaborators Dec 1, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[wasm] Interpreter automatic PGO #92981

[wasm] Interpreter automatic PGO #92981

kg commented Oct 4, 2023 •

edited

Loading

ghost commented Oct 4, 2023

vargaz Oct 12, 2023

kg Oct 12, 2023

This comment was marked as duplicate.

kg Oct 12, 2023

vargaz Oct 12, 2023

kg Oct 12, 2023

lambdageek Oct 12, 2023

lambdageek Oct 12, 2023

vargaz Oct 12, 2023

kg Oct 13, 2023

This comment was marked as duplicate.

radical left a comment

pavelsavara Oct 18, 2023

kg Oct 19, 2023

[wasm] Interpreter automatic PGO #92981

[wasm] Interpreter automatic PGO #92981

Conversation

kg commented Oct 4, 2023 • edited Loading

ghost commented Oct 4, 2023

Choose a reason for hiding this comment

Choose a reason for hiding this comment

This comment was marked as duplicate.

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

This comment was marked as duplicate.

radical left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

kg commented Oct 4, 2023 •

edited

Loading