JIT: use blend rather then repair for profile inconsistencies #85171

AndyAyersMS · 2023-04-21T16:36:32Z

If we have a partial profile then the current count reconstruction will adjust the exit likelihood of some loop exit when it hits a capped loop. But for multiple exit loops we might wish to see some profile flow out of all exits, not just one.

In ludcmp we choose to send all the profile weights down an early return path, leaving the bulk of the method with zero counts.

Instead of trying increasingly elaborate repair schemes, we will now use blend mode for these sorts of problems; this gives a more balanced count redistribution.

I also updated blend to use the same logic as repair if a block has zero weights, since presumably whatever likelihood was assigned there during reconstruction is not well supported.

Fixes the ludcmp regression with PGO over no PGO, noted in #84264 (comment)

If we have a partial profile then the current count reconstruction will adjust the exit likelihood of some loop exit when it hits a capped loop. But for multiple exit loops we might wish to see some profile flow out of all exits, not just one. In `ludcmp` we choose to send all the profile weights down an early return path, leaving the bulk of the method with zero counts. Instead of trying increasingly elaborate repair schemes, we will now use blend mode for these sorts of problems; this gives a more balanced count redistribution. I also updated blend to use the same logic as repair if a block has zero weights, since presumably whatever likelihood was assigned there during reconstruction is not well supported. Fixes the `ludcmp` regression with PGO over no PGO, noted in dotnet#84264 (comment)

ghost · 2023-04-21T16:36:43Z

Tagging subscribers to this area: @JulieLeeMSFT, @jakobbotsch
See info in area-owners.md if you want to be subscribed.

Issue Details

If we have a partial profile then the current count reconstruction will adjust the exit likelihood of some loop exit when it hits a capped loop. But for multiple exit loops we might wish to see some profile flow out of all exits, not just one.

In ludcmp we choose to send all the profile weights down an early return path, leaving the bulk of the method with zero counts.

Instead of trying increasingly elaborate repair schemes, we will now use blend mode for these sorts of problems; this gives a more balanced count redistribution.

I also updated blend to use the same logic as repair if a block has zero weights, since presumably whatever likelihood was assigned there during reconstruction is not well supported.

Fixes the ludcmp regression with PGO over no PGO, noted in #84264 (comment)

Author:	AndyAyersMS
Assignees:	AndyAyersMS
Labels:	`area-CodeGen-coreclr`
Milestone:	-

AndyAyersMS · 2023-04-21T16:36:45Z

@EgorBo PTAL
cc @dotnet/jit-contrib

AndyAyersMS · 2023-04-21T17:25:37Z

/azp run runtime-coreclr pgo, runtime-coreclr libraries-pgo

azure-pipelines · 2023-04-21T17:25:57Z

Azure Pipelines successfully started running 2 pipeline(s).

AndyAyersMS · 2023-04-21T17:36:51Z

Diffs are fairly surgical: ludcmp and one related method, and a couple others here and there.

EgorBo

I assume a few size regressions are expected

AndyAyersMS · 2023-04-21T20:08:36Z

I assume a few size regressions are expected

Yes. It is mainly just this one method where the old repair process left most of the method cold and so we didn't clone loops like we need to for good perf.

Top method regressions (bytes):
        1784 (124.93 % of base) : 37381.dasm - LUDecomp:ludcmp(double[][],int,int[],byref):int

AndyAyersMS · 2023-04-22T14:45:24Z

Failure is known issue,

dotnet-issue-labeler bot added the area-CodeGen-coreclr CLR JIT compiler in src/coreclr/src/jit and related components such as SuperPMI label Apr 21, 2023

ghost assigned AndyAyersMS Apr 21, 2023

AndyAyersMS mentioned this pull request Apr 21, 2023

Investigate microbenchmarks that regress with PGO enabled #84264

Closed

build-analysis bot mentioned this pull request Apr 21, 2023

Could not load file or assembly 'Microsoft.CodeAnalysis.NetAnalyzers #84995

Closed

EgorBo approved these changes Apr 21, 2023

View reviewed changes

AndyAyersMS merged commit c119e4f into dotnet:main Apr 22, 2023
167 of 170 checks passed

ghost locked as resolved and limited conversation to collaborators May 22, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

JIT: use blend rather then repair for profile inconsistencies #85171

JIT: use blend rather then repair for profile inconsistencies #85171

AndyAyersMS commented Apr 21, 2023

ghost commented Apr 21, 2023

AndyAyersMS commented Apr 21, 2023

AndyAyersMS commented Apr 21, 2023

azure-pipelines bot commented Apr 21, 2023

AndyAyersMS commented Apr 21, 2023

EgorBo left a comment

AndyAyersMS commented Apr 21, 2023

AndyAyersMS commented Apr 22, 2023

JIT: use blend rather then repair for profile inconsistencies #85171

JIT: use blend rather then repair for profile inconsistencies #85171

Conversation

AndyAyersMS commented Apr 21, 2023

ghost commented Apr 21, 2023

AndyAyersMS commented Apr 21, 2023

AndyAyersMS commented Apr 21, 2023

azure-pipelines bot commented Apr 21, 2023

AndyAyersMS commented Apr 21, 2023

EgorBo left a comment

Choose a reason for hiding this comment

AndyAyersMS commented Apr 21, 2023

AndyAyersMS commented Apr 22, 2023