Syntax classification taking up to 12% of CPU time #68996

Youssef1313 · 2023-07-12T05:15:54Z

I'm not sure if addressing this comment would improve it.

roslyn/src/Compilers/CSharp/Portable/Symbols/AbstractTypeMap.cs

Lines 53 to 55 in 86156fa

    
           // TODO: we could construct the result's ConstructedFrom lazily by using a "deep" 
        
           // construct operation here (as VB does), thereby avoiding alpha renaming in most cases. 
        
           // Aleksey has shown that would reduce GC pressure if substitutions of deeply nested generics are common.

I'll try to upload the trace soon.

Youssef1313 · 2023-07-12T05:55:31Z

Aside from CPU time, there are lots of allocations of TypeWithAnnotations[] in AbstractTypeMap.SubstituteNamedType. That is 6.5% of allocations.

Youssef1313 · 2023-07-12T06:23:43Z

Trace in https://developercommunity.visualstudio.com/t/Performance-trace-for-Roslyn-team/10413054

Partial fix for dotnet#68996. Both SubstituteTypesWithoutModifiers and SubstituteNamedTypes allocate a temporary array that may be used to return an immutable array. Previously, this immutable array was allocated separately when needed; because the temporary array is local to the method, we can avoid that extra allocation by wrapping it with ImmutableCollectionsMarshal.
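
For readers unfamiliar with the technique described in the PR text, here is a minimal sketch of it, not the actual Roslyn change. `SubstitutionSketch`, `Substitute`, and the `map` delegate are hypothetical names made up for illustration. The only real API assumed is `ImmutableCollectionsMarshal.AsImmutableArray` from `System.Runtime.InteropServices` (System.Collections.Immutable 8.0 or later), which wraps an existing array without copying it.

```csharp
using System;
using System.Collections.Immutable;
using System.Runtime.InteropServices;

static class SubstitutionSketch
{
    // Hypothetical helper: maps every element and returns the original
    // array unchanged when no element was actually substituted.
    public static ImmutableArray<T> Substitute<T>(ImmutableArray<T> original, Func<T, T> map)
        where T : class
    {
        // Temporary buffer, local to this method and never exposed elsewhere.
        var buffer = new T[original.Length];
        bool changed = false;

        for (int i = 0; i < original.Length; i++)
        {
            buffer[i] = map(original[i]);
            changed |= !ReferenceEquals(buffer[i], original[i]);
        }

        if (!changed)
        {
            return original;
        }

        // ImmutableArray.Create(buffer) would copy into a second array.
        // Wrapping the buffer directly skips that copy; this is only safe
        // because nothing else holds a reference to `buffer`.
        return ImmutableCollectionsMarshal.AsImmutableArray(buffer);
    }
}
```

The wrap is only sound while the wrapped array is never mutated afterwards, which is why the PR text stresses that the array is local to the method.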

ToddGrun · 2024-03-19T14:12:56Z

Allocations under AbstractTypeMap improved by #72588. CPU characteristics from classification have improved significantly (due to merging classifiers and reducing how often they are invalidated) since 17.8. If you are able to gather a profile against Roslyn with #72588, or against stock VS >= 17.10 Preview 3, that still shows this as an area of concern, I will gladly dig into it more to see if I can find other improvements. Thanks!

Youssef1313 · 2024-04-05T19:03:19Z

@ToddGrun It's still super significant in my case (using 17.10 P1).

9.5% of allocations (over 1 GB) going to TypeWithAnnotations[] is too much.

ToddGrun · 2024-04-05T19:42:50Z

@Youssef1313 -- Any chance you can share your profile so I can dig into it a bit?

Youssef1313 · 2024-04-05T19:50:55Z

Repro project:

UnoSampleSolution.zip

Scenario: Typing the following:

I'm uploading the trace. But upload speed is too slow here for me.

Youssef1313 · 2024-04-05T19:51:55Z

This is also the same repro that was used for #67926

Youssef1313 · 2024-04-05T20:26:08Z

@ToddGrun https://developercommunity.visualstudio.com/t/10632375

ToddGrun · 2024-04-05T21:26:05Z

@Youssef1313 -- I've tried reproducing your scenario with your project and I don't see nearly the allocations that you are hitting. I'm not sure if it's because the project isn't fully working for me (I don't have a couple of the net7.0 TFMs it uses installed) or because I can't resolve a couple of the Uno.* packages. I'll keep looking, but I'm not seeing anything obvious from my debugging or from looking at your trace about how to improve this.

mentioned @CyrusNajmabadi for visibility

Youssef1313 · 2024-04-05T21:31:15Z

@ToddGrun I think the issue should be apparent/reproducible if you got the project to build properly. Let me know if there is anything I can help with so you can repro.

Youssef1313 · 2024-04-05T21:37:13Z

From the trace, I'm seeing DiagnosticInfo.GetHashCode showing up taking 6.7% of CPU time. I don't think the issue is that GetHashCode itself is slow, but it's probably that it's called excessively.

My understanding is that they are created while checking whether an overload is applicable or not, and likely non-applicable overloads produce diagnostics. But, I think those diagnostics are thrown away in the end anyway, and this is probably wasted time. So it feels like some work can be saved, but I don't really see exactly what should be done.

There is also #59733 which may help with the scenario above, but seems like the PR got stale :'(

CyrusNajmabadi · 2024-04-05T21:51:26Z

I think this is being approached from the wrong direction. Looking at the code, there's a lambda involved, and completion is exceedingly slow. My guess is that there's something about this lambda, and lambda binding in general, making this rough (perhaps some sort of large overload resolution case).

Youssef1313 · 2024-04-07T21:54:34Z

@CyrusNajmabadi Yeah we have multiple overloads for the extension methods and overload resolution definitely has to do more work. Still, I think it shouldn't be slow to that extent. I'm not sure how to best approach the issue here.

CyrusNajmabadi · 2024-04-07T22:00:46Z

Generally speaking, extension methods with complex lambdas can explode very quickly into a perf morass. Even if we do any work here, it's likely that even trivial changes you make to your API could increase costs by an order of magnitude or more. So, generally speaking, I'd advise restructuring the API.

Youssef1313 · 2024-04-07T22:09:06Z

@CyrusNajmabadi Well, the API already shipped, but I think we can consider changes for the next major release. However, I think we'll want API changes to still be source-compatible (it will be okay to only break binary compatibility, I guess). I don't see, though, what the modified API should look like.

#68996 (comment) is also still concerning, as it feels like unnecessary work is being done, but I'm not entirely sure whether my concern is right or how much improvement it would bring in practice.

CyrusNajmabadi · 2024-04-07T22:13:32Z

Right, my point is that that is happening due to combinatorial explosion, a not-uncommon problem with complex overload resolution scenarios and lambdas.

While we could attempt to squeeze down some of these scenarios, your performance will be dominated by this. And even a single additional level of nesting, or another overload, will likely introduce further order-of-magnitude perf cliffs.

It's largely because of this problem space that we added the ability for lambdas to state both their argument type and return type. And yes, the recommendation would be for users of the API to supply that to ensure the compiler doesn't have to go through exponential computations to figure that stuff out.
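
To make the recommendation above concrete, here is a small sketch of what stating lambda types looks like. `Builder.Configure` and its two overloads are hypothetical stand-ins for an overloaded, lambda-taking API; they are not the Uno API from the repro. Explicit lambda return types require C# 10 or later.

```csharp
using System;

static class Builder
{
    // Hypothetical overloads that differ only in the lambda's parameter type.
    public static string Configure(Func<int, string> f) => f(1);
    public static string Configure(Func<string, string> f) => f("a");
}

static class Demo
{
    // Parameter type inferred: to decide which overload applies, the compiler
    // has to try binding the lambda body against each candidate delegate type.
    // With nested lambdas, those attempts multiply at every level.
    public static string Inferred() =>
        Builder.Configure(x => x.ToUpper());

    // Parameter type stated: candidates whose delegate parameter types
    // do not match can be rejected without binding the body for them.
    public static string ExplicitParameter() =>
        Builder.Configure((string x) => x.ToUpper());

    // Parameter and return type both stated (C# 10 and later).
    public static string ExplicitBoth() =>
        Builder.Configure(string (string x) => x.ToUpper());
}
```

All three calls resolve to the `Func<string, string>` overload; the difference is only in how much work overload resolution has to do to get there.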

ToddGrun · 2024-04-08T20:02:26Z

Adding @cston from the compiler team, who has more context on this area.

CyrusNajmabadi · 2024-04-08T20:44:11Z

> However, I think we'll want API changes to still be source-compatible

Sure. But if you are using patterns that really exacerbate things, then you might need to tell your customers that their perf experience will be greatly improved if they explicitly specify the arg-types and return-types of their lambdas.

I believe (and @cston may confirm) that we may even aggressively push people to this at the compiler level for when they start running into exponential blowup.

cston · 2024-04-15T17:01:55Z

We are adding an info severity diagnostic reported for lambda expression bodies that are bound many times, typically in cases where nested lambda expressions are used as arguments to overloaded or generic method calls and where the lambda parameter types are inferred: see #72823.

dotnet-issue-labeler bot added the Area-IDE and untriaged labels Jul 12, 2023

jeromelaban mentioned this issue Jul 12, 2023

[Epic] Opened Microsoft issues tracking unoplatform/uno#982

Open

arkalyanms assigned ToddGrun Aug 2, 2023

arkalyanms added the Tenet-Performance label and removed the untriaged label Aug 2, 2023

arkalyanms added this to the 17.8 P4 milestone Aug 2, 2023

ToddGrun mentioned this issue Mar 18, 2024

Reduce allocations in AbstractTypeMap #72588

Merged

ToddGrun closed this as completed Mar 19, 2024
