Create a managed implementation of assembly binder #91400

huoyaoyuan · 2023-08-31T16:24:27Z

It should be possible to move the whole assembly loader to C#. We would need a special unmanaged path to load CoreLib to bootstrap, but the rest can be managed. It is too big to do it all at once. I think a good place to start with this refactoring would be BINDER_SPACE::*.

I've ported the types under BINDER_SPACE::, and a large portion of binding logic in assemblybindercommon.cpp. I also created a disabled feature switch to guard the code.

The logic is ported closely from C++ and aims line-to-line match, to keep the behavior. Initially I want to convert all the HRESULTs to exceptions, but it's not practical to use exception in control logic of binder, so I kept HRESULTs in AssemblyBinderCommon.

/cc @jkotas

Is this in the desired direction? I haven't do anything around managed/unmanaged boundary. I think code should be ported to managed, until something we don't want.

ghost · 2023-08-31T16:24:38Z

Tagging subscribers to this area: @vitek-karas, @agocke, @VSadov
See info in area-owners.md if you want to be subscribed.

Issue Details

#85558 (comment)

It should be possible to move the whole assembly loader to C#. We would need a special unmanaged path to load CoreLib to bootstrap, but the rest can be managed. It is too big to do it all at once. I think a good place to start with this refactoring would be BINDER_SPACE::*.

I've ported the types under BINDER_SPACE::, and a large portion of binding logic in assemblybindercommon.cpp. I also created a disabled feature switch to guard the code.

The logic is ported closely from C++ and aims line-to-line match, to keep the behavior. Initially I want to convert all the HRESULTs to exceptions, but it's not practical to use exception in control logic of binder, so I kept HRESULTs in AssemblyBinderCommon.

/cc @jkotas

Is this in the desired direction? I haven't do anything around managed/unmanaged boundary. I think code should be ported to managed, until something we don't want.

Author:	huoyaoyuan
Assignees:	-
Labels:	`area-AssemblyLoader-coreclr`
Milestone:	-

jkotas · 2023-08-31T17:06:17Z

This is good raw material. Creating a clean managed/unmanaged boundary is probably going to be the harder part. Some thoughts:

The unmanaged binder calls out to managed to log tracing messages and to fire resolve events. This back and forth between managed and unmanaged should go away, the unmanaged code should just do a single call to managed to do everything.
The binding using TPA list should be folded into DefaultAssemblyLoadContext, no need for it to live in separate type.
The managed equivalent of TextualIdentityParser exists as System.Reflection.AssemblyNameFormatter. We should just use the managed AssemblyNameFormatter instead of duplicating it as TextualIdentifyParser.
TextualIdentifyParser may be used by debugger or in places that are not able to run managed code. If you run into situations like this, leave it alone in C++ for now.
We avoid C# record in core libraries. C# record generates too much hidden code bloat that the trimmer is not able to delete.
I am not sure what is the purpose of the binding failure HRESULT error cache. The VM caches the binding failures with the exception details (look for AppDomain::AddExceptionToCache). I would think that the binding failure HRESULT error cache should be unnecessary.

src/coreclr/System.Private.CoreLib/src/Internal/Runtime/Binder/AssemblyName.cs

src/coreclr/System.Private.CoreLib/src/Internal/Runtime/Binder/ApplicationContext.cs

jkotas · 2023-08-31T18:04:23Z

src/coreclr/System.Private.CoreLib/src/Internal/Runtime/Binder/AssemblyName.cs

+            int* dwPAFlags = stackalloc int[2];
+            using IMdInternalImport pIMetaDataAssemblyImport = BinderAcquireImport(pPEImage, dwPAFlags);
+
+            Architecture = AssemblyBinderCommon.TranslatePEToArchitectureType(dwPAFlags);


The managed binder should not need to deal with image architectures. The unmanaged runtime should validate very early whether the image is good for current architecture (and reject it if it is not). By the time we get to the managed binder, we should know that the image architecture is good.

I started with focusing the data structures, and did not look at the caller. Leaving the cleanup later should be fine, since cleaning managed code is easy.

vitek-karas · 2023-08-31T18:20:10Z

@elinor-fung FYI

huoyaoyuan · 2023-08-31T18:31:34Z

Good points for the behaviors that we don't need in managed binder. I'm aiming for a working POC first, and not tweaking yet. Records are just simplification since I don't want to write GetHashCode etc. first.

Rough thought of steps:

Explore the scope of managed binder, create shape for things to port, and use placeholder for boundary and complex functions.
Fulfill implementation and boundary incrementally, to get the POC working.
Create feature switch guard for unmanaged code replaced, and filter out things we can't remove, for example which used by debugger.
Tweak boundary and optimise managed code.
Turn on the feature switch eventually.

Questions:

Should I create a tracking issue for this? This is likely not a single-PR work.
Should we get some code merged with feature switch disabled first? How complete should it be?
Will this affect bootstrapping on new platform/architecture?

lambdageek · 2024-01-22T15:47:12Z

@jkotas More for my own education, rather than as a comment on the details of this PR: how does a managed assembly binder interact with class loading/initialization in coreclr? In Mono we have scenarios where constructing a class triggers assembly loading while we're holding the global loader lock - which can run managed loader events which can lead to deadlocks (#72796) . My understanding is that CoreCLR class loading is less atomic - relying on load levels and individual class locks. But I was under the impression that there are still scenarios where managed callbacks fire while a native lock is held. Does moving the assembly binder to managed code affect those scenarios?

Also what happens if you trigger class initialization from the managed assembly binder? One way I could confuse mono was by doing a Console.WriteLine in the AppDomain.AssemblyLoad event handler before calling WriteLine (#51881) - which triggers loading of System.Console.dll which triggers a recursive class initialization of Console

It's possible that the interaction between the assembly binder and class initialization is more well behaved in CoreCLR, but I'm curious if porting the binder to managed has any impact.

jkotas · 2024-01-22T16:53:45Z

Right, assembly loader needs to be able to call AssemblyLoadContext callbacks and AppDomain events that are managed code. Rewriting more of the assembly loader in managed code means that there is going to be more managed code running, but it does not change any fundamental invariants.

I was under the impression that there are still scenarios where managed callbacks fire while a native lock is held. Does moving the assembly binder to managed code affect those scenarios?

CoreCLR class loading/initialization has lock per type. (The implementation of this lock is being rewritten in #96653 to be more scalable and performant.) Triggering assembly loading or more type loading from type loading works fine, as long as there is no recursive loading of the same type.

class initialization

CoreCLR does not trigger class initializers during type or assembly loader. The class initializers are triggered only once the code runs. We are out of the assembly and type loader by that time.

agocke · 2024-02-27T18:56:05Z

Moved to draft while we figure out if we can either fix the perf regressions, or accept them.

huoyaoyuan · 2024-03-10T18:25:55Z

I've done some measurement about working set:

It's about 3.5MB of total regression comparing to main for a given run. Break down by vmmap:
Heap (unmanaged?): +0.8MB
Image: +1.5MB. The working set of CoreLib, coreclr, clrjit and icu.dll has all increased.
Managed: vmmap can't show this category of PR build, but successes for main branch build. It's about 1.5MB for main.
Mapped file: +0.6MB for icudtl.dat
Page table: -0.3MB
Private data: +276KB, a block of R/W data
Sharable: +2MB. Not sure what it represents.

So one obvious conclusion is that some globalization related code are touched unintentionally.

Setting globalization invariant mode doesn't change the regression though.

ANahr

There are multiple cases in the code that might trigger globalization code

src/coreclr/System.Private.CoreLib/src/System/Runtime/Loader/ApplicationContext.cs

src/coreclr/System.Private.CoreLib/src/System/Runtime/Loader/AssemblyName.cs

dotnet-policy-service · 2024-04-12T05:05:19Z

Draft Pull Request was automatically closed for 30 days of inactivity. Please let us know if you'd like to reopen it.

huoyaoyuan · 2024-04-12T05:15:40Z

Keep open since it's functionally complete.

dotnet-policy-service · 2024-05-12T06:35:15Z

Draft Pull Request was automatically closed for 30 days of inactivity. Please let us know if you'd like to reopen it.

huoyaoyuan · 2024-05-13T15:54:51Z

Status report:

Working Set:

About 15.2 MB -> 16.3 MB under default configuration.
The difference is mainly +0.6MB in mapped image of CoreLib, and +0.3MB of managed heap, +0.2MB of sharable page.
There is reduction of mapped image of coreclr.dll, but the difference is much smaller.

Start time:

Measured with Measure-Command in PowerShell as an external source. The startup time range shifts from 23.6 ~ 24.8ms to 25.3 ~ 26.7ms. The regression is clearly measurable.

File size:

-39KB in coreclr.dll, +92KB in CoreLib (R2R).

There are still some space of dead code for optimization, but I kept this PR as close to native code as possible to help reviewing.
Cleanup around managed->native calls may also improve performance in the future.

huoyaoyuan · 2024-05-13T15:59:10Z

src/coreclr/System.Private.CoreLib/src/System/Runtime/Loader/ApplicationContext.cs

+                if (outPath.EndsWith(".ni.dll", StringComparison.OrdinalIgnoreCase)
+                    || outPath.EndsWith(".ni.exe", StringComparison.OrdinalIgnoreCase))
+                {
+                    simpleName = outPath[iSimpleNameStart..^7];
+                    isNativeImage = true;
+                }


Is ni.dll still a thing? I remember it's output of NGen.

Even if it's still a thing, we should probably remove this too because the shared framework (TPA) does not contain any ni files.
The performance difference is negligible though.

huoyaoyuan added 18 commits August 30, 2023 16:11

Definition of AssemblyVersion and AssemblyIdentity

dcecbdc

Port logic body of AssemblyName.Init

c946228

Port HashCode and Equals for AssemblyName

b4c0ba8

AssemblyName.GetDisplayName

a6bf121

Assembly

bf85741

Some methods in AssemblyBinderCommon and BindReseult

61143ce

ApplicationContext

5fdf8a7

Basic assembly binder wrapper

7175e30

Refactor FailureCache

6cfdc44

BindLocked

52dae46

FindInExecutionContext

e7789cd

BindByTpaList

c32ec98

RegisterAndGetHostChosen

9a47f75

BindAssemblyByProbingPaths

21ea3e1

Use HResult instead of exception

93b4b10

GetAssembly

a7de55a

BindUsingPEImage

9c77d7e

Guard behind feature switch

116f7bf

dotnet-issue-labeler bot added the area-AssemblyLoader-coreclr label Aug 31, 2023

ghost added the community-contribution Indicates that the PR has been added by a community member label Aug 31, 2023

jkotas reviewed Aug 31, 2023

View reviewed changes

src/coreclr/System.Private.CoreLib/src/Internal/Runtime/Binder/AssemblyName.cs Outdated Show resolved Hide resolved

jkotas reviewed Aug 31, 2023

View reviewed changes

src/coreclr/System.Private.CoreLib/src/Internal/Runtime/Binder/ApplicationContext.cs Outdated Show resolved Hide resolved

jkotas reviewed Aug 31, 2023

View reviewed changes

This was referenced Aug 31, 2023

Microsoft.NET.HostModel.Tests failing with "No space left on device" #91039

Closed

JSMarshalAsAttribute failing error tests #91410

Closed

Reuse IsPathFullyQualified

9fff7e2

agocke marked this pull request as draft February 27, 2024 18:55

huoyaoyuan added 2 commits March 10, 2024 20:58

Merge branch 'main'

77ff24e

Use separated helper to handle nullable array

146c0f7

ANahr reviewed Mar 11, 2024

View reviewed changes

Don't use InvariantCulture

8afeb7d

dotnet-policy-service bot closed this Apr 12, 2024

huoyaoyuan reopened this Apr 12, 2024

dotnet-policy-service bot closed this May 12, 2024

huoyaoyuan reopened this May 12, 2024

huoyaoyuan added 3 commits May 13, 2024 17:28

Merge branch 'main'

8ee2a01

Update MDImport to latest pattern

dfeaf1b

Use class to reduce a generic instantiation

22a25ba

This was referenced May 13, 2024

LibraryImportGenerator.Unit.Tests crashing on linux-x64 mono interpreter #100800

Open

[mono][interpreter] Mono interpreter is crashing during System.Data.Odbc.Tests (linux-x64 Release Mono_Interpreter_LibrariesTests) #101370

Open

Remove more generic instantiation on Dictionary

dbb3a20

huoyaoyuan commented May 13, 2024

View reviewed changes

build-analysis bot mentioned this pull request May 13, 2024

System.Net.Tests.HttpWebRequestTest_Async.GetResponseAsync_ParametersAreNotCachable_CreateNewClient test fails #100912

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Create a managed implementation of assembly binder #91400

Create a managed implementation of assembly binder #91400

huoyaoyuan commented Aug 31, 2023

ghost commented Aug 31, 2023

jkotas commented Aug 31, 2023

jkotas Aug 31, 2023

huoyaoyuan Aug 31, 2023

vitek-karas commented Aug 31, 2023

huoyaoyuan commented Aug 31, 2023

lambdageek commented Jan 22, 2024

jkotas commented Jan 22, 2024

agocke commented Feb 27, 2024

huoyaoyuan commented Mar 10, 2024 •

edited

ANahr left a comment

dotnet-policy-service bot commented Apr 12, 2024

huoyaoyuan commented Apr 12, 2024

dotnet-policy-service bot commented May 12, 2024

huoyaoyuan commented May 13, 2024

huoyaoyuan May 13, 2024 •

edited

Create a managed implementation of assembly binder #91400

Are you sure you want to change the base?

Create a managed implementation of assembly binder #91400

Conversation

huoyaoyuan commented Aug 31, 2023

ghost commented Aug 31, 2023

jkotas commented Aug 31, 2023

jkotas Aug 31, 2023

Choose a reason for hiding this comment

huoyaoyuan Aug 31, 2023

Choose a reason for hiding this comment

vitek-karas commented Aug 31, 2023

huoyaoyuan commented Aug 31, 2023

lambdageek commented Jan 22, 2024

jkotas commented Jan 22, 2024

agocke commented Feb 27, 2024

huoyaoyuan commented Mar 10, 2024 • edited

ANahr left a comment

Choose a reason for hiding this comment

dotnet-policy-service bot commented Apr 12, 2024

huoyaoyuan commented Apr 12, 2024

dotnet-policy-service bot commented May 12, 2024

huoyaoyuan commented May 13, 2024

Status report:

Working Set:

Start time:

File size:

huoyaoyuan May 13, 2024 • edited

Choose a reason for hiding this comment

huoyaoyuan commented Mar 10, 2024 •

edited

huoyaoyuan May 13, 2024 •

edited