Skip to content

Add WinRT.Interop.dll public API spec #2002

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
wants to merge 2 commits into
base: fhl/cswinrt-3.0
Choose a base branch
from
Open
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
116 changes: 116 additions & 0 deletions docs/winrt-interop-dll-spec.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,116 @@
# `WinRT.Interop.dll` public API specification

## Overview

The `WinRT.Interop.dll` is a dynamically generated assembly providing additional marshalling code for applications and libraries that require interop with WinRT APIs. This assembly is generated on the fly after build, by an MSBuild task bundled with CsWinRT. This allows the interop .dll to leverage from global-program-view info, as it can see all types in the entire application domain. This enables several performance optimizations (eg. all vtables can be pre-initialized), security features (all vtables will be in readonly data segments in the PE file), and usability improvements (no need to mark types as being marshalled, things will "just work").
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Will there be a separate spec that describes the task and the conditions under which the task runs?


This document provides the specification for the public API of this interop .dll, allowing other projects to rely on these generated members being present and having the detailed format and signature. All generated code not documented here is considered an implementation detail, which can (and will) change at any time, without following semantic versioning.

> [!NOTE]
> The interop .dll cannot be referenced by any other assembly (as it is produced at the very end of the build process), so all upstream assemblies that need to invoke APIs from it must do so by using [`[UnsafeAccessor]`](https://learn.microsoft.com/dotnet/api/system.runtime.compilerservices.unsafeaccessorattribute) and `[UnsafeAccessorType]`. All generated projections and code within `WinRT.Runtime.dll` will use this technique to access all of these interop APIs. User code should never try to or need to do this manually: all of this code exists solely to support the WinRT marshalling infrastructure behind the scenes.

## Name mangling scheme

The name mangling scheme for interop types is designed to ensure that generated type names are unique, compact, and descriptive. It uses a combination of assembly names, namespaces, and type names to construct the final mangled name. The scheme handles primitive types, user-defined types, and generic types, including nested generics.

The **mangled namespace** for a given type is defined as follows:

1. **No namespace**: if the input type has no namespace, the generated namespace defaults to `ABI`.
2. **Existing namespace**: if the input type has a namespace, the generated namespace prepends `ABI.` to the original namespace.

The **mangled type name** for a given type is defined as follows:

1. **Primitive types**: well-known primitive types (e.g., `int`, `string`) are mapped to their corresponding identifiers.
2. **User-defined types**: the type name is prefixed with the assembly name (or a compact identifier for well-known assemblies) within angle brackets (i.e. `<>`), and suffixed with the type name. The namespace is not included, as it matches the containing namespace for the generated type (without the `ABI[.]` prefix).
3. **Generic types**: their type arguments are enclosed in angle brackets, right after the type name. Nested generics are recursively processed, and type arguments are separated by a pipe (i.e. `|`). Each type argument also has its name prefixed by the containing namespace.

All `.` characters in the final mangled name are replaced with `-` characters.

> [!NOTE]
> This is not strictly required, as the `.` character is a valid character for an identifier as per ECMA-335. However, using that character can be inconvenient when using reflection APIs to inspect such types, as it makes it not possible to easily distinguish the namespace from the actual type name. So to account for this, we just do this substitution, given the final length of the mangled name remains the same anyway when doing so.

These are the well-known assemblies and their compact identifiers:
- `System.Runtime`: `#corlib`
- `Microsoft.Windows.SDK.NET` or `Microsoft.Windows.UI.Xaml`: `#Windows`
- `WinRT.Runtime`: `#CsWinRT`
- `Microsoft.UI.Xaml.Projection`: `#WinUI2`
- `Microsoft.Graphics.Canvas.Interop`: `#Win2D`

Compact identifiers are prefixed with `#` to distinguish them from user-defined assembly names.

### Examples

**Primitive type**

- Type: `System.Int32`
- Mangled name: `ABI.System.<#corlib>int`

**User-defined type**

- Type: `MyNamespace.MyType` (from assembly `MyAssembly`)
- Mangled name: `ABI.MyNamespace.<MyAssembly>MyType`

**Generic type**

- Type: `System.Collections.Generic.IEnumerable<System.String>`
- Mangled name: ``ABI.System.Collections.Generic.<#corlib>IEnumerable`1<<#corlib>string>"``

**Nested generic type**

- Type: `System.Collections.Generic.ICollection<System.Collections.Generic.KeyValuePairSystem.String, MyNamespace.MyType>` (`MyType` is from assembly `MyAssembly`)
- Mangled name: ``ABI.System.Collections.Generic.<#corlib>ICollection`1<<#corlib>System-Collections-Generic-KeyValuePair`2<<#corlib>string|<MyAssembly>MyNamespace-MyType>>``

### ANTLR4 name mangling rules

Below is the full specification of the name mangling scheme using ANTLR4 syntax:

```antlr
grammar NameMangling;

// Entry point for a mangled name
mangledName : namespace '.' mangledTypeName EOF;

// Namespace rules
namespace : 'ABI' ('.' identifier)*;

// Mangled type name rules
mangledTypeName : primitiveType
| userDefinedType
| genericType;

// Primitive types
primitiveType : 'bool'
| 'char'
| 'sbyte'
| 'byte'
| 'short'
| 'ushort'
| 'int'
| 'uint'
| 'long'
| 'ulong'
| 'float'
| 'double'
| 'string'
| 'object';

// User-defined types
userDefinedType : '<' assemblyName '>' identifier;

// Generic types
genericType : '<' assemblyName '>' identifier '<' typeArgument ( '|' typeArgument )* '>';
typeArgument : primitiveType
| userDefinedType
| genericType;

// Assembly name rules
assemblyName : '#corlib'
| '#Windows'
| '#CsWinRT'
| '#WinUI2'
| '#Win2D'
| identifier;

// Identifier rules
identifier : [a-zA-Z_][a-zA-Z0-9_]*;
```
Loading