Inline nullable allocators #122167

EgorBo · 2025-12-03T23:56:49Z

Today we never inline & stack-allocate (via the escape analysis) boxed Nullable<>, let's see if we can fix that.

object Test(long? n)
{
    return n;
}

Main:

; Method My:Test(System.Nullable`1[long]):System.Object:this (FullOpts)
       sub      rsp, 40
       mov      rcx, 0x7FFBCE1680C0      ; System.Nullable`1[long]
       call     [CORINFO_HELP_BOX_NULLABLE]
       nop      
       add      rsp, 40
       ret      
; Total bytes of code: 26

PR:

; Method My:Test(System.Nullable`1[long]):System.Object:this (FullOpts)
       push     rbx
       sub      rsp, 32
       mov      rbx, rdx
       cmp      byte  ptr [rbx], 0
       je       SHORT G_M7100_IG04
       mov      rcx, 0x7FFBDFBC08B0      ; System.Int64
       call     CORINFO_HELP_NEWSFAST
       mov      rcx, qword ptr [rbx+0x08]
       mov      qword ptr [rax+0x08], rcx
       jmp      SHORT G_M7100_IG05
G_M7100_IG04:
       xor      rax, rax
G_M7100_IG05:
       add      rsp, 32
       pop      rbx
       ret      
; Total bytes of code: 46

Another example (from #114497 (comment)) cc @pentp

public static string? Format<T>(T value)
{
    if (value is IFormattable formattable)
        return formattable.ToString(null, null);
    return null;
}

diff.
Doesn't allocate anymore to box that nullable.

Benchmarks: EgorBot/runtime-utils#563

src/coreclr/jit/compiler.h

EgorBo · 2025-12-04T22:57:08Z

Enabled structs too (basically, all types, even shared), so now #114497 (comment) is properly handed.

EgorBo · 2025-12-04T23:35:55Z

@EgorBot -amd -arm

using BenchmarkDotNet.Attributes;

[MemoryDiagnoser]
[DisassemblyDiagnoser]
public class Bench
{
    private Nullable<bool> _null = null;
    private Nullable<bool> _nonnull = true;

    [Benchmark] public object BoxNull() => _null; 

    [Benchmark] public object BoxNonNull() => _nonnull;

    // https://github.com/dotnet/runtime/issues/50915
    private S? _ns = (S?)default(S);
    [Benchmark] public int Issue50915() => CallM(_ns);
    static int CallM<T>(T t)
    {
        if (t is IMyInterface)
            return ((IMyInterface)t).M();
        return 0;
    }
}

interface IMyInterface
{
    int M();
}

struct S : IMyInterface
{
    public int M() => 42;
}

Copilot

Pull request overview

This PR optimizes nullable boxing operations by introducing early inline expansion instead of using helper calls. The optimization enables escape analysis to stack-allocate boxed nullable values when they don't escape the method, eliminating heap allocations in hot paths. The implementation adds a new early QMARK expansion phase that runs after import but before other optimizations, allowing subsequent passes to optimize the expanded control flow.

Key changes:

Inline expansion of nullable box operations in hot, optimized code paths using conditional allocation
New early QMARK expansion phase to enable object stack allocation optimizations
COMMA node splitting during QMARK expansion to improve optimization opportunities

Reviewed changes

Copilot reviewed 5 out of 5 changed files in this pull request and generated 2 comments.

Show a summary per file

File	Description
src/coreclr/jit/importer.cpp	Adds inline expansion of nullable boxing with conditional allocation based on hasValue field
src/coreclr/jit/morph.cpp	Enhances QMARK expansion to support early phase, adds COMMA splitting, changes return type to PhaseStatus
src/coreclr/jit/compiler.cpp	Adds early QMARK expansion phase after import and updates late expansion call signature
src/coreclr/jit/compiler.h	Adds OMF_HAS_EARLY_QMARKS flag and updates fgExpandQmarkNodes signature
src/coreclr/jit/compphases.h	Defines new PHASE_EARLY_QMARK_EXPANSION phase

src/coreclr/jit/morph.cpp

src/coreclr/jit/importer.cpp

EgorBo · 2025-12-05T13:44:39Z

@MihuBot

src/coreclr/jit/importer.cpp

AndyAyersMS · 2025-12-05T15:39:55Z

src/coreclr/jit/importer.cpp

+    // E.g. if we are boxing Nullable<System.Int32>, typeToBox is System.Int32.
+    CORINFO_RESOLVED_TOKEN tk = *pResolvedToken;
+    tk.hClass                 = typeToBox;
+    tk.tokenType              = CORINFO_TOKENKIND_Casting;


We had some offline discussion about this... would it make sense to introduce new token kind here rather than overloading CORINFO_TOKENKIND_Casting?

@jkotas @MichalStrehovsky do you an opinion here? Context:
We have a token representing, say box Nullable<int>, we need to emit an allocator for boxed int (not Nullable<int> since we box the underlying type only). I have to call our work horse routine that accepts ResolvedToken which end up calling embedGenericHandle. so I ended up using this hack

CORINFO_RESOLVED_TOKEN tk = *pResolvedToken; tk.hClass = typeToBox; tk.tokenType = CORINFO_TOKENKIND_Casting;

where I create a copy of that token and change its type from Boxing to Casting so I hit this path:

runtime/src/coreclr/tools/aot/ILCompiler.Compiler/Compiler/Compilation.cs

Lines 338 to 340 in 495fca5

if (type.IsNullable)

type = type.Instantiation[0];

return NecessaryTypeSymbolIfPossible(type);

(because ResolveToken.hClass is ignored)

is it acceptable? it seems to work for all three runtimes (coreclr, r2r and naot)

it seems to work for all three runtime

If it actually worked, you would not need to disable it for shared generic types. eeIsSharedInst check above is part of the hack.

This hack may cause problems with NAOT tracking of necessary vs. maximally constructed handles. I am not sure.

it seems to work for all three runtime

If it actually worked, you would not need to disable it for shared generic types. eeIsSharedInst check above is part of the hack.

This hack may cause problems with NAOT tracking of necessary vs. maximally constructed handles. I am not sure.

Runtime lookups are problematic in my case even without it - the code that emits them doesn't support emitting them inside QMARKs (so they're constructed only inside the corresponding branch, it can be fixed, though). For non-shared types I expect NAOT's dependency analsys to track both Nullable<T> and just T when ILScan hits box opcode, but I'm no expert in NAOT to say for sure

I don't know how to properly check the dependency analysis but this seems to be compiling just fine:

class MyProgram { static void Main() => Box(null); [MethodImpl(MethodImplOptions.NoInlining)] static object Box(MyStruct1? ms) => ms; } public struct MyStruct1 {}

so MyStruct1 is not used anywhere explicitly, but ILC gives a proper type for the allocator:

G_M42651_IG01: ;; offset=0x0000 sub rsp, 40 mov dword ptr [rsp+0x30], ecx ;; size=8 bbWeight=1 PerfScore 1.25 G_M42651_IG02: ;; offset=0x0008 test cl, cl je SHORT G_M42651_IG04 ;; size=4 bbWeight=1 PerfScore 1.25 G_M42651_IG03: ;; offset=0x000C lea rcx, [(reloc 0x4000000000420a08)] ; MyStruct1 call CORINFO_HELP_NEWSFAST mov cl, byte ptr [rsp+0x31] mov byte ptr [rax+0x08], cl jmp SHORT G_M42651_IG05 ;; size=21 bbWeight=0.25 PerfScore 1.38 G_M42651_IG04: ;; offset=0x0021 xor rax, rax ;; size=2 bbWeight=0.25 PerfScore 0.06 G_M42651_IG05: ;; offset=0x0023 add rsp, 40 ret ;; size=5 bbWeight=1 PerfScore 1.25 ; Total bytes of code 40

It will probably work today wrt dependency analysis because we already have to be very careful not to expose RyuJIT to necessary/constructed MethodTables. Doesn't mean we're not going to curse this hack two years down the road (if/when we start using RyuJIT as a scanner and this will end up boxing a MethodTables that doesn't have a GCDesc at runtime because MethodTables for casting don't get them). It is a hack.

@jkotas @MichalStrehovsky @AndyAyersMS Ok, I ended up simplifying the implementation without this hack, with just non-shared types and no R2R (cause getReadyToRunHelper JIT-EE API also expects a resolvedtoken and R2R doesn't support stack-allocation I believe which makes it less beneficial to expand for). NAOT is supported.

I still expect NAOT to keep the underlying Nullable<> type alive. I believe it's already required even without my PR - the current importation logic calls getTypeForBox and sets it on the resuling local so then various cast related optimization can work with it.

Does it sound good?

R2R supports stack allocation of objects. It does not (yet) support stack allocation for arrays.

R2R supports stack allocation of objects. It does not (yet) support stack allocation for arrays.

Good to know. Still, it requires JIT-EE work to enable so I left it for the future.

src/coreclr/jit/morph.cpp

EgorBo · 2025-12-06T01:53:00Z

The superpmi diffs are obviously useless due to missing contexts. The normal jit-diff (e.g. this) is also a mess due to the way PMI works - we take all generics and try to instantiate them with one of the 7 predefined types (see here) and int? in that list which meant tons of boxings on top of nullable where nullable is never really used. If I remove from that list and only leave real usage, the diff become much more smaller and reasonable (it's expected for it to be a size increase obviously).

Inline nullable allocators

6ce0a3b

github-actions bot added the area-CodeGen-coreclr CLR JIT compiler in src/coreclr/src/jit and related components such as SuperPMI label Dec 3, 2025

dotnet-policy-service bot assigned EgorBo Dec 3, 2025

MihuBot mentioned this pull request Dec 4, 2025

[JitDiff X64] [EgorBo] Inline nullable allocators MihuBot/runtime-utils#1665

Open

This comment was marked as outdated.

Sign in to view

pentp reviewed Dec 4, 2025

View reviewed changes

src/coreclr/jit/compiler.h Outdated Show resolved Hide resolved

EgorBo added 2 commits December 4, 2025 02:59

Update importer.cpp

8802daa

test

8292f7b

MihuBot mentioned this pull request Dec 4, 2025

[JitDiff X64] [EgorBo] Inline nullable allocators MihuBot/runtime-utils#1670

Open

test

6fb8a3e

MihuBot mentioned this pull request Dec 4, 2025

[JitDiff X64] [EgorBo] Inline nullable allocators MihuBot/runtime-utils#1671

Open

EgorBo added 3 commits December 4, 2025 21:23

fix naot

147653d

enable for all types

17d3f60

add comments

c9bd019

MihuBot mentioned this pull request Dec 4, 2025

[JitDiff X64] [EgorBo] Inline nullable allocators MihuBot/runtime-utils#1672

Open

EgorBot mentioned this pull request Dec 4, 2025

Benchmarks for #122167 (EgorBo) EgorBot/runtime-utils#563

Open

EgorBo added 2 commits December 5, 2025 00:47

disable shared generics

6384a5e

expand qmarks twice

926ac20

MihuBot mentioned this pull request Dec 5, 2025

[JitDiff X64] [EgorBo] Inline nullable allocators MihuBot/runtime-utils#1673

Open

build-analysis bot mentioned this pull request Dec 5, 2025

Vector saturate tests failing on arm32 #122185

Closed

EgorBo marked this pull request as ready for review December 5, 2025 09:10

Copilot AI review requested due to automatic review settings December 5, 2025 09:10

Copilot started reviewing on behalf of EgorBo December 5, 2025 09:11 View session

Copilot finished reviewing on behalf of EgorBo December 5, 2025 09:14

Copilot AI reviewed Dec 5, 2025

View reviewed changes

src/coreclr/jit/morph.cpp Show resolved Hide resolved

src/coreclr/jit/importer.cpp Show resolved Hide resolved

This was referenced Dec 5, 2025

The Operation will be canceled. The next steps may not contain expected logs. dotnet/dnceng#3008

Open

[android-arm64] The Operation will be canceled. The next steps may not contain expected logs. dotnet/dnceng#6408

Open

Clean up

44e2f7a

EgorBo added 2 commits December 5, 2025 13:27

clean up

179458e

clean up

25d856a

MihuBot mentioned this pull request Dec 5, 2025

[JitDiff X64] [EgorBo] Inline nullable allocators MihuBot/runtime-utils#1676

Open

AndyAyersMS reviewed Dec 5, 2025

View reviewed changes

src/coreclr/jit/importer.cpp Show resolved Hide resolved

AndyAyersMS reviewed Dec 5, 2025

View reviewed changes

build-analysis bot mentioned this pull request Dec 5, 2025

System.Security.Cryptography.X509Certificates.Tests: Assertion failed: pkey != NULL #116307

Open

feedback

ab99de1

EgorBo force-pushed the inline-nullable-allocators branch from c1431db to ab99de1 Compare December 5, 2025 21:27

Simplify the logic for now

69f6243

EgorBo force-pushed the inline-nullable-allocators branch from 23f505f to 69f6243 Compare December 6, 2025 01:30

EgorBo added 2 commits December 6, 2025 02:38

clean up

426c227

Merge branch 'main' into inline-nullable-allocators

91708dd

EgorBo requested a review from AndyAyersMS December 6, 2025 01:54

This was referenced Dec 6, 2025

"We stopped hearing from agent Azure Pipelines 32. Verify the agent machine is running and has a healthy network connection" dotnet/dnceng#1886

Open

Intermittent build failure in AfterSourceBuild: "Could not write state file" #76488

Open

	if (type.IsNullable)
	type = type.Instantiation[0];
	return NecessaryTypeSymbolIfPossible(type);

Inline nullable allocators #122167

Are you sure you want to change the base?

Inline nullable allocators #122167

Conversation

EgorBo commented Dec 3, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

This comment was marked as outdated.

Uh oh!

EgorBo commented Dec 4, 2025

Uh oh!

EgorBo commented Dec 4, 2025

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Uh oh!

EgorBo commented Dec 5, 2025

Uh oh!

Uh oh!

AndyAyersMS Dec 5, 2025

Choose a reason for hiding this comment

Uh oh!

EgorBo Dec 5, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

jkotas Dec 5, 2025

Choose a reason for hiding this comment

Uh oh!

EgorBo Dec 5, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

EgorBo Dec 5, 2025

Choose a reason for hiding this comment

Uh oh!

MichalStrehovsky Dec 5, 2025

Choose a reason for hiding this comment

Uh oh!

EgorBo Dec 6, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

AndyAyersMS Dec 6, 2025

Choose a reason for hiding this comment

Uh oh!

EgorBo Dec 6, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

EgorBo commented Dec 6, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

EgorBo commented Dec 3, 2025 •

edited

Loading

EgorBo Dec 5, 2025 •

edited

Loading

EgorBo Dec 5, 2025 •

edited

Loading

EgorBo Dec 6, 2025 •

edited

Loading

EgorBo commented Dec 6, 2025 •

edited

Loading