Follow-up: memory-based activation shedding #9577

DeagleGross · 2025-06-18T14:55:47Z

Adding logs and refactoring configuration of #9532.
also testing whether the memory pressure works against CI agents here.

Microsoft Reviewers: Open in CodeFlow

src/Orleans.Runtime/Configuration/Options/GrainCollectionOptions.cs

…ons to GrainCollectionOptions

ReubenBond · 2025-06-23T16:27:11Z

src/Orleans.Runtime/Catalog/ActivationCollector.cs

@@ -242,7 +240,7 @@ public List<ICollectibleGrainContext> ScanStale()
                        if (!activation.IsValid)
                        {
                            // This is not an error scenario because the activation may have become invalid between the time
-                            // we captured a snapshot in 'DequeueQuantum' and now. We are not be able to observe such changes.
+                            // we captured a snapshot in 'Dequeue_grainCollectionOptions.CollectionQuantum' and now. We are not be able to observe such changes.


This looks like a find-and-replace error

src/Orleans.Runtime/Catalog/ActivationCollector.cs

src/Orleans.Core.Abstractions/Statistics/EnvironmentStatisticExtensions.cs

src/Orleans.Runtime/Catalog/ActivationCollector.cs

ReubenBond · 2025-06-23T17:03:50Z

test/NonSilo.Tests/SiloBuilderTests.cs

+            public EnvironmentStatistics GetEnvironmentStatistics()
+            {
+                EnvironmentStatistics stats = new();
+                if (!stats.IsValid())


Wont this always be false?

I left this code intentionally in this way to let people know that we try with stats, and we fallback to realStatisticsProvider in case something is wrong with the stats.

At this point we should probably just remove all "fake" stat providers - because why do we need them if we are ending up using the real one?

I mean next time this code is touched, all people will do is fake the stats somehow, and they will be valid, but faked. In case something went wrong stats are replaced

The reason fake stats are used is for testing - i.e, to present some condition like high memory or CPU usage and verify that the behavior is expected.

But if a branch can never be taken, leaving it in the code is misleading. If I read some code and there is a branch, I expect that sometimes the branch will be taken, and sometimes it will not, depending on some condition. In this case, new EnvironmentStatistics().IsValid() is always false, so the branch will always be taken, and therefore the whole method may as well just return the real stats to begin with rather than misdirect the reader.

It looks like we can remove the entire type because there is an IEnvironmentStatisticsProvider available by default in Orleans now. That wasn't true when this test was written but the test wasn't updated since.

The test can be changed to:

/// <summary> /// Ensures <see cref="LoadSheddingValidator"/> fails when LoadSheddingLimit greater than 100. /// </summary> [Fact] public async Task SiloBuilder_LoadSheddingValidatorAbove100ShouldFail() { await Assert.ThrowsAsync<OrleansConfigurationException>(async () => { await new HostBuilder().UseOrleans((ctx, siloBuilder) => { siloBuilder .UseLocalhostClustering() .Configure<ClusterOptions>(options => options.ClusterId = "someClusterId") .Configure<EndpointOptions>(options => options.AdvertisedIPAddress = IPAddress.Loopback) .ConfigureServices(services => services.AddSingleton<IMembershipTable, NoOpMembershipTable>()) .Configure<LoadSheddingOptions>(options => { options.LoadSheddingEnabled = true; options.CpuThreshold = 101; }); }).RunConsoleAsync(); }); }

removed completely noop stats providers, now only meaningful set of stats or just taking the real ones.

DeagleGross added 2 commits June 18, 2025 16:54

follow up

0a0858b

detailed message name

54b5590

DeagleGross self-assigned this Jun 18, 2025

no env

dd66bbc

ReubenBond reviewed Jun 18, 2025

View reviewed changes

src/Orleans.Runtime/Configuration/Options/GrainCollectionOptions.cs Outdated Show resolved Hide resolved

DeagleGross added 5 commits June 18, 2025 17:31

info log - it will strike only when high memory pressure

f208ff0

log bytes to see actual values

bbf9189

only run when gen2 GC fired at least once

4311a04

<= 0

47e4325

more logs / rollout parameters from MemoryPressureGrainCollectionOpti…

7553c4a

…ons to GrainCollectionOptions

DeagleGross mentioned this pull request Jun 18, 2025

Memory pressure based activation shedding #9532

Merged

DeagleGross added 4 commits June 18, 2025 22:26

use non-fake environmnet stats if invalid

4915ed0

try different calculation

3724701

use same limit elsewhere

2723931

reformat tests

930588e

ReubenBond reviewed Jun 23, 2025

View reviewed changes

src/Orleans.Runtime/Catalog/ActivationCollector.cs Outdated Show resolved Hide resolved

ReubenBond reviewed Jun 23, 2025

View reviewed changes

src/Orleans.Core.Abstractions/Statistics/EnvironmentStatisticExtensions.cs Outdated Show resolved Hide resolved

ReubenBond reviewed Jun 23, 2025

View reviewed changes

src/Orleans.Runtime/Catalog/ActivationCollector.cs Show resolved Hide resolved

ReubenBond reviewed Jun 23, 2025

View reviewed changes

DeagleGross added 2 commits June 23, 2025 19:18

wip

6d002e9

get rid of fake env providers

f6983ca

ReubenBond changed the title ~~follow-up: memory shedding~~ Follow-up: memory-based activation shedding Jun 23, 2025

ReubenBond approved these changes Jun 23, 2025

View reviewed changes

ReubenBond merged commit 0b3e926 into main Jun 23, 2025
32 of 35 checks passed

ReubenBond deleted the dmkorolev/memory-shedding-2 branch June 23, 2025 18:54

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Follow-up: memory-based activation shedding #9577

Follow-up: memory-based activation shedding #9577

Uh oh!

DeagleGross commented Jun 18, 2025 •

edited by dotnet-policy-service bot

Loading

Uh oh!

Uh oh!

ReubenBond Jun 23, 2025

Uh oh!

DeagleGross Jun 23, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

ReubenBond Jun 23, 2025

Uh oh!

DeagleGross Jun 23, 2025

Uh oh!

DeagleGross Jun 23, 2025

Uh oh!

ReubenBond Jun 23, 2025

Uh oh!

ReubenBond Jun 23, 2025

Uh oh!

ReubenBond Jun 23, 2025

Uh oh!

DeagleGross Jun 23, 2025

Uh oh!

Uh oh!

Uh oh!

Follow-up: memory-based activation shedding #9577

Follow-up: memory-based activation shedding #9577

Uh oh!

Conversation

DeagleGross commented Jun 18, 2025 • edited by dotnet-policy-service bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Microsoft Reviewers: Open in CodeFlow

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

DeagleGross commented Jun 18, 2025 •

edited by dotnet-policy-service bot

Loading