

Faster optimized frozen dictionary creation (1/n) #87510

Merged
adamsitnik merged 3 commits into dotnet:main on Jun 14, 2023

Conversation

adamsitnik (Member) commented on Jun 13, 2023:

- Every strategy needs an array of keys, so we can create it up-front and iterate over it, rather than over the dictionary, to get the min and max lengths (1-2% gain).
- Instead of ensuring that at least 95% of data is good, we stop when we know that at least 5% is bad (13-14% gain).
- Toggle the direction and re-use the comparer and hash set (3% time gain, 12% allocation reduction).
BenchmarkDotNet=v0.13.2.2052-nightly, OS=Windows 11 (10.0.22621.1702)
AMD Ryzen Threadripper PRO 3945WX 12-Cores, 1 CPU, 24 logical and 12 physical cores
.NET SDK=8.0.100-preview.4.23259.14
  [Host]     : .NET 8.0.0 (8.0.23.25905), X64 RyuJIT AVX2

LaunchCount=3  MaxIterationCount=20  MemoryRandomization=True
| Method | Job | Size | Mean | Ratio | Allocated | Alloc Ratio |
|--------------------------- |------ |-----:|----------:|------:|----------:|------------:|
| FrozenDictionaryOptimized  | PR    | 512  | 73.236 us | 0.82  | 74.91 KB  | 0.88        |
| FrozenDictionaryOptimized  | main  | 512  | 89.431 us | 1.00  | 85.21 KB  | 1.00        |

For this particular benchmark, the initial gap between creating the non-optimized dictionary (4 us) and the optimized one (89.4 us) was 85.4 us; it is now 69.2 us.
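For reference, a minimal sketch of what such a creation benchmark could look like. The class name and data shape are illustrative (the real benchmark lives elsewhere), and the `optimizeForReading` parameter assumes the .NET 8 preview-era `ToFrozenDictionary` overload rather than the final API:

```csharp
using System.Collections.Frozen;
using System.Collections.Generic;
using System.Linq;
using BenchmarkDotNet.Attributes;

public class FrozenDictionaryCreation
{
    [Params(512)]
    public int Size;

    private Dictionary<string, string> _source = null!;

    [GlobalSetup]
    public void Setup() =>
        _source = Enumerable.Range(0, Size).ToDictionary(i => $"key_{i}", i => $"value_{i}");

    // The "optimized" creation path exercised by this PR; optimizeForReading
    // was the preview-era opt-in switch (an assumption made for this sketch).
    [Benchmark]
    public FrozenDictionary<string, string> FrozenDictionaryOptimized() =>
        _source.ToFrozenDictionary(optimizeForReading: true);
}
```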

ghost commented on Jun 13, 2023:

Tagging subscribers to this area: @dotnet/area-system-collections
See info in area-owners.md if you want to be subscribed.

Issue Details
Author: adamsitnik
Assignees: -
Labels: area-System.Collections, tenet-performance

Milestone: -

Tornhoof (Contributor) commented:

> Instead of ensuring that at least 95% of data is good, we stop when we know that at least 5% is bad (13-14% gain)

Do you mean at most 5% is bad? Otherwise you could simply nop everything and 100% is bad, which is at least 5% :)

stephentoub (Member) commented:

> Instead of ensuring that at least 95% of data is good, we stop when we know that at least 5% is bad (13-14% gain)

> Do you mean at most 5% is bad? Otherwise you could simply nop everything and 100% is bad, which is at least 5% :)

The wording is correct. For a given capacity, once we've seen at least 5% of the entries conflict, we know we can't get to > 95% not conflicting, so we can give up on that capacity and try with the next. We can't give up until we've seen at least 5% conflict, and if we never see 5% conflict, then that capacity is good to go and we can use it.
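To make the early-exit concrete, here is a rough sketch of the idea, not the actual FrozenHashTable code; the method name and shape are illustrative:

```csharp
using System.Collections.Generic;

// For a candidate bucket count, give up as soon as more than 5% of the entries
// have landed in an already-used bucket, instead of scanning everything and
// checking the 95% threshold only at the end.
static bool IsAcceptable(int[] hashCodes, int numBuckets, HashSet<int> seenBuckets)
{
    seenBuckets.Clear();

    // With N entries, tolerate at most 5% collisions.
    int allowedCollisions = hashCodes.Length / 20;
    int collisions = 0;

    foreach (int hashCode in hashCodes)
    {
        int bucket = (int)((uint)hashCode % (uint)numBuckets);
        if (!seenBuckets.Add(bucket) && ++collisions > allowedCollisions)
        {
            // We already know this capacity can't reach 95% unique; try the next one.
            return false;
        }
    }

    return true;
}
```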

Tornhoof (Contributor) commented:

Thank you for the explanation.

Review thread on this change:

private sealed class RightJustifiedCaseInsensitiveSubstringComparer : SubstringComparer
private sealed class JustifiedCaseInsensitiveSubstringComparer : SubstringComparer
A reviewer (Member) commented:

I'm happy to hear it's helpful, and the fewer the types the better, but I'm a little surprised this makes a positive impact on throughput, since it's adding more work on every comparison. What's the logic for why it makes things faster?

adamsitnik (Member, Author) replied:

> it's adding more work on every comparison

That is true, but it's less work compared to creating a new HashSet<string>(keys.Length).

[image]

The reviewer (Member) replied:

Is that true just in the example you're profiling or generally? The HashSet will happen once regardless of how many retries are needed.
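For context, a rough sketch of the single "justified" comparer idea under discussion; the type name and members are illustrative, not the actual runtime internals:

```csharp
using System;
using System.Collections.Generic;

// One comparer covers both left- and right-justified substrings: a negative
// Index is interpreted as an offset from the end of the string, so the same
// instance (and the HashSet built with it) can be reused while only the
// direction/offset is toggled between attempts.
public sealed class JustifiedCaseInsensitiveSubstringComparerSketch : IEqualityComparer<string>
{
    public int Index; // >= 0: offset from the start; < 0: offset from the end
    public int Count;

    private ReadOnlySpan<char> Slice(string s) =>
        s.AsSpan(Index >= 0 ? Index : s.Length + Index, Count);

    public bool Equals(string? x, string? y) =>
        Slice(x!).Equals(Slice(y!), StringComparison.OrdinalIgnoreCase);

    public int GetHashCode(string s) =>
        string.GetHashCode(Slice(s), StringComparison.OrdinalIgnoreCase);
}
```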

{
set.Clear();

// SufficientUniquenessFactor of 95% is good enough.
A reviewer (Member) commented:

This constant was deleted.

adamsitnik (Member, Author) replied:

I was too lazy/tired to turn the const name into three separate words. If you don't mind, I am going to do that in the next PR.

adamsitnik (Member, Author) commented:

The failure is unrelated (#87505); merging.

adamsitnik merged commit 9b6bab4 into dotnet:main on Jun 14, 2023
99 of 105 checks passed
IDisposable (Contributor) commented:

@adamsitnik Can you point me to how to generate the benchmarking analysis? I'm looking into tweaking this a tiny bit more...

adamsitnik (Member, Author) replied:

> Can you point me to how to generate the benchmarking analysis?

Of course!

This doc describes how to run benchmarks against a local build of dotnet/runtime.

To get a trace file, you can use the EtwProfiler.

Here is a short introduction to PerfView.

If you are already familiar with PerfView, the generation of the profile diffs is described here.
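A minimal sketch of wiring this up, assuming BenchmarkDotNet's EtwProfiler diagnoser (Windows-only) and the hypothetical FrozenDictionaryCreation class from the earlier sketch; BenchmarkDotNet's command-line switch for selecting a profiler should achieve the same thing:

```csharp
using BenchmarkDotNet.Configs;
using BenchmarkDotNet.Diagnostics.Windows;
using BenchmarkDotNet.Running;

// Attach the ETW profiler so every benchmark run also produces a trace
// that can be opened (and diffed) in PerfView.
var config = DefaultConfig.Instance.AddDiagnoser(new EtwProfiler());

BenchmarkRunner.Run<FrozenDictionaryCreation>(config);
```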

BTW, automating this diff is on my TODO list; I've already made the first step and implemented dotnet/BenchmarkDotNet#2116.

> I'm looking into tweaking this a tiny bit more...

I am working on it right now; I expect to introduce changes to FrozenHashTable.Create and CalcNumBuckets (just letting you know).
