Improve System.Collections.BitArray #115069

tfenise · 2025-04-25T23:54:08Z

Use xplat SIMD intrinsics in BitArray.CopyTo. Closes #116079

Other minor improvements.

dotnet-policy-service · 2025-04-25T23:54:42Z

Tagging subscribers to this area: @dotnet/area-system-collections
See info in area-owners.md if you want to be subscribed.

src/libraries/System.Collections/src/System/Collections/BitArray.cs

…adability

watfordsuzy · 2025-05-02T18:20:22Z

src/libraries/System.Collections/src/System/Collections/BitArray.cs

                    Vector256<byte> isFalse = Vector256.Equals(vector, Vector256<byte>.Zero);

-                    uint result = isFalse.ExtractMostSignificantBits();
-                    m_array[i / 32u] = (int)(~result);
+                    m_array[i / 32] = (int)~(isFalse.ExtractMostSignificantBits());
                }
            }
            else if (Vector128.IsHardwareAccelerated)


All the other similar code dropped the else for this branch.

Here, the loop conditions for Vector256 and Vector128 paths are basically same (i <= values.Length - 32), so there is no point executing the Vector128 path if the Vector256 path is already executed.

tannergooding · 2025-05-30T00:18:01Z

src/libraries/System.Collections/src/System/Collections/BitArray.cs

+                    case 3:
+                        last = byteSpan[2] << 16;
+                        goto case 2;
+                    // fall through


I know these are being caried over from the previous, but the comments are superfluous. The goto case # makes it explicit where the control flow goes and so it isn't "technically" fallthrough and more-so just adds noise.

If I am reading it right, this block (lines 87 - 118) can be

bytes.CopyTo(MemoryMarshal.AsBytes(m_array.AsSpan())); if (!BitConverter.IsLittleEndian) { BinaryPrimitives.ReverseEndianness<int>(m_array, m_array); }

True, I think that 'last 4 bytes' case doesn't need to be handled explicitly.

BinaryPrimitives.ReverseEndianness<int>(m_array, m_array);

minus the <int> part as ReverseEndianness is a non-generic method.

bytes.CopyTo(MemoryMarshal.AsBytes(m_array.AsSpan())); if (!BitConverter.IsLittleEndian) { BinaryPrimitives.ReverseEndianness(m_array, m_array); }

tannergooding · 2025-05-30T00:20:57Z

src/libraries/System.Collections/src/System/Collections/BitArray.cs

-                    ulong result = isFalse.ExtractMostSignificantBits();
-                    m_array[i / 32u] = (int)(~result & 0x00000000FFFFFFFF);
-                    m_array[(i / 32u) + 1] = (int)((~result >> 32) & 0x00000000FFFFFFFF);
+                    ulong result = ~(isFalse.ExtractMostSignificantBits());


This is less efficient, you should rather have this be ulong result = (~Vector512.IsZero(vector)).ExtractMostSignificantBits()

It's almost always better to do the negations and similar on the comparison or vector directly, as it allows it to become a direct cmpneq instead of staying a cmpeq followed by not or similar

There is no pcmpneq, only pcmpeq.

AVX512 does have vptestmb or vpcmpub, but neither (~Vector512.IsZero(value)).ExtractMostSignificantBits() nor ~(Vector512.IsZero(value).ExtractMostSignificantBits()) are optimised to use them (latest daily build).

(~Vector512.IsZero(value)).ExtractMostSignificantBits():

vmovups zmm0, zmmword ptr [rcx] vptestnmb k1, zmm0, zmm0 vpmovm2b zmm0, k1 vpternlogd zmm0, zmm0, zmm0, 85 vpmovb2m k1, zmm0 kmovq rax, k1 vzeroupper ret

~(Vector512.IsZero(value).ExtractMostSignificantBits()):

vmovups zmm0, zmmword ptr [rcx] vptestnmb k1, zmm0, zmm0 kmovq rax, k1 not rax vzeroupper ret

Currently, ~(Vector512.IsZero(value).ExtractMostSignificantBits()) looks better.

It's the same with Vector512.Equals(value, Vector512<byte>.Zero) in the place of Vector512.IsZero(value).

This is xplat code and we strictly care more about the readability, legibility, and maintainability of it then micro tuning the exact instructions.

Longer term this code will likely be shared with the vector128/256 paths using generics and the ISimdVector interface, so that we only need to maintain 1 rather than 3 paths. We may even centralize the looping logic in a similar way as is done for TensorPrimitives

As such, the pattern I indicated is the best long term pattern across all potential hardware. If there is a case of suboptimal codegen then we should separately log an issue to track that and get it improved because we want developers to use and follow the more idiomatic patterns and use the relevant dedicated APIs where possible

Notably vpcmpneq does exist and is an alias for vpcmp with the correct immediate control byte (I believe 4). The same “should” be an optimization we do over vptest, it just looks to be missing right now

If we don't go to the details of the exact instructions, the intuitive reasoning should rather be that inverting a 64-bit ulong looks less expensive than inverting a 512-bit Vector512<byte>, so it prefers ~(Vector512.IsZero(value).ExtractMostSignificantBits()).

use the relevant dedicated APIs where possible

What dedicated API? There is no VectorXXX.NotEquals() or VectorXXX.IsNonZero().

What dedicated API? There is no VectorXXX.NotEquals() or VectorXXX.IsNonZero().

Using ~ directly on the comparison/query, much as you would use ! on something returning bool

the intuitive reasoning should rather be that inverting a 64-bit ulong looks less expensive than inverting a 512-bit Vector512, so it prefers ~(Vector512.IsZero(value).ExtractMostSignificantBits()).

That isn't how vectors work at all nor how someone using vectors should expect them to work. Vectors are "primitives" that are expected to be able to process Count items in the same general time as 1 scalar item. Any kind of reduction operation, such as Sum or ExtractMostSignificantBits which has to create a value combined from the contained elements is expected to be more expensive, and in many cases it will be.

Improve BitArray.cs

b008b4f

ghost added the area-System.Collections label Apr 25, 2025

dotnet-policy-service bot added the community-contribution label Apr 25, 2025

tfenise mentioned this pull request Apr 26, 2025

Vector512.Shuffle does not produce optimal codegen in some case #115078

Open

EgorBo reviewed Apr 26, 2025

View reviewed changes

src/libraries/System.Collections/src/System/Collections/BitArray.cs Outdated Show resolved Hide resolved

EgorBo reviewed Apr 26, 2025

View reviewed changes

src/libraries/System.Collections/src/System/Collections/BitArray.cs Show resolved Hide resolved

EgorBo reviewed Apr 26, 2025

View reviewed changes

src/libraries/System.Collections/src/System/Collections/BitArray.cs Outdated Show resolved Hide resolved

EgorBo reviewed Apr 26, 2025

View reviewed changes

src/libraries/System.Collections/src/System/Collections/BitArray.cs Outdated Show resolved Hide resolved

Improve BitArray.cs

857dc49

MichalPetryka reviewed Apr 26, 2025

View reviewed changes

src/libraries/System.Collections/src/System/Collections/BitArray.cs Show resolved Hide resolved

tfenise added 2 commits April 28, 2025 14:58

Improve BitArray.cs

71d3a55

Improve BitArray.cs - Replace unnecessary uint with int to improve re…

56c61a3

…adability

This was referenced Apr 29, 2025

System.Net.Quic.Tests.MsQuicTests.WriteTests failed with "System.Net.Quic.QuicException : The connection timed out from inactivity." #105177

Open

OSX failure on StringTests.StartsWithNoMatch_StringComparison #112195

Open

Improve BitArray.cs

b1ae205

tfenise marked this pull request as ready for review May 1, 2025 17:00

watfordsuzy reviewed May 2, 2025

View reviewed changes

Merge branch 'main' into bitarray

31cb965

build-analysis bot mentioned this pull request May 29, 2025

System.Net.Http.Functional.Tests timeouts #115683

Open

tannergooding reviewed May 30, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Improve System.Collections.BitArray #115069

Improve System.Collections.BitArray #115069

Uh oh!

tfenise commented Apr 25, 2025 •

edited

Loading

Uh oh!

dotnet-policy-service bot commented Apr 25, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

watfordsuzy May 2, 2025

Uh oh!

tfenise May 2, 2025

Uh oh!

tannergooding May 30, 2025

Uh oh!

Rob-Hague May 30, 2025

Uh oh!

am11 May 30, 2025

Uh oh!

tannergooding May 30, 2025

Uh oh!

tannergooding May 30, 2025

Uh oh!

tfenise May 30, 2025 •

edited

Loading

Uh oh!

tannergooding May 30, 2025

Uh oh!

tfenise May 30, 2025

Uh oh!

tannergooding May 31, 2025

Uh oh!

Uh oh!

Improve System.Collections.BitArray #115069

Are you sure you want to change the base?

Improve System.Collections.BitArray #115069

Uh oh!

Conversation

tfenise commented Apr 25, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

dotnet-policy-service bot commented Apr 25, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

tfenise May 30, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

tfenise commented Apr 25, 2025 •

edited

Loading

tfenise May 30, 2025 •

edited

Loading