Use a bitwise and instead of shifts #3092

AreaZR · 2022-09-08T14:36:48Z

This is faster in some instances and is clearer in intent

StephanTLavavej · 2022-09-09T01:26:31Z

Thanks for looking into this!

The optimized codegen is identical: https://godbolt.org/z/K4envEYs5

I believe that the debug codegen is not important here - these are per-algorithm-call, and we generally don't worry about slightly suboptimal instruction sequences (whereas we occasionally worry about many layers of unnecessary function calls, e.g. in invoke()).

So the main question is clarity, which is a judgement call. I believe that the existing pattern of shifting back and forth by N more clearly expresses the intent to clear the low N bits. Marking as "decision needed" for the other maintainers to consider.

MikeGitb · 2022-09-10T10:42:26Z

Just a suggestion: If you want to use this notation, I'd use binary litterals instead of decimal numbers.

AlexGuteniev · 2022-09-12T08:57:35Z

My opinion:

The clear way is * 16 / 16, though the debug codegen is expected to be awful (with actual division)
I'd wrote & 0xF. Hex is a compact binary.
There's no much point to make this clearer. Those who read SSE / AVX will not be confused by any of the opitions

strega-nil-ms · 2022-09-12T16:24:45Z

I strongly agree that & ~... is more clear than >> N << N; however, I would also appreciate @AlexGuteniev's heximal literal change (or even binary literal). I'm not against making this clearer.

Replacements: _Byte_length(_First, _Last) >> 5 << 5 _Byte_length(_First, _Last) & ~size_t{0x1F} _Byte_length(_First, _Last) >> 4 << 4 _Byte_length(_First, _Last) & ~size_t{0xF}

Replacements: _Byte_length(_First, _Last) >> 6 << 5 (_Byte_length(_First, _Last) >> 1) & ~size_t{0x1F} _Byte_length(_First, _Last) >> 5 << 4 (_Byte_length(_First, _Last) >> 1) & ~size_t{0xF}

StephanTLavavej · 2022-09-12T22:40:32Z

Ok, I've validated and pushed changes to:

Use hex literals as suggested by @AlexGuteniev and @strega-nil-ms.
Use ~size_t{VALUE} to start with a constant of the desired width and avoid having to think about signed-to-unsigned conversions. (size_t & ~31 worked because ~31 is a negative int that gets sign-extended when converted to size_t, which I found highly surprising when I realized it.)
Also change >> 6 << 5 and >> 5 << 4.
- I believe that this is a clarity improvement, as this makes the division by 2 (followed by masking) more obvious, so I am now in favor of this change.

Codegen is unaffected: https://godbolt.org/z/h7qE3vca1

Thanks again!

StephanTLavavej · 2022-09-12T23:28:27Z

I'm mirroring this to the MSVC-internal repo - please notify me if any further changes are pushed.

StephanTLavavej · 2022-09-13T21:55:27Z

Thanks for this code cleanup! 😸 😸 😸 😸 😸 😸 😸 😸

Co-authored-by: Stephan T. Lavavej <stl@nuwen.net>

AreaZR requested a review from a team as a code owner September 8, 2022 14:36

AreaZR changed the title ~~Use a bitwise instead of shifts~~ Use a bitwise and instead of shifts Sep 8, 2022

Use a bitwise and instead of shifts

f3e17cb

This is faster in some instances and is clearer in intent

AreaZR force-pushed the bitwise-and branch from a21594e to f3e17cb Compare September 8, 2022 14:45

StephanTLavavej added enhancement Something can be improved decision needed We need to choose something before working on this labels Sep 9, 2022

StephanTLavavej removed the decision needed We need to choose something before working on this label Sep 12, 2022

StephanTLavavej added 2 commits September 12, 2022 15:26

Use & ~size_t{0x1F} and & ~size_t{0xF}.

f17704f

Replacements: _Byte_length(_First, _Last) >> 5 << 5 _Byte_length(_First, _Last) & ~size_t{0x1F} _Byte_length(_First, _Last) >> 4 << 4 _Byte_length(_First, _Last) & ~size_t{0xF}

Also change >> 6 << 5 and >> 5 << 4.

3d4dcbf

Replacements: _Byte_length(_First, _Last) >> 6 << 5 (_Byte_length(_First, _Last) >> 1) & ~size_t{0x1F} _Byte_length(_First, _Last) >> 5 << 4 (_Byte_length(_First, _Last) >> 1) & ~size_t{0xF}

StephanTLavavej approved these changes Sep 12, 2022

View reviewed changes

CaseyCarter approved these changes Sep 12, 2022

View reviewed changes

StephanTLavavej self-assigned this Sep 12, 2022

StephanTLavavej merged commit 5985b06 into microsoft:main Sep 13, 2022

AreaZR deleted the bitwise-and branch September 26, 2022 16:06

CaseyCarter pushed a commit to CaseyCarter/STL that referenced this pull request Oct 6, 2022

Use a bitwise and instead of shifts (microsoft#3092)

5774640

Co-authored-by: Stephan T. Lavavej <stl@nuwen.net>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Use a bitwise and instead of shifts #3092

Use a bitwise and instead of shifts #3092

AreaZR commented Sep 8, 2022

StephanTLavavej commented Sep 9, 2022

MikeGitb commented Sep 10, 2022

AlexGuteniev commented Sep 12, 2022

strega-nil-ms commented Sep 12, 2022

StephanTLavavej commented Sep 12, 2022

StephanTLavavej commented Sep 12, 2022

StephanTLavavej commented Sep 13, 2022

Use a bitwise and instead of shifts #3092

Use a bitwise and instead of shifts #3092

Conversation

AreaZR commented Sep 8, 2022

StephanTLavavej commented Sep 9, 2022

MikeGitb commented Sep 10, 2022

AlexGuteniev commented Sep 12, 2022

strega-nil-ms commented Sep 12, 2022

StephanTLavavej commented Sep 12, 2022

StephanTLavavej commented Sep 12, 2022

StephanTLavavej commented Sep 13, 2022