Workaround for slow codegen on x86 atomic load #2110

AlexGuteniev · 2021-08-10T13:30:34Z

Compiler emits jumptable or jcc sequence that prevents inlining
of atomic load; separation of order check and barrier condition helps

Compiler emits jumptable or jcc sequence that prevents inlining of atomic load; separation of order check and barrier condition helps

AlexGuteniev · 2021-08-10T13:41:13Z

Not sure if it should be applied, or the compiler should be fixed instead.

RSilicon · 2021-08-10T15:02:35Z

Send feedback to the Visual Studio compiler team via visual studio feedback

AlexGuteniev · 2021-08-10T16:12:47Z

The feedback was already sent by @Chronial , see DevCom-1491677

I'm not really sure if there should be workaround in code. There are at least three optimizer issues:

Compiler barrier emits npad 1 (nop instruction) instead of being truly no-op
switch does not collapse to if, even though there are only two possibilities
Decision whether to inline a function is based on full function size, not the size when constant propagation happens

Fixing just one of them would help. The workaround is a particular solutiin for wider problems.

AlexGuteniev · 2021-08-10T18:28:03Z

Proof that it works: https://godbolt.org/z/5e7re6bGY

StephanTLavavej

Looks good! The behavior is equivalent, and the code is simpler, so this is a good perma-workaround (no TRANSITION comment).

barcharcraz

I also like the "workaround" code better than the old code.

StephanTLavavej · 2021-08-14T00:39:17Z

I'm mirroring this to an MSVC-internal PR. Please notify me if any further changes are pushed.

StephanTLavavej · 2021-08-14T02:07:15Z

Changed the title to say "slow codegen", as the compiler back-end team conventionally uses "bad codegen" to mean incorrect codegen.

StephanTLavavej · 2021-08-17T03:33:27Z

Thanks... for... improving... this... slow... codegen... ! 🐌 😹 🐇

Workaround for bad codegen, ox x86 atomic load

ace6ee3

Compiler emits jumptable or jcc sequence that prevents inlining of atomic load; separation of order check and barrier condition helps

AlexGuteniev requested a review from a team as a code owner August 10, 2021 13:30

AlexGuteniev mentioned this pull request Aug 10, 2021

<chrono>: steady_clock::now() is avoidably slow #2085

Closed

AlexGuteniev changed the title ~~Workaround for bad codegen, ox x86 atomic load~~ Workaround for bad codegen on x86 atomic load Aug 10, 2021

StephanTLavavej added the performance Must go faster label Aug 10, 2021

StephanTLavavej approved these changes Aug 11, 2021

View reviewed changes

StephanTLavavej assigned barcharcraz Aug 11, 2021

barcharcraz approved these changes Aug 13, 2021

View reviewed changes

StephanTLavavej unassigned barcharcraz Aug 13, 2021

StephanTLavavej self-assigned this Aug 14, 2021

StephanTLavavej changed the title ~~Workaround for bad codegen on x86 atomic load~~ Workaround for slow codegen on x86 atomic load Aug 14, 2021

StephanTLavavej merged commit 7feac35 into microsoft:main Aug 17, 2021

AlexGuteniev deleted the atomic_load_fix branch August 17, 2021 15:46

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Workaround for slow codegen on x86 atomic load #2110

Workaround for slow codegen on x86 atomic load #2110

AlexGuteniev commented Aug 10, 2021

AlexGuteniev commented Aug 10, 2021

RSilicon commented Aug 10, 2021

AlexGuteniev commented Aug 10, 2021 •

edited

Loading

AlexGuteniev commented Aug 10, 2021

StephanTLavavej left a comment

barcharcraz left a comment

StephanTLavavej commented Aug 14, 2021

StephanTLavavej commented Aug 14, 2021

StephanTLavavej commented Aug 17, 2021

Workaround for slow codegen on x86 atomic load #2110

Workaround for slow codegen on x86 atomic load #2110

Conversation

AlexGuteniev commented Aug 10, 2021

AlexGuteniev commented Aug 10, 2021

RSilicon commented Aug 10, 2021

AlexGuteniev commented Aug 10, 2021 • edited Loading

AlexGuteniev commented Aug 10, 2021

StephanTLavavej left a comment

Choose a reason for hiding this comment

barcharcraz left a comment

Choose a reason for hiding this comment

StephanTLavavej commented Aug 14, 2021

StephanTLavavej commented Aug 14, 2021

StephanTLavavej commented Aug 17, 2021

AlexGuteniev commented Aug 10, 2021 •

edited

Loading