
Synchronization primitive improvements #3918

Merged: 5 commits, Dec 18, 2019

Conversation

@kprotty (Member) commented Dec 16, 2019

SpinLock

  • removed Backoff
  • use better yielding per platform
  • added tryAcquire(): ?Held (and deinit() for Mutex compatibility); see the sketch below
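
For context, a minimal sketch of the tryAcquire(): ?Held shape, written in present-day Zig syntax rather than the 2019-era std API. Names and orderings here are illustrative, not the actual std.SpinLock internals:

```zig
const std = @import("std");

// Minimal test-and-set spinlock sketch. The real implementation also
// yields differently per platform in acquire(); this only hints at it.
const SpinLock = struct {
    locked: bool = false,

    pub const Held = struct {
        lock: *SpinLock,

        pub fn release(self: Held) void {
            @atomicStore(bool, &self.lock.locked, false, .release);
        }
    };

    pub fn tryAcquire(self: *SpinLock) ?Held {
        // Xchg returns the previous value: seeing `false` means we took the lock.
        if (@atomicRmw(bool, &self.locked, .Xchg, true, .acquire) == false)
            return Held{ .lock = self };
        return null;
    }

    pub fn acquire(self: *SpinLock) Held {
        while (true) {
            return self.tryAcquire() orelse {
                // A real implementation yields per platform here
                // (e.g. a pause instruction or sched_yield).
                std.atomic.spinLoopHint();
                continue;
            };
        }
    }
};
```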

ResetEvent

  • simpler interface (no bool returns, sketched below) & blocking wait can't fail anymore
  • fix test case that relied on OS scheduling via time.sleep
  • fix deadlock detection in single-threaded builds & support single-threaded test cases
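
To illustrate the simplified interface (void set/reset and an infallible wait), here is a hypothetical spin-based stand-in in present-day Zig. The PR's version blocks through OS primitives instead of spinning; only the interface shape is the point:

```zig
const std = @import("std");

// Hypothetical stand-in showing the interface shape only: set() and
// reset() return void, and wait() cannot fail. The actual ResetEvent
// blocks via per-platform primitives rather than spinning.
const ResetEvent = struct {
    is_set: bool = false,

    pub fn set(self: *ResetEvent) void {
        @atomicStore(bool, &self.is_set, true, .release);
    }

    pub fn reset(self: *ResetEvent) void {
        @atomicStore(bool, &self.is_set, false, .monotonic);
    }

    pub fn wait(self: *ResetEvent) void {
        while (!@atomicLoad(bool, &self.is_set, .acquire)) {
            std.Thread.yield() catch {};
        }
    }
};
```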

Mutex

  • fix missing spin_count increment, which made Mutex effectively a SpinLock (see the slow-path sketch below)
  • updated atomic ops (orderings for correctness, loads & yields for performance)
  • specialized implementation per platform for performance
  • added tryAcquire(): ?Held
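
As a sketch of why the missing increment mattered: the slow path spins a bounded number of times before blocking, and without bumping the counter the blocking branch is unreachable. The shape below is hypothetical present-day Zig, not the actual std.Mutex internals (the real slow path parks the thread on an OS wait primitive and uses a richer state machine):

```zig
const std = @import("std");

// Illustrative adaptive slow path. Without the `spin += 1` below, the
// loop never falls through to the blocking branch, so the Mutex behaves
// exactly like a spinlock.
fn acquireSlow(state: *u32) void {
    var spin: u32 = 0;
    const max_spin = 40; // made-up bound before giving up on spinning

    while (true) {
        // 0 = unlocked, 1 = locked (simplified two-state protocol).
        if (@cmpxchgWeak(u32, state, 0, 1, .acquire, .monotonic) == null)
            return; // acquired

        if (spin < max_spin) {
            spin += 1; // the increment this PR restores
            std.atomic.spinLoopHint();
        } else {
            // A real implementation parks the thread here via an OS
            // wait primitive; yielding stands in for that.
            std.Thread.yield() catch {};
        }
    }
}
```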

@andrewrk (Member)

Thanks for the PR. What kind of differences did you observe in the example code you were testing with this?

@kprotty (Member, Author) commented Dec 16, 2019

@andrewrk These were only the results on my dev machine (a Ryzen 5 2600), so they may not be representative; a sketch of the kind of harness behind them follows the numbers. I have also yet to see the effects in actual software, and plan to check out the zig-async-demo repo for more testing soon.

For SpinLock:

  • Linux and macOS Catalina saw roughly a 2x improvement for highly contended, small critical sections (SCS)
  • Windows improved by about 2x as well, but for large critical sections (LCS)

For Mutex:

  • on Linux, it stopped eating CPU when idle; 20-30% behind glibc pthread_mutex for SCS, about even for LCS
  • on macOS Catalina, same situation, but 2x faster than pthread_mutex in both
  • on Windows, about 10-25% faster than SRWLock in both
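
The harness behind the "highly contended, small critical section" numbers might look like the following. This is entirely hypothetical: the thread count, iteration count, and the use of present-day std.Thread.Mutex are my own choices, not the PR's benchmark code:

```zig
const std = @import("std");

// Hypothetical contended microbenchmark: several threads hammer one
// mutex, each critical section doing only a single increment (the
// "small critical section" case). All numbers are illustrative.
const num_threads = 4;
const iters_per_thread = 1_000_000;

fn worker(mutex: *std.Thread.Mutex, counter: *u64) void {
    var i: usize = 0;
    while (i < iters_per_thread) : (i += 1) {
        mutex.lock(); // small critical section: one increment
        counter.* += 1;
        mutex.unlock();
    }
}

pub fn main() !void {
    var mutex = std.Thread.Mutex{};
    var counter: u64 = 0;

    var timer = try std.time.Timer.start();
    var threads: [num_threads]std.Thread = undefined;
    for (&threads) |*t|
        t.* = try std.Thread.spawn(.{}, worker, .{ &mutex, &counter });
    for (threads) |t|
        t.join();

    std.debug.print("{d} increments in {d} ms\n", .{
        counter,
        timer.read() / std.time.ns_per_ms,
    });
}
```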

@kprotty (Member, Author) commented Dec 16, 2019

Regarding the differences in zig-async-demo compiled with release-safe: the outcomes were similar for both fact-await and sieve under GNU perf. The new structures took a bit longer to complete (~200-500 ms) across various iteration counts, but used 60% fewer cycles, and the hottest path moved from the "is locked" check in std.mutex.acquireSlow() to other places such as std.math.big.int.Int.llmullacc and std.event.Channel.put, since the Mutex is no longer a spinlock.

@JesseRMeyer

> The new structures took a bit longer to complete (~200-500 ms)

How much total time did the old structures take to complete?

@andrewrk (Member)

These test failures look legit

@kprotty (Member, Author) commented Dec 16, 2019

@JesseRMeyer I should note that the 200-500 ms is the difference rather than the absolute time to complete; the absolute time varies with the machine and the number of iterations specified in each run case.

@JesseRMeyer

> @JesseRMeyer I should note that the 200-500 ms is the difference rather than the absolute time to complete; the absolute time varies with the machine and the number of iterations specified in each run case.

Right. A ratio here would be a bit more meaningful.

@kprotty (Member, Author) commented Dec 16, 2019

@JesseRMeyer ah, you're right. It's around 0.9-1.2% slower under release-safe.

@andrewrk (Member)

@kprotty: @LemonBoy mentioned in #3932 that it would fix the test failures here. Mind rebasing this PR?
