
Initial implementation of a new lockless hashtable mpmc #262

Merged
danielealbano merged 81 commits into main from new-lockless-hashtable-mpmc on Dec 5, 2022

Conversation

danielealbano (Owner) commented Nov 30, 2022

This PR implements a new lockless multi-producer, multi-consumer parallel hashtable which is capable of achieving amazing numbers (included at the end of the summary)!

This new hashtable works very differently from the previous one, although the goal remains the same: spread thread contention as much as possible and reduce how often the data in memory is changed.

The old approach relied on a combination of memory fencing and userspace spinlocks: GET operations were lockless, while SET and DEL operations used userspace spinlocks, with one lock every 14 buckets. The results were great, as a hashtable with 10000 buckets was controlled by about 714 locks (10000 / 14 ≈ 714), but the downside was that writing a single value from a single thread caused a lot of memory changes, which also impacted the cores' caches.

The new approach relies on a combination of techniques and components:

  • an epoch operation queue: every time an operation is started on the hashtable it is pushed to the queue and marked as completed once done, guaranteeing that the data read by the thread after pushing the operation will never be deleted until the operation itself has completed (a minimal sketch of this mechanism follows the list)
  • an epoch garbage collector, which works together with the epoch operation queue: every time a bucket is deleted, the associated key-value data is not freed immediately but instead staged for deletion, and is only freed once all the operations started before the staging have completed
  • a 3-pass approach to insert new buckets, which guarantees that if multiple threads are fighting to insert the same key, one of them will eventually succeed
    • this becomes a 4-pass approach when the key is new and the insertion happens during an upsize
  • the key-value data is not stored inside the hashtable itself but as a pointer to an external structure; this makes a bucket larger (16 bytes) but dramatically speeds up copying and makes it possible to use 128-bit atomic operations to update a bucket (see the second sketch after the list)
  • pointer tagging, to store status flags in the key-value pointer itself
  • a transaction spinlock, as in the current hashtable, for the single/multi-key transactions and the Read-Modify-Write operations
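
To make the epoch mechanism more concrete, here is a minimal sketch of epoch-based reclamation following the semantics described above; every name in it (operation_begin, stage_for_deletion, can_reclaim, and so on) is hypothetical and does not mirror the actual epoch_operation_queue / epoch_gc code in this PR:

```c
#include <stdatomic.h>
#include <stdbool.h>
#include <stdint.h>
#include <stdlib.h>

// Hypothetical, simplified sketch of epoch-based reclamation; names and
// layout do not mirror the actual epoch_operation_queue / epoch_gc code.

#define MAX_THREADS 64

static _Atomic uint64_t global_epoch = 1;

// Per-thread slot holding the epoch at which the current operation started;
// 0 means "no operation in flight".
static _Atomic uint64_t thread_op_epoch[MAX_THREADS];

typedef struct staged_item {
    void *key_value;            // key-value data detached from the hashtable
    uint64_t retired_at_epoch;  // epoch at which it was staged for deletion
    struct staged_item *next;
} staged_item_t;

// Called right before a GET/SET/DEL touches any bucket: this is the
// "push the operation to the queue" step.
static inline void operation_begin(int thread_id) {
    atomic_store(&thread_op_epoch[thread_id], atomic_load(&global_epoch));
}

// Called once the operation is complete.
static inline void operation_end(int thread_id) {
    atomic_store(&thread_op_epoch[thread_id], 0);
}

// Instead of freeing the key-value data right away, stage it for deletion.
static inline void stage_for_deletion(staged_item_t **list, void *key_value) {
    staged_item_t *item = malloc(sizeof(*item));
    item->key_value = key_value;
    item->retired_at_epoch = atomic_fetch_add(&global_epoch, 1);
    item->next = *list;
    *list = item;
}

// The garbage collector may free an item only when every in-flight operation
// started after the item was retired; operations that began earlier might
// still hold a pointer to it.
static inline bool can_reclaim(const staged_item_t *item) {
    for (int i = 0; i < MAX_THREADS; i++) {
        uint64_t e = atomic_load(&thread_op_epoch[i]);
        if (e != 0 && e <= item->retired_at_epoch) {
            return false;
        }
    }
    return true;
}
```

And here is a rough sketch of what a 16-byte bucket pointing to an external key-value structure could look like, updated with a single 128-bit compare-and-swap and using the low pointer bits for status tags; again, the layout, tag values and helpers are illustrative assumptions rather than the real hashtable_mpmc definitions:

```c
#include <stdbool.h>
#include <stdint.h>

// Illustrative sketch only: the field names, tag values and helpers below are
// assumptions, not the real hashtable_mpmc bucket layout.

// Low bits of an aligned pointer are always zero, so they can carry status
// flags ("pointer tagging") without making the bucket any larger.
#define KV_TAG_TEMPORARY ((uintptr_t)0x1)
#define KV_TAG_DELETED   ((uintptr_t)0x2)
#define KV_TAG_MASK      ((uintptr_t)0x3)

// The key-value data lives outside the hashtable, in its own allocation.
typedef struct key_value {
    char *key;
    void *value;
} key_value_t;

// 16-byte bucket: a hash plus a tagged pointer to the external key_value_t.
// Because the bucket is exactly 16 bytes and 16-byte aligned, it can be
// replaced with a single 128-bit compare-and-swap.
typedef struct bucket {
    uint64_t hash;
    uintptr_t tagged_kv;   // pointer to key_value_t | status tags
} __attribute__((aligned(16))) bucket_t;

static inline key_value_t *bucket_kv_pointer(const bucket_t *b) {
    return (key_value_t *)(b->tagged_kv & ~KV_TAG_MASK);
}

static inline bool bucket_kv_is_deleted(const bucket_t *b) {
    return (b->tagged_kv & KV_TAG_DELETED) != 0;
}

// Atomically swap the whole bucket (hash + tagged pointer) in one shot.
// On x86-64 this maps to cmpxchg16b (gcc/clang, typically with -mcx16).
static inline bool bucket_cas(bucket_t *b, bucket_t expected, bucket_t desired) {
    return __atomic_compare_exchange(b, &expected, &desired, false,
                                     __ATOMIC_ACQ_REL, __ATOMIC_ACQUIRE);
}
```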

The hashtable will also implement some extra optimizations to box the upper-level data (the storagedb_entry_index) into the lower-level data (the key-value itself), using a combination of a union and, in the header, some defines to create a typed version of the hashtable (a rough illustration follows below).
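
As a rough illustration of that union-plus-defines pattern (only storagedb_entry_index comes from the paragraph above; the other identifiers are hypothetical and simplified), a header could expose a typed wrapper along these lines:

```c
#include <stdbool.h>
#include <stdint.h>

// Rough illustration of the "boxing via union + defines" idea; apart from
// storagedb_entry_index (named in the description above) every identifier
// here is hypothetical and simplified.

typedef struct storagedb_entry_index storagedb_entry_index_t;

// The generic hashtable only sees an opaque value slot...
typedef union hashtable_mpmc_value {
    void *ptr;
    storagedb_entry_index_t *entry_index; // ...but the upper layer can box its
    uintptr_t uintptr;                    // own type into that same slot.
} hashtable_mpmc_value_t;

// Hypothetical generic entry points of the untyped hashtable.
bool hashtable_mpmc_set(void *ht, const char *key, hashtable_mpmc_value_t value);
bool hashtable_mpmc_get(void *ht, const char *key, hashtable_mpmc_value_t *value);

// A define in the header can then generate a typed wrapper, so callers never
// touch the union directly.
#define HASHTABLE_MPMC_TYPED(name, type)                                      \
    static inline bool name##_set(void *ht, const char *key, type *v) {      \
        hashtable_mpmc_value_t boxed = { .ptr = v };                         \
        return hashtable_mpmc_set(ht, key, boxed);                           \
    }                                                                         \
    static inline bool name##_get(void *ht, const char *key, type **v) {     \
        hashtable_mpmc_value_t boxed;                                         \
        if (!hashtable_mpmc_get(ht, key, &boxed))                            \
            return false;                                                    \
        *v = (type *)boxed.ptr;                                              \
        return true;                                                         \
    }

// Expands to storagedb_index_set()/storagedb_index_get(), which take
// storagedb_entry_index_t* directly instead of the generic union.
HASHTABLE_MPMC_TYPED(storagedb_index, storagedb_entry_index_t)
```

The idea is that the storage layer gets a type-safe API while the hashtable itself stays completely generic.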

This new hashtable will provide much better performance when batching operations or operating in a cluster.

The PR includes new tests and new benchmarks, which were used to generate the numbers reported at the end of this summary.

The PR is dramatically large:

  • 74 commits
  • 18 files changed
  • 4465 new lines
  • only 71 deletions

Talking about numbers, here are some:

| Operation | Threads | V1 - Million Op/s | V2 - Million Op/s |
|-----------|---------|-------------------|-------------------|
| INSERT    | 1       | 3.22191           | 5.02507           |
| INSERT    | 2       | 6.26603           | 9.74552           |
| INSERT    | 4       | 11.73655          | 19.20719          |
| INSERT    | 8       | 24.03989          | 38.92227          |
| INSERT    | 16      | 42.58111          | 72.36672          |
| INSERT    | 32      | 69.45982          | 129.38364         |
| INSERT    | 64      | 109.68172         | 197.67271         |
| UPDATE    | 1       | 3.18695           | 5.55167           |
| UPDATE    | 2       | 6.16648           | 11.06459          |
| UPDATE    | 4       | 11.53689          | 21.79368          |
| UPDATE    | 8       | 22.8819           | 43.98683          |
| UPDATE    | 16      | 37.42774          | 83.61501          |
| UPDATE    | 32      | 63.96726          | 143.30408         |
| UPDATE    | 64      | 89.5212           | 236.87745         |

Depending on the operation and thread count, the V2 hashtable is roughly 1.5 to 2.6 times faster than the V1 (the current implementation), with the gap widening as the thread count grows.

The hardware used for benchmarking was an EPYC 7502P (32 cores, 64 hardware threads, default BIOS settings) with 256 GB of DDR4-3200 RDIMM memory.

Closes #103

@danielealbano danielealbano added the enhancement New feature or request label Nov 30, 2022
@danielealbano danielealbano added this to the v0.2 milestone Nov 30, 2022
@danielealbano danielealbano self-assigned this Nov 30, 2022
@danielealbano danielealbano added this to In Progress in cachegrand via automation Nov 30, 2022

codecov bot commented Dec 1, 2022

Codecov Report

Base: 82.34% // Head: 82.74% // Increases project coverage by +0.41% 🎉

Coverage data is based on head (a602326) compared to base (48ba7eb).
Patch coverage: 91.43% of modified lines in pull request are covered.

Additional details and impacted files
@@            Coverage Diff             @@
##             main     #262      +/-   ##
==========================================
+ Coverage   82.34%   82.74%   +0.41%     
==========================================
  Files         157      158       +1     
  Lines        9795    10240     +445     
==========================================
+ Hits         8065     8473     +408     
- Misses       1730     1767      +37     
| Flag      | Coverage Δ                   |
|-----------|------------------------------|
| unittests | 82.74% <91.43%> (+0.41%) ⬆️ |

Flags with carried forward coverage won't be shown.

| Impacted Files | Coverage Δ |
|----------------|------------|
| ...rc/data_structures/hashtable/spsc/hashtable_spsc.h | 100.00% <ø> (ø) |
| ...rc/data_structures/hashtable_mpmc/hashtable_mpmc.c | 91.03% <91.03%> (ø) |
| ...rc/data_structures/hashtable/spsc/hashtable_spsc.c | 94.51% <100.00%> (-0.06%) ⬇️ |
| src/epoch_gc.c | 99.03% <100.00%> (+0.05%) ⬆️ |
| src/random.c | 100.00% <100.00%> (ø) |
| src/xalloc.c | 97.14% <0.00%> (+1.43%) ⬆️ |
| src/spinlock.h | 94.44% <0.00%> (+5.56%) ⬆️ |


@danielealbano danielealbano changed the title from "Initial implementation of a new new lockless hashtable mpmc" to "Initial implementation of a new lockless hashtable mpmc" Dec 5, 2022
@danielealbano danielealbano merged commit 7f32454 into main Dec 5, 2022
cachegrand automation moved this from In Progress to Completed Dec 5, 2022
@danielealbano danielealbano deleted the new-lockless-hashtable-mpmc branch December 5, 2022 00:00
@danielealbano danielealbano moved this from Completed to Ready for Work in cachegrand Dec 5, 2022
@danielealbano danielealbano moved this from Ready for Work to Completed in cachegrand Dec 5, 2022