New `map` internals #2181

gingerBill · 2022-11-11T16:12:14Z

High performance, cache-friendly, open-addressed Robin Hood hashing hash map data structure with various optimizations for Odin.

To make this map cache-friendly it uses a novel strategy to ensure keys and values of the map are always cache-line aligned and that no single key or value of any type ever straddles a cache-line. This cache efficiency makes for quick lookups because the linear-probe always addresses data in a cache friendly way. This is enabled through the use of a special meta-type called a Map_Cell which packs as many values of a given type into a local array adding internal padding to round to MAP_CACHE_LINE_SIZE. One other benefit to storing the internal data in this manner is false sharing no longer occurs when using a map, enabling efficient concurrent access of the map data structure with minimal locking if desired.

Notes:

Open-Addressed Robin Hood Hashing
SOA-based cache-friendly entries
Defaults to dynamic calls currently, add -use-static-map-calls to enable static calls
- Which should be used and when?

This map implementation makes extensive use of uintptr for representing sizes, lengths, capacities, masks, pointers, offsets, and addresses to avoid expensive sign extension and masking that would be generated if types were casted all over. The only place regular ints show up is in the cap() and len() implementations.

// The raw, type-erased representation of a map.
//
// 32-bytes on 64-bit
// 16-bytes on 32-bit
Raw_Map :: struct {
	// A single allocation spanning all keys, values, and hashes.
	// {
	//   k: Map_Cell(K) * (capacity / ks_per_cell)
	//   v: Map_Cell(V) * (capacity / vs_per_cell)
	//   h: Map_Cell(H) * (capacity / hs_per_cell)
	// }
	//
	// The data is allocated assuming 64-byte alignment, meaning the address is
	// always a multiple of 64. This means we have 6 bits of zeros in the pointer
	// to store the capacity. We can store a value as large as 2^6-1 or 63 in
	// there. This conveniently is the maximum log2 capacity we can have for a map
	// as Odin uses signed integers to represent capacity.
	//
	// Since the hashes are backed by Map_Hash, which is just a 64-bit unsigned
	// integer, the cell structure for hashes is unnecessary because 64/8 is 8 and
	// requires no padding, meaning it can be indexed as a regular array of
	// Map_Hash directly, though for consistency sake it's written as if it were
	// an array of Map_Cell(Map_Hash).
	data:      uintptr,   // 8-bytes on 64-bits, 4-bytes on 32-bits
	len:       int,       // 8-bytes on 64-bits, 4-bytes on 32-bits
	allocator: Allocator, // 16-bytes on 64-bits, 8-bytes on 32-bits
}

Benefits of this approach over the old approach

VERY fast map get (~3x dynamic calls, ~5x static calls)
VERY fast map set (~5x dynamic calls, ~6x static calls)
One allocation per map rather than two (previously hashes and entries)
- Only requires alloc_non_zeroed and free internally
SOA keys, values, and hashes, allowing each to be loaded into a separate cache line
Entries are stored in non-contiguous cell-layout which means no element straddles across a cache line
Small header information (runtime.Map_Info) for dynamic calls
Allows for calling delete_key whilst iterating across the map

Issues of this approach over the old approach

Cannot resize in-place, must make new map, copy contents to new map, and delete old map
Non-trivial to iterate across in a for in loop
- Due to the SOA layout, non-contiguous cell-layout, and non-valid buckets
- Old implementation had all entries be contiguous and in a separate dynamic array
Requires extra scratch storage for the keys and values
- When inserting, the scratch storage is used rather than relying on stack memory
A lot more complicated than the previous implementation

General differences

Hashing function has not been changed EXCEPT it has been sanitized
- 0 is not a valid hash value any more, it will be set to 1 internally
- Highest bit must be zero has this bit is used as a flag to indicate a bucket was deleted but still allow for probing
log2(capacity) is encoded into the data as the lowest 6 bits (allowing for a hypothetical maximum capacity of 2^63 elements)
- Capacities can only be a power of two

TODOs for Future PRs

Improved hash function
- Use architecture specific instructions (e.g. AES on amd64) when possible
- If not possible, fallback to a different hash
- Currently using a (modified) fnv64a, maybe something wyhash might be better
  - wyhash is being used by other languages as a fallback, and appears to perform well with smhasher

…finition of `runtime.Raw_Map`

… `map`; add `runtime.map_get`

This is test code

Test code

map[K]struct{} works fine.

gingerBill added 30 commits November 7, 2022 23:02

Begin work on implementing the new map internals

c96e0af

Basic get and set support for new map

e914a87

Basic fmt printing for map

bce62b9

Correct fmt printing to be robust

2c3febd

General modifications

da774e3

Correct hashing for map types

50e10ce

Support for in loops for map

e3e225d

Remove the need for type->Map.internal_type and replace with the de…

810a1ee

…finition of `runtime.Raw_Map`

Correct reflect.map_entry_info_slice

45f0c81

Add runtime.map_exists_dynamic

ea263b8

Disallow zero sized map keys

d77269d

Correct reflection usage of maps

6dd4d1a

Make Map_Info store pointers to cell info rather than inline

ed58374

Add intrinsics.map_cell_info and intrinsics.map_info

a740937

Change Raw_Map.len to int from uintptr

2fc3da3

Change __dynamic_map_get signature

046dd55

Allow for -use-static-map-calls which generates a get procedure per…

a71daee

… `map`; add `runtime.map_get`

Fix json marshal for maps

6a4e446

Fix for in for map

0819d05

Minor change to map_cell_index_static

2f29894

Make map_free_dynamic take the total size of the allocation

dae299b

Fix bug with allocator not getting set on a map

366779f

Correct map_insert_hash_dynamic and map_insert_dynamic

667af1b

Do an extra check before insertion for pre-existing keys

503eb47

This is test code

Check for existence before setting

bcf437d

Test code

Inline __dynamic_map_set code where possible

d4f3437

Use mem_resize where possible

3858422

Rewrite map_insert_hash_dynamic

0424fb4

Swap hashes

b035ee2

Change map internal calls to use a pointer

1bcec3f

gingerBill requested review from colrdavidson, graphitemaster, Kelimion and oskarnp November 11, 2022 16:12

Correct json/unmarshal.odin

04a1e7d

gingerBill self-assigned this Nov 11, 2022

gingerBill added enhancement design implementation compiler-development core-library implementation labels Nov 11, 2022

Kelimion and others added 10 commits November 12, 2022 17:25

Add tests for new map implementation.

9c1b464

Add tests/internal/build.bat

7207f4b

Update tests/internal/build.bat

699cabe

Correct map_reserve_dynamic caused by an bizarre code generation bug

ad0f116

map: Add tests for update + delete.

16a4943

map tests for Linux and Mac

9b88a38

Fix CI typo.

7dfbda5

Minor improvement to multi return value reducing stack usage

a705a2e

Simplify the handling of the hashing calls for maps

3edb3d8

Add @(require_results) to map procedures where possible

489e8dc

gingerBill marked this pull request as ready for review November 13, 2022 23:47

gingerBill added 4 commits November 13, 2022 23:50

Enforce pointer cast

d2019e3

Fix prototype

81f83d5

Revert "Minor improvement to multi return value reducing stack usage"

25bec19

Correct map_insert

bbe44b4

gingerBill force-pushed the map-dev branch from b4729d9 to bbe44b4 Compare November 14, 2022 11:48

Test new map when used as a set.

3949e22

map[K]struct{} works fine.

gingerBill merged commit 15bbdb2 into master Nov 17, 2022

gingerBill deleted the map-dev branch November 17, 2022 16:04

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

New `map` internals #2181

New `map` internals #2181

gingerBill commented Nov 11, 2022 •

edited

New map internals #2181

New map internals #2181

Conversation

gingerBill commented Nov 11, 2022 • edited

High performance, cache-friendly, open-addressed Robin Hood hashing hash map data structure with various optimizations for Odin.

Notes:

Benefits of this approach over the old approach

Issues of this approach over the old approach

General differences

TODOs for Future PRs

New `map` internals #2181

New `map` internals #2181

gingerBill commented Nov 11, 2022 •

edited