Linear Probing Cuckoo Hashing (LP-Cuckoo)

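Usage

Copy cuckoo.h and murmur3.h into your working directory, add #include "cuckoo.h", create a Cuckoo object, and call its public interface functions insert, search, and remove.

We recommend compiling with g++ and the -O3 option.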

About this repo

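Algorithm

Purpose

LP-Cuckoo is a hash table for managing large numbers of key-value pairs. It can reach a load factor above 90%, and it sustains high update and query speeds even at load factors above 90%.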

Cuckoo Hashing

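Cuckoo hashing consists of two tables, T₁ and T₂, and two hash functions, h₁ and h₂. An element e = <k, v> is stored either at position h₁(k) of T₁ or at position h₂(k) of T₂.

To insert e₀ = <k₀, v₀>, compute h₁(k₀) and store e₀ at position h₁(k₀) of T₁. If that position was empty, the insertion succeeds. Otherwise, let e₁ = <k₁, v₁> be the element that was there; e₁ must now be inserted at position h₂(k₁) of T₂. Likewise, if that position was empty, the insertion succeeds; otherwise the element that was there is kicked back to T₁, and so on.

The kick operations above can loop forever, so cuckoo hashing sets an upper limit on the number of kicks per insertion. If inserting an element takes more kicks than this limit, the insertion fails.

To look up k₀, compute h₁(k₀) and h₂(k₀), then check position h₁(k₀) of T₁ and position h₂(k₀) of T₂ for k₀. If it is found, return the corresponding key-value pair; otherwise the lookup fails.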
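The insert and lookup procedure described above can be sketched as follows. This is a minimal illustration, not the code in cuckoo.h: the names BasicCuckoo, Slot, and mix64 are made up for this example, a simple 64-bit mixer stands in for MurmurHash, both tables are given the same size, and keys are assumed to be distinct.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <utility>
#include <vector>

// Hypothetical names throughout; this is not the repository's Cuckoo class.
struct Slot {
    bool used = false;
    std::uint64_t key = 0;
    std::uint64_t value = 0;
};

// Stand-in 64-bit mixer (assumption: the repository uses MurmurHash instead).
inline std::uint64_t mix64(std::uint64_t x) {
    x ^= x >> 30; x *= 0xbf58476d1ce4e5b9ULL;
    x ^= x >> 27; x *= 0x94d049bb133111ebULL;
    x ^= x >> 31;
    return x;
}

class BasicCuckoo {
public:
    BasicCuckoo(std::size_t slots_per_table, int max_kicks)
        : t1_(slots_per_table), t2_(slots_per_table), max_kicks_(max_kicks) {
        assert(slots_per_table > 0 && max_kicks >= 0);
    }

    // Assumes distinct keys. Returns false once the kick limit is reached;
    // the element still "in hand" at that point is lost.
    bool insert(std::uint64_t key, std::uint64_t value) {
        Slot cur;
        cur.used = true; cur.key = key; cur.value = value;
        bool in_t1 = true;  // which table the in-hand element targets
        for (int kicks = 0; ; ++kicks) {
            Slot& slot = in_t1 ? t1_[h1(cur.key)] : t2_[h2(cur.key)];
            if (!slot.used) { slot = cur; return true; }
            if (kicks == max_kicks_) return false;
            std::swap(slot, cur);  // kick the occupant out
            in_t1 = !in_t1;        // the victim goes to the other table
        }
    }

    // Checks the single candidate slot in each table.
    bool search(std::uint64_t key, std::uint64_t* value_out) const {
        const Slot& a = t1_[h1(key)];
        if (a.used && a.key == key) { *value_out = a.value; return true; }
        const Slot& b = t2_[h2(key)];
        if (b.used && b.key == key) { *value_out = b.value; return true; }
        return false;
    }

private:
    std::size_t h1(std::uint64_t k) const { return mix64(k) % t1_.size(); }
    std::size_t h2(std::uint64_t k) const {
        return mix64(k ^ 0x9e3779b97f4a7c15ULL) % t2_.size();
    }

    std::vector<Slot> t1_, t2_;
    int max_kicks_;
};
```

Note that when insert returns false, the lost element may be one that was stored earlier rather than the one being inserted, which is the element-loss problem mentioned in the next section.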

Linear Probing Cuckoo Hashing

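To reach a high load factor, cuckoo hashing needs a very high kick limit, which makes updates slow. In addition, cuckoo hashing loses an element when an insertion fails. Our algorithm addresses these drawbacks.

To insert e₀ = <k₀, v₀>, linearly probe the m₁ positions of T₁ starting at h₁(k₀) and the m₂ positions of T₂ starting at h₂(k₀). If all of these positions are full, kick the element at position h₁(k₀) of T₁ to T₂ to make room for e₀. Likewise, inserting an element into T₂ also linearly probes m₂ positions. Lookups are modified accordingly.

This change significantly raises the achievable load factor, to above 90%.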
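One possible reading of the insertion and lookup rules in this section is sketched below. It is an illustration under stated assumptions, not the implementation in cuckoo.h: the names LpCuckooSketch, Entry, and mix are hypothetical, a simple mixer stands in for MurmurHash, probe windows wrap around the end of a table, keys are assumed distinct, and an element still in hand when the kick limit is reached is simply dropped (the repository may handle that case differently).

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <utility>
#include <vector>

// Hypothetical sketch; names and details are not taken from cuckoo.h.
struct Entry {
    bool used = false;
    std::uint64_t key = 0;
    std::uint64_t value = 0;
};

// Stand-in seeded mixer (assumption: the repository uses MurmurHash).
inline std::uint64_t mix(std::uint64_t x, std::uint64_t seed) {
    x += seed;
    x ^= x >> 30; x *= 0xbf58476d1ce4e5b9ULL;
    x ^= x >> 27; x *= 0x94d049bb133111ebULL;
    x ^= x >> 31;
    return x;
}

class LpCuckooSketch {
public:
    LpCuckooSketch(std::size_t n1, std::size_t n2,
                   std::size_t m1, std::size_t m2, int max_kicks)
        : t1_(n1), t2_(n2), m1_(m1), m2_(m2), max_kicks_(max_kicks) {
        assert(m1 > 0 && m2 > 0 && n1 >= m1 && n2 >= m2);
    }

    bool insert(std::uint64_t key, std::uint64_t value) {
        Entry cur;
        cur.used = true; cur.key = key; cur.value = value;
        // Step 1: try every slot in both probe windows.
        if (place(t1_, h1(cur.key), m1_, cur)) return true;
        if (place(t2_, h2(cur.key), m2_, cur)) return true;
        // Step 2: evict the element at the home slot, then try to place the
        // victim in the other table's probe window; repeat up to the limit.
        bool in_t1 = true;
        for (int kicks = 0; kicks < max_kicks_; ++kicks) {
            Entry& home = in_t1 ? t1_[h1(cur.key)] : t2_[h2(cur.key)];
            std::swap(home, cur);
            in_t1 = !in_t1;
            bool placed = in_t1 ? place(t1_, h1(cur.key), m1_, cur)
                                : place(t2_, h2(cur.key), m2_, cur);
            if (placed) return true;
        }
        return false;  // the element in cur is dropped in this sketch
    }

    // Scans the m1 slots in T1 and the m2 slots in T2 for the key.
    bool search(std::uint64_t key, std::uint64_t* value_out) const {
        const Entry* e = find(t1_, h1(key), m1_, key);
        if (!e) e = find(t2_, h2(key), m2_, key);
        if (!e) return false;
        *value_out = e->value;
        return true;
    }

private:
    static bool place(std::vector<Entry>& t, std::size_t start,
                      std::size_t m, const Entry& e) {
        for (std::size_t i = 0; i < m; ++i) {
            Entry& s = t[(start + i) % t.size()];
            if (!s.used) { s = e; return true; }
        }
        return false;
    }

    static const Entry* find(const std::vector<Entry>& t, std::size_t start,
                             std::size_t m, std::uint64_t key) {
        for (std::size_t i = 0; i < m; ++i) {
            const Entry& s = t[(start + i) % t.size()];
            if (s.used && s.key == key) return &s;
        }
        return nullptr;
    }

    std::size_t h1(std::uint64_t k) const { return mix(k, 0) % t1_.size(); }
    std::size_t h2(std::uint64_t k) const {
        return mix(k, 0x9e3779b97f4a7c15ULL) % t2_.size();
    }

    std::vector<Entry> t1_, t2_;
    std::size_t m1_, m2_;
    int max_kicks_;
};
```

Because an evicted element is always swapped out of a home slot and the replacement lands at offset 0 of its own window, every stored element stays inside one of the two windows that search scans. The trade-off is that a lookup touches up to m₁ + m₂ slots instead of two.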

Evaluations

Experimental setup

Platform

Intel(R) Core(TM) i9-10980XE CPU @ 3.00GHz

We implement all code in C++ and build it with g++ 7.5.0 and the -O3 option.

Workload

We randomly generate 10,000,000 key-value pairs, where the key and the value in each pair are 8 bytes each.

Hash functions and parameters

We use MurmurHash for hash calculations. We fix the size of LP-Cuckoo to 10,000,000 slots, where T₁ has 7,500,000 slots and T₂ has 2,500,000 slots. We set m₁ = 8 and m₂ = 8, and set the kick threshold to 35. We sequentially insert the 10,000,000 key-value pairs until the 500th insertion failure, then calculate the load factor.
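The measurement procedure described above could be scripted roughly as follows. This is a sketch under assumptions, not the code in main.cpp: measure_load_factor and its parameters are hypothetical names, the table is passed in as any callable taking a key and a value and returning whether the insertion succeeded, and the load factor is taken to be the number of successful insertions divided by the total number of slots.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <random>

// Hypothetical helper: offers up to max_items random 8-byte keys and values
// to insert(key, value) and stops early after max_failures failed insertions.
// Returns successful insertions / total_slots (an assumed definition).
template <class InsertFn>
double measure_load_factor(InsertFn insert, std::size_t total_slots,
                           std::size_t max_items, std::size_t max_failures,
                           std::uint64_t seed = 1) {
    assert(total_slots > 0);
    std::mt19937_64 rng(seed);
    std::size_t stored = 0;
    std::size_t failures = 0;
    for (std::size_t i = 0; i < max_items && failures < max_failures; ++i) {
        const std::uint64_t key = rng();
        const std::uint64_t value = rng();
        if (insert(key, value)) {
            ++stored;
        } else {
            ++failures;
        }
    }
    return static_cast<double>(stored) / static_cast<double>(total_slots);
}
```

With the setup in this section, the call would use total_slots = 10,000,000, max_items = 10,000,000, and max_failures = 500, with a lambda that forwards to the table's insert function.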

Results

Load factor: 95.61%

Insert speed: 10.19M/s (average), 4.29M/s (load factor > 90%)

Query speed: 18.84M/s (positive), 13.60M/s (negative)

How to run

git clone https://github.com/fdvty/cuckoo.git
cd cuckoo
make clean; make
./cuckoo
