Cuckoo Filter

The Fast Forward Labs team explored probabilistic data structures in our "Probabilistic Methods for Real-time Streams" report and prototype (contact us if you're interested in this topic). We provided an update to that report here, exploring Cuckoo filters, a new probabilistic data structure that improves upon the standard Bloom filter. The Cuckoo filter provides a few advantages:

it enables dynamic deletion and addition of items
it can be easily implemented compared to Bloom filter variants with similar capabilities, and
for similar space constraints, the Cuckoo filter provides lower false positives, particularly at lower capacities. We provide a python implementation of the Cuckoo filter here, and compare it to a counting Bloom filter (a Bloom filter variant).

This repository contains a python implementation of the Cuckoo filter, as well as a copy-paste of a counting Bloom filter from the fuggedaboutit repository for benchmarking.

Please see our post for more details on the Cuckoo filter.

Demo

Below we show how to go about using this package.

>>> from cuckoofilter import CuckooFilter
>>> c_filter = CuckooFilter(10000, 2)

>>> c_filter.insert('James')
>>> print("James in c_filter == {}".format("James" in c_filter))
James in c_filter == True

>>> c_filter.remove('James')
>>> print("James in c_filter == {}".format("James" in c_filter))
James in c_filter == False

Similarly, the counting Bloom filter can be used as well.

>>> from cuckoofilter import CountingBloomFilter
>>> b_filter = CountingBloomFilter(10000)

>>> b_filter.add('James')
>>> print("James in c_filter == {}".format("James" in c_filter))
James in b_filter == True

>>> b_filter.remove('James')
>>> print("James in c_filter == {}".format("James" in c_filter))
James in b_filter == False

References

Below we link to a few references that contributed to the work shown here:

Fan et. al. Cuckoo Filter: Practically Better Than Bloom
CS 166 Stanford lecture Cuckoo Hashing
Charles Ren, Course Notes. An Overview of Cuckoo Hashing

Name		Name	Last commit message	Last commit date
Latest commit History 55 Commits
cuckoofilter		cuckoofilter
images		images
.gitignore		.gitignore
LICENSE.md		LICENSE.md
README.md		README.md
__init__.py		__init__.py
bench_marking_notebook.ipynb		bench_marking_notebook.ipynb
cuckoo_filter_notebook.ipynb		cuckoo_filter_notebook.ipynb
setup.py		setup.py
test_cuckoo_filter.py		test_cuckoo_filter.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

cuckoofilter

cuckoofilter

images

images

.gitignore

.gitignore

LICENSE.md

LICENSE.md

README.md

README.md

init.py

init.py

bench_marking_notebook.ipynb

bench_marking_notebook.ipynb

cuckoo_filter_notebook.ipynb

cuckoo_filter_notebook.ipynb

setup.py

setup.py

test_cuckoo_filter.py

test_cuckoo_filter.py

Repository files navigation

Cuckoo Filter

Demo

References

About

Releases

Packages

Contributors 4

Languages

License

fastforwardlabs/cuckoofilter

Folders and files

Latest commit

History

Repository files navigation

Cuckoo Filter

Demo

References

About

Resources

License

Stars

Watchers

Forks

Languages