A simple, time-tested family of random hash functions in Python, based on CRC32 and xxHash, affine transformations, and the Mersenne Twister. 🎲
Updated Jun 6, 2022 - Python
Python implementations of the Flajolet-Martin, LogLog, SuperLogLog, and HyperLogLog cardinality estimation algorithms, applied to estimating the number of unique traffic violations in NYC in the 2019 fiscal year
Bloom filtering, Flajolet-Martin algorithm, and reservoir sampling
Streaming data processing and analytics in Spark
USC DSCI 553 - Foundations & Applications of Data Mining - Spring 2024 - Prof. Wei-Min Shen
Simple Projects in Data Mining
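For readers new to the topic, the Flajolet-Martin estimator that these repositories share can be sketched in a few lines. This is a minimal illustration, not the code of any repository listed here: the names `fm_estimate`, `_hash32`, and `_lowest_unset_bit` are hypothetical, the seeded BLAKE2b hash is an assumed stand-in for whatever hash family a real implementation uses, and the combination step is the plain average over hash functions rather than the stochastic averaging used in later variants.

```python
import hashlib
import statistics

PHI = 0.77351  # bias-correction constant from Flajolet and Martin's analysis


def _hash32(item: str, seed: int) -> int:
    """Hypothetical helper: a seeded 32-bit hash built from BLAKE2b."""
    digest = hashlib.blake2b(
        item.encode("utf-8"),
        digest_size=4,
        salt=seed.to_bytes(8, "little"),
    ).digest()
    return int.from_bytes(digest, "little")


def _trailing_zeros(value: int, width: int = 32) -> int:
    """Number of trailing zero bits; an all-zero value counts as `width`."""
    if value == 0:
        return width
    return (value & -value).bit_length() - 1


def _lowest_unset_bit(bitmap: int) -> int:
    """Index of the least significant 0 bit in `bitmap`."""
    index = 0
    while (bitmap >> index) & 1:
        index += 1
    return index


def fm_estimate(stream, num_hashes: int = 64) -> float:
    """Estimate the number of distinct items in `stream`."""
    bitmaps = [0] * num_hashes
    for item in stream:
        for seed in range(num_hashes):
            # Record how many trailing zeros this item's hash has.
            r = _trailing_zeros(_hash32(item, seed))
            bitmaps[seed] |= 1 << r
    # Average the lowest-unset-bit index over all hash functions,
    # then apply the bias correction.
    mean_r = statistics.mean(_lowest_unset_bit(b) for b in bitmaps)
    return 2 ** mean_r / PHI


if __name__ == "__main__":
    # 5,000 distinct items, each seen twice.
    data = [f"item-{i % 5000}" for i in range(10_000)]
    print(f"estimated distinct items: {fm_estimate(data):.0f}")
```

Because each bitmap only records which trailing-zero counts have been seen, repeated items leave the state unchanged, which is why memory stays at a few machine words per hash function however long the stream is. More hash functions reduce the variance at the cost of more hashing per item.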