A simple, time-tested, family of random hash functions in Python, based on CRC32 and xxHash, affine transformations, and the Mersenne Twister. 🎲
-
Updated
Jun 6, 2022 - Python
A simple, time-tested, family of random hash functions in Python, based on CRC32 and xxHash, affine transformations, and the Mersenne Twister. 🎲
A simple, time-tested, family of random hash functions in Java, based on CRC32, affine transformations, and the Mersenne Twister. 🎲
python implementations of the Flajolet-Martin, LogLog, SuperLogLog, and HyperLogLog cardinality estimation algorithms, specifically used to estimate the cardinality of unique traffic violations in NYC in the 2019 fiscal year
Comparative Analysis of Unsupervised Learning Methods for Real-time Anomaly Detection in Industrial Control Systems (ICS)
Basic implementation of Bloom filter and Flajolet-Martin algorithms in python with hashes and test files
Bloom filtering, Flajolet-Martin algorithm, and reservoir sampling
Streaming data in Spark and doing data analytics
USC DSCI 553 - Foundations & Applications of Data Mining - Spring 2024 - Prof. Wei-Min Shen
This repository contains the assignments and project codes created during the Big data coursework
Simple Projects in Data Mining
Add a description, image, and links to the flajolet-martin topic page so that developers can more easily learn about it.
To associate your repository with the flajolet-martin topic, visit your repo's landing page and select "manage topics."