A curated list of awesome big data frameworks, ressources and other awesomeness.
-
Updated
Mar 15, 2023
A curated list of awesome big data frameworks, ressources and other awesomeness.
Apache Kafka® running on Kubernetes
Probabilistic data structures for processing continuous, unbounded streams.
A lightweight stream processing library for Go
NIST Certified SCAP 1.2 toolkit
An open-source framework for real-time anomaly detection using Python, ElasticSearch and Kibana
A stream processing API for Go (alpha)
Series and Panels for Real-time and Exploratory Analysis of Data Streams
Conduit streams data between data stores. Kafka Connect replacement. No JVM required.
Public tracker for Scramjet Cloud Platform, a platform that bring data from many environments together.
Data stream analytics: Implement online learning methods to address concept drift in data streams using the River library. Code for the paper entitled "PWPAE: An Ensemble Framework for Concept Drift Adaptation in IoT Data Streams" published in IEEE GlobeCom 2021.
The Open Source Time-Series Data Historian
Probabilistic deep learning for data streams.
The Tornado
Kafka-ML: connecting the data stream with ML/AI frameworks (now TensorFlow and PyTorch!)
Event Based Applications [DEPRECATED]
Learn how to use Kinesis Firehose, AWS Glue, S3, and Amazon Athena by streaming and analyzing reddit comments in realtime. 100-200 level tutorial.
Explore Apache Kafka data pipelines in Kubernetes.
Simple yet powerful live data computation framework.
unsupervised concept drift detection
Add a description, image, and links to the data-stream topic page so that developers can more easily learn about it.
To associate your repository with the data-stream topic, visit your repo's landing page and select "manage topics."