A curated list of awesome big data frameworks, ressources and other awesomeness.
-
Updated
Mar 15, 2023
A curated list of awesome big data frameworks, ressources and other awesomeness.
Miller is like awk, sed, cut, join, and sort for name-indexed data such as CSV, TSV, and tabular JSON
Fancy stream processing made operationally mundane
Open-Source Web UI for Apache Kafka Management
Utils for streaming large files (S3, HDFS, gzip, bz2...)
ReadySet is a lightweight SQL caching engine written in Rust that helps developers enhance the performance and scalability of existing applications.
Pravega - Streaming as a new software defined storage primitive
A lightweight stream processing library for Go
Open-source graph database, built for real-time streaming data, compatible with Neo4j.
Trill is a single-node query processor for temporal or streaming data.
Real-time stream processing for python
A machine learning package for streaming data in Python. The other ancestor of River.
HStreamDB is an open-source, cloud-native streaming database for IoT and beyond. Modernize your data stack for real-time applications.
A list about Apache Kafka
Full stack application platform for building stateful microservices, streaming APIs, and real-time UIs
Add a description, image, and links to the streaming-data topic page so that developers can more easily learn about it.
To associate your repository with the streaming-data topic, visit your repo's landing page and select "manage topics."