Denormalized is a fast embeddable stream processing engine built on Apache DataFusion. It currently supports kafka as a real-time source and a sink, windowed aggregations, and stream joins.
- Docker
- Rust/Cargo installed
- Start kafka in docker
docker run -p 9092:9092 --name kafka apache/kafka
- Start emitting some sample data:
cargo run --example emit_measurements
- Run a simple streaming aggregation on the data using denormalized:
cargo run --example simple_aggregation
A more powerful example can be seen in our Kafka ridesharing example
- Stream aggregation
- Stream joins
- Checkpointing / restoration
- Session windows
- Stateful UDF API
- DuckDB support
- Reading/writing from Postgres
- Python bindings
- Typescript bindings
- UI
Denormalized is built and maintained by Denormalized in San Francisco.
This repo is still a work-in-progress and we are actively seeking design partners. If you have have a specific use-case you'd like to discuss please drop us a line via a github issue or email hello@denormalized.io
.