Twitter bot
-
Updated
Feb 7, 2021 - Python
Twitter bot
Apache Beam Katas exercises 🚀 https://beam.apache.org/blog/beam-kata-release/
Crane is a distributed, fault-tolerant, stream processing system, like Apache Storm
Apache Spark Streaming application which reads web log data and monitors for 404 errors.
MNIST digit recognition sample for Streams Flows
A Data Engineering project using Pyspark, Pyspark Structured Streaming, PostgreSQL Kimball Data Warehouse, Kafka, and Superset.
HyperStream
MNIST digit recognition notebook sample
cloud-array is an open-source Python library for storing and streaming large Numpy Arrays on local file systems and major cloud proviers CDNs.
Personal Apache Beam studies repository
Construct a streaming event pipeline around Apache Kafka and its ecosystem. Using public data from the Chicago Transit Authority, we will construct an event pipeline around Kafka that allows us to simulate and display the status of train lines in real time.
A concurrent streaming package
General repository for the USGS Airflow Processing Pipeline
I'm documenting my learning of Data Engineering here.
GPS data processing based on python streamz
Calco Python API implementation. Contract-based approach to declaratively specify distributed dataflows
Provides a basic edge simulator
Python packages for IBM Streams Standard Toolkit
Pyrandall project, supporting active development
Add a description, image, and links to the stream-processing topic page so that developers can more easily learn about it.
To associate your repository with the stream-processing topic, visit your repo's landing page and select "manage topics."