Simple datalake
Updated Jun 22, 2024 - Python
Spark Structured Streaming data pipeline that processes movie ratings data in real-time.
A data pipeline built with Docker, Airflow, Kafka, Spark Streaming, and Cassandra
Spark Structured Streaming with Kafka Integration
Kafka streaming job from iomete. This streaming job copies data from Kafka to Iceberg.
Building a scalable solution using Spark and Kafka to discover trending topics within Meetup data using Z-Score analysis.
Spark Structured Streaming vs Kafka Streams
Twitter web app that uses Apache Kafka and Spark to perform analysis
A distributed streaming data processing pipeline.
Streaming Synthetic Sales Data Generator: a sales data generator for Apache Kafka, written in Python
Stream processing pipeline for analyzing live chat data
PySpark sample for upserting data into an Oracle table
CPU anomaly detection with Spark
A naive anomaly detection and data visualization tool for F1 onboard telemetry data.
Spark Examples
Statistical analyses of San Francisco crime incidents using Apache Spark Structured Streaming
A Log Analytics demo based on Spark Structured Streaming + Kafka
Sentiment Analysis of a Twitter Topic with Spark Structured Streaming
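One of the entries above mentions discovering trending topics with Z-score analysis. As a rough illustration of that idea only (not that repository's actual code), the sketch below scores each topic's latest count against its own history using plain Python. The function names `z_score` and `trending`, the `threshold` default, and the sample counts are all hypothetical.

```python
from statistics import mean, pstdev


def z_score(history, current):
    """Return how many standard deviations `current` lies from the mean of `history`."""
    mu = mean(history)
    sigma = pstdev(history)  # population standard deviation of past counts
    if sigma == 0:
        # Flat history: treat as "no signal" rather than dividing by zero.
        return 0.0
    return (current - mu) / sigma


def trending(counts_by_topic, threshold=2.0):
    """Return topics whose latest count scores above `threshold`, highest first.

    Each value is a list of per-window counts (at least two entries);
    the last entry is the current window and the rest are the history.
    """
    scores = {
        topic: z_score(counts[:-1], counts[-1])
        for topic, counts in counts_by_topic.items()
    }
    hot = [topic for topic, score in scores.items() if score > threshold]
    return sorted(hot, key=lambda topic: -scores[topic])


# Hypothetical per-window mention counts for three topics.
sample = {
    "python": [10, 12, 11, 9, 10, 30],
    "hiking": [20, 22, 19, 21, 20, 21],
    "chess": [5, 5, 5, 5, 5, 5],
}
print(trending(sample))
```

In a streaming job the per-window counts would typically come from a windowed aggregation rather than an in-memory dictionary; the scoring step itself stays the same.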