This repository contains an alternative Apache Spark-based analysis tier for the TweetPipe streaming data pipeline
-
Updated
Aug 23, 2020 - Java
This repository contains an alternative Apache Spark-based analysis tier for the TweetPipe streaming data pipeline
Implementation of Static mining part of "Mining maximal frequent patterns in transactional databases and dynamic data streams: A spark-based approach" Information Sciences, Volume 432, March 2018, Pages 278-300
基于Spark 3.1.x 数据源API实现的MQ数据源示例代码
This repo contains examples of high throughput ingestion using Apache Spark and Apache Iceberg. These examples cover IoT and CDC scenarios using best practices. The code can be deployed into any Spark compatible engine like Amazon EMR Serverless or AWS Glue. A fully local developer environment is also provided.
Spark 3.0.0 Structured Streaming Kafka Avro Demo
Add a description, image, and links to the structured-streaming topic page so that developers can more easily learn about it.
To associate your repository with the structured-streaming topic, visit your repo's landing page and select "manage topics."