An open source framework for building data analytic applications.
-
Updated
Jun 5, 2024 - Java
An open source framework for building data analytic applications.
a suite of benchmark applications for distributed data stream processing systems
Schema Registry
Solving Big Data Problems using Spark framework in Java. Running the Project on HDFS clusters (BigData@Polito) to get the results.
A data pipeline is about "Random Number Counting"
Distributed Anonymization Platform for SQL databases
基于 Spark Streaming 的电影推荐系统
A Flexible and Powerful Parameter Server for large-scale machine learning
The current repository contains all the code developed during the Big Data processing and Analytics laboratories. Data are processed and analyzed using Hadoop and Spark
New Socket TCP Project
SparkStreamingProj Local save JSON
Kafka Source/Sink for reading/writing to kafka topic
Big Data projects for beginners
Repartitions a spark RDD
This project aims to consume a twitter stream via Apache Kafka, apply a sentiment analysis on the tweets with Apache Spark jobs and save the result into Apache HBase.
Add a description, image, and links to the spark-streaming topic page so that developers can more easily learn about it.
To associate your repository with the spark-streaming topic, visit your repo's landing page and select "manage topics."