# apachespark
Here are 6 public repositories matching this topic...
Source code for the work "dSpark: Deadline-Based Resource Allocation for Big Data Applications in Apache Spark" published in IEEE e-Science 2017
Updated Apr 4, 2018 - Java
Use this project to join data from multiple CSV files. It currently supports one-to-one and one-to-many joins, and it also shows how to use a Kafka producer efficiently with Spark.
java
kafka
spark
kafka-producer
one-to-many
spark-java
spark-sql
spark-kafka-integration
spark-dataframes
spark-csv
apachespark
spark-kafka
kafka-spark
one-to-many-join
one-to-one-join
kafka-producer-spark
one-to-many-joins-spark
join-apache-spark
kafka-with-spark
integrate-kafka-spark
Updated Jul 1, 2022 - Java
Upserts, Deletes And Incremental Processing on Big Data.
bigdata
stream-processing
data-integration
datalake
apachespark
hudi
apachehudi
incremental-processing
apacheflink
Updated Jun 19, 2024 - Java