Projects completed as part of the CSE 6332 CCBD course at UTA, covering distributed computing, data processing frameworks, and cloud platforms.
Updated Jun 22, 2024 - Java
Source code for the paper "dSpark: Deadline-Based Resource Allocation for Big Data Applications in Apache Spark", published in IEEE e-Science 2017.
Use this project to join data from multiple CSV files. It currently supports one-to-one and one-to-many joins. It also shows how to use a Kafka producer efficiently with Spark.
Upserts, Deletes And Incremental Processing on Big Data.