Apache Spark Course Material
-
Updated
Apr 21, 2023 - Scala
Apache Spark Course Material
Data cleaning, pre-processing, and Analytics on a million movies using Spark and Scala.
This repository is created by Dharshan Kumar K S and Siva Prakash as part of our semester project from 'Big Data Analysis' subject
Minimal project setup required to run applications on Spark
Demo of spark program with cucumber framework using scala
Demonstration of basic data transformations using Spark RDD and Spark DataFrame in Scala
Lightweight type-safe operations for Spark
Assignment for Scalable Machine Learning which aims to study the basics of regression and classification in Spark.
A Spark framework written in Scala with gradle as build tool.
Using Scala for big data computations for basic tasks
This Spark App analyses various covid cases data and enables you to create custom mathematical insights using a unified data structure and a trait method. After processing data it then writes to Cassandra which is then used as primary source for Data Visualization.
Spark BigQuery Parallel
Coursework from Functional Programming in Scala Coursera specialization.
This repository contains spark scala programs
Repository to demonstrate sample data engineering assignment
Add a description, image, and links to the spark-scala topic page so that developers can more easily learn about it.
To associate your repository with the spark-scala topic, visit your repo's landing page and select "manage topics."