Data cleaning, pre-processing, and Analytics on a million movies using Spark and Scala.
-
Updated
May 19, 2021 - Scala
Data cleaning, pre-processing, and Analytics on a million movies using Spark and Scala.
Apache Spark Course Material
Spark BigQuery Parallel
Lightweight type-safe operations for Spark
Techniques for analyzing and visualizing data at scale.
Hadoop hdfs mapreduce hive spark使用案例
This repository is created by Dharshan Kumar K S and Siva Prakash as part of our semester project from 'Big Data Analysis' subject
A Spark framework written in Scala with gradle as build tool.
Demonstration of basic data transformations using Spark RDD and Spark DataFrame in Scala
Assignment for Scalable Machine Learning which aims to study the basics of regression and classification in Spark.
This is the repository for Youtube Project for the subject PBDA. We are implementing analysis for finding top videos in each and every category. Also, we are planning to find the top trending words in each category.
Coursework from Functional Programming in Scala Coursera specialization.
Demo of spark program with cucumber framework using scala
Summarizing large text document using LDA (Latent Dirichlet Allocation) in SPARK-Scala framework
Learning Journey: Spark using Scala, Python, PySpark
Minimal project setup required to run applications on Spark
Apache Spark mllib example for seminar 'AI with scala'
Add a description, image, and links to the spark-scala topic page so that developers can more easily learn about it.
To associate your repository with the spark-scala topic, visit your repo's landing page and select "manage topics."