This is the github repo for Learning Spark: Lightning-Fast Data Analytics [2nd Edition]
-
Updated
May 8, 2024 - Scala
This is the github repo for Learning Spark: Lightning-Fast Data Analytics [2nd Edition]
This package contains the code for calculating external clustering validity indices in Spark. The package includes Chi Index among others.
Maven project cover scala language: sparkml, spark_streaming, spark_dataframe, ... + java language: threadpool, kafka, jpa, timer, request api
Qubole Sparklens tool for performance tuning Apache Spark
I do some basic statistics and machine learning work on a dataset of tornado events across the United States. The dataset is nowhere near big enough to warrant using Spark over something like R, but I was looking for practice. I do some basic SQL to find out which years and states saw the most tornadoes and the most F5 tornadoes. Then I use Spar…
For detecting the fraud credit card transactions at real time
Revature Big Data Project 2 - Collaborative work with SamuelTaylr, AtwalMandeep, and mark-coffer
✨ Spark ML implementation of SOM algorithm (Kohonen self-organizing map)
using spark to predict stock, the data come from sina
🌟 ✨ Analyze and visualize Twitter Sentiment on a world map using Spark MLlib
Assignment for Scalable Machine Learning which aims to study the basics of regression and classification in Spark.
Developing a Model-based (Alternating Least Squares) Movie Recommendation System using Apache Spark and deploying on AWS EMR Cluster
In this project, we are going to build a Bicycle sharing demand prediction service using Apache Spark and Scala. I have created a two spark application one for model generation and another for model demand prediction.
Implementation of SMOTE - Synthetic Minority Over-sampling Technique in SparkML / MLLib
Scala/Spark project, for Languages and Algorithms for Artificial Intelligence class at UNIBO
Add a description, image, and links to the spark-mllib topic page so that developers can more easily learn about it.
To associate your repository with the spark-mllib topic, visit your repo's landing page and select "manage topics."