spark with scala, including rdd, transform, action, hdfs, sparkSQL, dataframe and mllib
-
Updated
Feb 8, 2018 - XSLT
Apache Spark is an open source distributed general-purpose cluster-computing framework. It provides an interface for programming entire clusters with implicit data parallelism and fault tolerance.
spark with scala, including rdd, transform, action, hdfs, sparkSQL, dataframe and mllib
[Scala] ELB Log Analysis and Prediction using Spark.
This application will calculate the daily product revenue that displays date in ascending order and revenue in decending order in Spark & MySQL. It also demonstrates how to reduce Stages & Task in Spark using broadcast variables.
This is Spark/Scala based Mobile Telecommunication Customer Value Added Services Recommendation Model which used Collaborative Filtering based Alternative Least Square Algorithm.
This is spark/Scala based Mobile Telecommunication Customer Churn Prediction model developed using Random Forest algorithm
Created by Matei Zaharia
Released May 26, 2014