Spark library for generalized K-Means clustering. Supports general Bregman divergences. Suitable for clustering probabilistic data, time series data, high dimensional data, and very large data.
-
Updated
Jan 19, 2024 - HTML
Spark library for generalized K-Means clustering. Supports general Bregman divergences. Suitable for clustering probabilistic data, time series data, high dimensional data, and very large data.
Apache Spark™ and Scala Workshops
Explanatory Data Analysis and ML model building using Apache Spark and PySpark
基于Spark的电影推荐系统
Recommendation System written in Python, using the pySpark framework and other Data Science libraries
Code and Data for PyData-Hyderabad-Chapter meetup
Add a description, image, and links to the spark-mllib topic page so that developers can more easily learn about it.
To associate your repository with the spark-mllib topic, visit your repo's landing page and select "manage topics."