Spark library for generalized K-Means clustering. Supports general Bregman divergences. Suitable for clustering probabilistic data, time-series data, high-dimensional data, and very large datasets.
spark
entropy
clustering
embeddings
kullback-leibler-divergence
cosine-similarity
k-means
spark-mllib
similarity-search
euclidean-distance
bregman-divergence
itakura-saito-divergence
Updated Jan 19, 2024 - HTML