kaurrachneet6/Distributed-K-means

Distributed-K-means

This project is an implementation of Distributed k-Means Clustering on General Topologies.

In the era of big data, centralized algorithms like k-means and k-medians need to be scaled to distributed settings.

Since we cannot transfer the whole dataset from each node to a central node, we define a coreset: a small weighted set of points that approximates the clustering cost of the full data. Transferring just the coresets saves a considerable amount of communication cost.
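
To illustrate the idea, here is a minimal sketch of one way a distributed coreset can be built: each node clusters its own data, the nodes share only their local costs, and each node then samples points with probability proportional to their cost contribution. This is a simplified, assumption-laden example and not necessarily the code in this repository; the names `sq_dists`, `lloyd`, and `build_distributed_coreset`, the parameter `t` (total sample size), and the synthetic data are all hypothetical.

```python
import numpy as np

def sq_dists(points, centers):
    """Squared Euclidean distance from every point to every center."""
    return ((points[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)

def lloyd(points, k, iters=20, seed=0):
    """Plain Lloyd's algorithm, used here as the local solver on each node."""
    rng = np.random.default_rng(seed)
    centers = points[rng.choice(len(points), size=k, replace=False)].copy()
    for _ in range(iters):
        labels = sq_dists(points, centers).argmin(axis=1)
        for j in range(k):
            members = points[labels == j]
            if len(members) > 0:  # keep the old center if a cluster empties
                centers[j] = members.mean(axis=0)
    return centers

def build_distributed_coreset(node_data, k, t, seed=0):
    """Return (points, weights) of a coreset assembled from all nodes.

    Assumed scheme: cost-proportional sampling plus weighted local centers.
    """
    rng = np.random.default_rng(seed)
    local = []
    for i, P in enumerate(node_data):
        B = lloyd(P, k, seed=seed + i)
        d2 = sq_dists(P, B)
        local.append((B, d2.argmin(axis=1), d2.min(axis=1)))

    # Only these scalars need to be exchanged before sampling.
    local_costs = np.array([m.sum() for _, _, m in local])
    total = local_costs.sum()
    counts = rng.multinomial(t, local_costs / total)  # samples per node

    pts, wts = [], []
    for P, (B, labels, m), t_i in zip(node_data, local, counts):
        center_w = np.bincount(labels, minlength=k).astype(float)
        if t_i > 0:
            idx = rng.choice(len(P), size=t_i, replace=True, p=m / m.sum())
            w = total / (t * m[idx])  # inverse-probability weights
            # Center weights absorb the rest so each cluster keeps its mass.
            np.subtract.at(center_w, labels[idx], w)
            pts.append(P[idx])
            wts.append(w)
        pts.append(B)
        wts.append(center_w)
    return np.vstack(pts), np.concatenate(wts)

# Synthetic example: three nodes with 500 two-dimensional points each.
rng = np.random.default_rng(42)
nodes = [rng.normal(loc=c, scale=1.0, size=(500, 2)) for c in (0.0, 5.0, 10.0)]
coreset_pts, coreset_w = build_distributed_coreset(nodes, k=3, t=60)
print(coreset_pts.shape, round(float(coreset_w.sum()), 3))
```

In this sketch each node sends only its sampled points and its `k` local centers, and the weights are chosen so that the total weight equals the total number of original points. Note that center weights can become negative under this scheme, so a downstream solver would need to handle that.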

The k-means cost obtained with the coreset algorithm was as low as the cost obtained by running Lloyd’s algorithm on the global dataset, whereas the communication cost of the coreset algorithm was considerably lower.
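
The two quantities being compared can be sketched as follows. The k-means cost is the sum of squared distances to the nearest center. The communication model below is an assumption made for illustration (every node's message is flooded across every edge once), and the topology and point counts are hypothetical, not measurements from this project.

```python
import numpy as np

def kmeans_cost(points, centers):
    """Sum of squared distances from each point to its nearest center."""
    d2 = ((points[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
    return float(d2.min(axis=1).sum())

def flooding_cost(edges, items_per_node):
    """Points transmitted if each node's message crosses every edge once.

    This flooding model is an assumption of the sketch.
    """
    return len(edges) * sum(items_per_node)

# Hypothetical 4-node ring topology.
edges = [(0, 1), (1, 2), (2, 3), (3, 0)]
full_data = [5000, 5000, 5000, 5000]   # hypothetical points held per node
coreset = [65, 40, 70, 55]             # hypothetical coreset size per node

full = flooding_cost(edges, full_data)
small = flooding_cost(edges, coreset)
ratio = small / full
print(full, small, ratio)

# Tiny cost check: two points, one center halfway between them.
toy_cost = kmeans_cost(np.array([[0.0, 0.0], [2.0, 0.0]]), np.array([[1.0, 0.0]]))
print(toy_cost)
```

Under this model the saving depends only on how much smaller the coresets are than the raw data, since both are multiplied by the same number of edges.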
