Skip to content

🔢 Python implementation of the GMeans clustering algorithm for automatic determination of the optimal number of clusters in a dataset.

Notifications You must be signed in to change notification settings

alberto-mate/G-Means

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

1 Commit
 
 
 
 

Repository files navigation

🔢 GMeans Algorithm Implementation

This repository contains an implementation of the GMeans clustering algorithm, providing a robust and efficient way to automatically determine the optimal number of clusters in a dataset.

The GMeans algorithm is an extension of the KMeans algorithm that automatically finds the appropriate number of clusters in the data. The splitting of clusters is done by using a hypothesis test to determine if a cluster follows a Gaussian distribution.

📙 References

[1] Hamerly, G., & Elkan, C.P. (2003). Learning the k in k-means. Neural Information Processing Systems.

[2] R.B. D’Augostino & M.A. Stephens, Eds., 1986, Goodness-of-Fit Techniques, Marcel Dekker.

About

🔢 Python implementation of the GMeans clustering algorithm for automatic determination of the optimal number of clusters in a dataset.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages