The gap statistic was introduced by Robert Tibshirani and co-authors at Stanford in 2000. Determining the ideal number of clusters in a dataset is one of the key tasks in any clustering exercise, and after the elbow method, the gap statistic is one of the most prominent techniques used for it. The basic idea is to compute a goodness-of-clustering measure for an increasing number of clusters: the within-cluster dispersion of the data is compared with the dispersion expected under a reference distribution that has no cluster structure. Please go through the notebook for a full explanation.
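
The idea described above can be sketched in a few lines of plain Python. This is a minimal illustration and not the notebook's code. It assumes k-means as the clustering method, a uniform reference distribution over the bounding box of the data, and the "smallest k with Gap(k) >= Gap(k+1) - s(k+1)" selection rule. The function names (`kmeans`, `log_dispersion`, `gap_statistic`) and the parameters `k_max`, `n_refs`, `n_init` and `seed` are hypothetical names chosen for this example.

```python
import math
import random


def kmeans(points, k, rng, n_iter=30):
    """Lloyd's algorithm with k-means++ seeding; returns one label per point."""
    centers = [rng.choice(points)]
    while len(centers) < k:
        # Pick the next seed with probability proportional to squared distance.
        d2 = [min(math.dist(p, c) ** 2 for c in centers) for p in points]
        centers.append(rng.choices(points, weights=d2)[0])
    labels = None
    for _ in range(n_iter):
        new = [min(range(k), key=lambda j: math.dist(p, centers[j])) for p in points]
        if new == labels:  # assignments stopped changing
            break
        labels = new
        for j in range(k):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:  # an empty cluster keeps its previous center
                centers[j] = tuple(sum(c) / len(members) for c in zip(*members))
    return labels


def log_dispersion(points, k, rng, n_init=3):
    """log(W_k): log of the pooled within-cluster sum of squares (best of n_init runs)."""
    best = math.inf
    for _ in range(n_init):
        labels = kmeans(points, k, rng)
        total = 0.0
        for j in range(k):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:
                mean = tuple(sum(c) / len(members) for c in zip(*members))
                total += sum(math.dist(p, mean) ** 2 for p in members)
        best = min(best, total)
    return math.log(best)  # assumes W_k > 0, i.e. more distinct points than clusters


def gap_statistic(points, k_max=5, n_refs=10, seed=0):
    """Return (selected k, list of Gap(k) for k = 1..k_max)."""
    rng = random.Random(seed)
    lows = [min(c) for c in zip(*points)]
    highs = [max(c) for c in zip(*points)]
    gaps, errs = [], []
    for k in range(1, k_max + 1):
        ref_logs = []
        for _ in range(n_refs):
            # Reference data: uniform over the bounding box, same size as the data.
            ref = [tuple(rng.uniform(lo, hi) for lo, hi in zip(lows, highs))
                   for _ in points]
            ref_logs.append(log_dispersion(ref, k, rng))
        mean_ref = sum(ref_logs) / n_refs
        sd = math.sqrt(sum((x - mean_ref) ** 2 for x in ref_logs) / n_refs)
        gaps.append(mean_ref - log_dispersion(points, k, rng))
        errs.append(sd * math.sqrt(1 + 1 / n_refs))
    # Smallest k such that Gap(k) >= Gap(k+1) - s(k+1); fall back to k_max.
    for i in range(k_max - 1):
        if gaps[i] >= gaps[i + 1] - errs[i + 1]:
            return i + 1, gaps
    return k_max, gaps


if __name__ == "__main__":
    # Toy data (an assumption for illustration): three Gaussian blobs in 2D.
    rng = random.Random(42)
    blobs = [(0, 4), (4, 0), (14, 14)]
    data = [(rng.gauss(cx, 0.5), rng.gauss(cy, 0.5))
            for cx, cy in blobs for _ in range(30)]
    best_k, gaps = gap_statistic(data)
    print("selected k:", best_k)
    print("gap values:", [round(g, 3) for g in gaps])
```

On well-separated blobs such as the toy data above, the selected k would be expected to match the number of blobs. The pure-Python k-means keeps the sketch dependency-free; for real datasets a library implementation of the clustering step is likely a better choice.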
Mallikarjun29/Gap_statistics