Given a dataset with cluster labels (ground-truth), it can be hard to determine how well your clustering method is performing on that dataset. Indeed, how do you define how far you are from the ground-truth ? This can be very much problem dependent. Here we introduce a simple Jaccard index measure, combining L0 and L1 norm like to give the user a good idea of how well his method is performing. This method is sensitive to the assignments but also to the number of cluster used, and it thus not monotonic w.r.t the number of clusters inferred.
-
Notifications
You must be signed in to change notification settings - Fork 2
alexandreday/clustering_distance
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
About
Distance between clustering assignments. Non-trivial measure weighting L0 and L1 Jaccard norms.
Topics
Resources
Stars
Watchers
Forks
Releases
No releases published
Packages 0
No packages published