## Evaluating clustering results: focus on metrics that assess the quality of the groupings found in the data. Below is a breakdown of commonly used metrics, followed by a Python example:

# Key Metrics:

## Silhouette Coefficient:
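Measures how similar a sample is to its own cluster compared to other clusters. For each sample, it compares the mean distance to the other points in its own cluster (a) with the mean distance to the points in the nearest other cluster (b), giving s = (b - a) / max(a, b). The coefficient is the mean of s over all samples.
Ranges from -1 to 1:
- Near 1: dense, well-separated clusters.
- Near 0: overlapping clusters.
- Near -1: samples are likely assigned to the wrong cluster.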
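
To make the definition concrete, here is a minimal sketch that computes the per-sample silhouette by hand and checks it against scikit-learn. The dataset `X_demo`, the labels `labels_demo`, and the helper `silhouette_by_hand` are made up for illustration; Euclidean distance is assumed.

```python
import numpy as np
from sklearn.metrics import silhouette_samples, silhouette_score

# Hypothetical toy data: two tight groups on a line.
X_demo = np.array([[0.0], [1.0], [2.0], [10.0], [11.0], [12.0]])
labels_demo = np.array([0, 0, 0, 1, 1, 1])


def silhouette_by_hand(X, labels):
    """Per-sample silhouette s = (b - a) / max(a, b), Euclidean distance."""
    # Full pairwise distance matrix.
    dists = np.linalg.norm(X[:, None, :] - X[None, :, :], axis=-1)
    scores = np.zeros(len(X))
    for i in range(len(X)):
        same = labels == labels[i]
        if same.sum() == 1:
            continue  # a cluster with a single sample keeps a score of 0
        # a: mean distance to the other members of the sample's own cluster.
        a = dists[i, same].sum() / (same.sum() - 1)
        # b: smallest mean distance to the members of any other cluster.
        b = min(
            dists[i, labels == c].mean()
            for c in np.unique(labels)
            if c != labels[i]
        )
        scores[i] = (b - a) / max(a, b)
    return scores


manual = silhouette_by_hand(X_demo, labels_demo)
print(manual.round(3))
print(np.allclose(manual, silhouette_samples(X_demo, labels_demo)))
print(np.isclose(manual.mean(), silhouette_score(X_demo, labels_demo)))
```

`silhouette_score` is the mean of the per-sample values, so inspecting `silhouette_samples` can show which individual points sit between clusters.
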
## Calinski-Harabasz Index:
Calculates the ratio of between-cluster variance to within-cluster variance.
Higher values indicate better-defined clusters.
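
The ratio can be written out directly. The sketch below assumes the usual formulation, in which the between-cluster and within-cluster sums of squares are each divided by their degrees of freedom (k - 1 and n - k). The helper `calinski_harabasz_by_hand` is hypothetical, and the generated blob labels stand in for a clustering result.

```python
import numpy as np
from sklearn.datasets import make_blobs
from sklearn.metrics import calinski_harabasz_score

# Assumption: the true blob labels are used as the "clustering" to score.
X_demo, labels_demo = make_blobs(n_samples=120, centers=3, random_state=0)


def calinski_harabasz_by_hand(X, labels):
    n_samples = X.shape[0]
    cluster_ids = np.unique(labels)
    k = len(cluster_ids)
    overall_mean = X.mean(axis=0)
    between, within = 0.0, 0.0
    for c in cluster_ids:
        members = X[labels == c]
        centroid = members.mean(axis=0)
        # Between-cluster dispersion: size-weighted squared distance
        # from each centroid to the overall mean.
        between += len(members) * np.sum((centroid - overall_mean) ** 2)
        # Within-cluster dispersion: squared distances to the own centroid.
        within += np.sum((members - centroid) ** 2)
    return (between / (k - 1)) / (within / (n_samples - k))


manual_ch = calinski_harabasz_by_hand(X_demo, labels_demo)
print(manual_ch)
print(np.isclose(manual_ch, calinski_harabasz_score(X_demo, labels_demo)))
```
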
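## Davies-Bouldin Index:
Measures the average similarity between each cluster and its most similar cluster, where similarity is the ratio of within-cluster scatter to the distance between cluster centroids.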
Lower values indicate better clustering.   
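
As a rough sketch of how the index is assembled: compute each cluster's scatter (mean distance of its points to its centroid), form the ratio (scatter_i + scatter_j) / distance(centroid_i, centroid_j) for every pair, keep the worst ratio per cluster, and average. The helper `davies_bouldin_by_hand` is hypothetical, Euclidean distance is assumed, and the generated blob labels stand in for a clustering result.

```python
import numpy as np
from sklearn.datasets import make_blobs
from sklearn.metrics import davies_bouldin_score

# Assumption: the true blob labels are used as the "clustering" to score.
X_demo, labels_demo = make_blobs(n_samples=120, centers=3, random_state=0)


def davies_bouldin_by_hand(X, labels):
    cluster_ids = np.unique(labels)
    k = len(cluster_ids)
    centroids = np.array([X[labels == c].mean(axis=0) for c in cluster_ids])
    # Scatter: mean distance from each cluster's points to its centroid.
    scatter = np.array([
        np.linalg.norm(X[labels == c] - centroids[i], axis=1).mean()
        for i, c in enumerate(cluster_ids)
    ])
    worst = np.zeros(k)
    for i in range(k):
        # Similarity of cluster i to every other cluster j.
        ratios = [
            (scatter[i] + scatter[j])
            / np.linalg.norm(centroids[i] - centroids[j])
            for j in range(k)
            if j != i
        ]
        worst[i] = max(ratios)  # the most similar (worst-case) neighbor
    return worst.mean()


manual_db = davies_bouldin_by_hand(X_demo, labels_demo)
print(manual_db)
print(np.isclose(manual_db, davies_bouldin_score(X_demo, labels_demo)))
```
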
  
## Inertia (for K-Means):
Calculates the sum of squared distances of samples to their closest cluster center.
Lower inertia indicates denser clusters.
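
A small sketch of what inertia sums up, recomputed from the fitted centers and labels. The names `X_demo`, `km`, and `manual_inertia` are illustrative, and `n_init=10` is set explicitly only to keep behavior the same across scikit-learn versions.

```python
import numpy as np
from sklearn.cluster import KMeans
from sklearn.datasets import make_blobs

X_demo, _ = make_blobs(n_samples=200, centers=3, random_state=0)

km = KMeans(n_clusters=3, n_init=10, random_state=0).fit(X_demo)

# Look up the center assigned to each sample, then sum squared distances.
assigned_centers = km.cluster_centers_[km.labels_]
manual_inertia = np.sum((X_demo - assigned_centers) ** 2)
print(np.isclose(manual_inertia, km.inertia_))

# Inertia for several values of k on the same data.
for k in range(1, 7):
    model = KMeans(n_clusters=k, n_init=10, random_state=0).fit(X_demo)
    print(k, round(model.inertia_, 1))
```

Inertia generally keeps shrinking as more clusters are added, so it is mainly useful for comparing runs with the same number of clusters, or for looking for the "elbow" where the decrease levels off.
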
## Adjusted Rand Index (ARI):
If ground truth labels are available, the ARI measures the similarity between true labels and predicted cluster labels, adjusting for chance.
Ranges from -0.5 to 1: 1 is a perfect match, values near 0 are what random labeling would produce, and negative values indicate agreement worse than chance.
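
The sketch below illustrates two properties of the ARI on made-up label arrays: it compares partitions rather than label names, so renaming the clusters does not change the score, and unrelated random labelings score close to 0. All array names here are hypothetical.

```python
import numpy as np
from sklearn.metrics import adjusted_rand_score

true_labels = np.array([0, 0, 0, 1, 1, 1, 2, 2, 2])
renamed = np.array([2, 2, 2, 0, 0, 0, 1, 1, 1])      # same groups, new names
one_mistake = np.array([0, 0, 1, 1, 1, 1, 2, 2, 2])  # one sample misplaced

ari_renamed = adjusted_rand_score(true_labels, renamed)
ari_mistake = adjusted_rand_score(true_labels, one_mistake)

# Two independent random labelings of the same 2000 samples.
rng = np.random.default_rng(0)
random_a = rng.integers(0, 4, size=2000)
random_b = rng.integers(0, 4, size=2000)
ari_random = adjusted_rand_score(random_a, random_b)

print(f"Renamed clusters: {ari_renamed}")
print(f"One mistake:      {ari_mistake:.3f}")
print(f"Random labels:    {ari_random:.3f}")
```

This is why K-Means cluster IDs do not need to be matched to the true class IDs before scoring.
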

```python
import numpy as np
from sklearn.cluster import KMeans
from sklearn.metrics import (
    silhouette_score,
    calinski_harabasz_score,
    davies_bouldin_score,
    adjusted_rand_score,
)
from sklearn.datasets import make_blobs

# Generate sample data
X, y = make_blobs(n_samples=300, centers=4, random_state=42)

# Apply K-Means clustering
kmeans = KMeans(n_clusters=4, random_state=42)
labels = kmeans.fit_predict(X)

# Calculate metrics that need only the data and the predicted labels
silhouette = silhouette_score(X, labels)
calinski_harabasz = calinski_harabasz_score(X, labels)
davies_bouldin = davies_bouldin_score(X, labels)
inertia = kmeans.inertia_

print(f"Silhouette Coefficient: {silhouette}")
print(f"Calinski-Harabasz Index: {calinski_harabasz}")
print(f"Davies-Bouldin Index: {davies_bouldin}")
print(f"K-Means Inertia: {inertia}")

# If you have the true labels, you can also calculate the Adjusted Rand Index.
if y is not None:
    ari = adjusted_rand_score(y, labels)
    print(f"Adjusted Rand Index: {ari}")
```

Output:

```text
Silhouette Coefficient: 0.7915830011443039
Calinski-Harabasz Index: 3386.414706995127
Davies-Bouldin Index: 0.2905354431615239
K-Means Inertia: 564.9141808210254
Adjusted Rand Index: 0.9910811504997546
```


