Skip to content

jyu-theartofml/Clustering

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 

Repository files navigation

Simple Comparison of K-means and Hierarchical Clustering in R

For this R code, the mammals milk dataset was used to perfrom K-means and Hierarchical clustering. The following packages are required when running in R: cluster and cluster.datasets.

A few words on k-means clustering : traditional k-means clustering algorithm works nicely with numerical data. However, if you have categorical features, try K-mode. K-mean algorithm doesn't really provide meanininful insights for one-hot-encoded categorical data because the sample space is so sparse for applying euclidean distance calculation. Try either Gower Distance or agglomerative methods.

About

No description or website provided.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages