Clustering Methods

Purpose

The goal of this project is to look deeper into the most common methods of grouping objects on the basis of their similarity. The K-means and EM algorithm can both be used for this general purpose but differ in their strengths and weaknesses.

Packages

The comparison is done using generated data from the bivariate normal distribution. Below is a list of the packages used.

MASS: For generating bivariate data.
ggplot2: For beautiful plots.
cluster: Functions for clustering.
factoextra: ggplot2 compatible silhoutte plots.

Most of the important functions were written by myself for instructive purposes. To view the project in a browser, visit the link beside the repository description in the code tab.

Name		Name	Last commit message	Last commit date
Latest commit History 30 Commits
docs		docs
km_em		km_em
rsconnect/documents/clustering.Rmd/rpubs.com/rpubs		rsconnect/documents/clustering.Rmd/rpubs.com/rpubs
.gitignore		.gitignore
README.html		README.html
README.md		README.md
clustering.Rmd		clustering.Rmd
clustering.Rproj		clustering.Rproj
clustering.log		clustering.log
clustering_functions.R		clustering_functions.R
graphical_model.png		graphical_model.png
methods.png		methods.png

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Clustering Methods

Purpose

Packages

About

Releases

Packages

Languages

derekwayne/clustering

Folders and files

Latest commit

History

Repository files navigation

Clustering Methods

Purpose

Packages

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages