Skip to content
Workshop (6 hours): Clustering (Hdbscan, LCA, Hopach), dimension reduction (UMAP, GLRM), and anomaly detection (isolation forests).
R
Branch: master
Clone or download

Latest commit

Fetching latest commit…
Cannot retrieve the latest commit at this time.

Files

Permalink
Type Name Latest commit message Commit time
Failed to load latest commit information.
R Add lca-covariate-table.R Feb 12, 2020
data-raw gitignores for data/ and data-raw/ Feb 10, 2020
data gitignores for data/ and data-raw/ Feb 10, 2020
docs typo Feb 16, 2020
solutions hopach: begin challenge solution, other edits Feb 14, 2020
.gitignore Don't ignore all html files Feb 16, 2020
0-install.Rmd Add slides link to 0-install.Rmd Feb 16, 2020
1-clean-data.Rmd typo Feb 10, 2020
2-hdbscan.Rmd hdbscan: fix data import file Feb 14, 2020
3-umap.Rmd umap: add chunk names Feb 10, 2020
4-glrm.Rmd Minor glrm edits Feb 12, 2020
5-lca.Rmd minor LCA updates Feb 14, 2020
6-hopach.Rmd hopach: begin challenge solution, other edits Feb 14, 2020
7-isolation-forests.Rmd Add incomplete isolation forest starter Feb 14, 2020
LICENSE Add license Jan 12, 2020
README.md 0-setup.Rmd -> 0-install.Rmd Feb 15, 2020

README.md

Unsupervised Learning in R

Unsupervised machine learning is a class of algorithms that identifies patterns in unlabeled data, i.e. without considering an outcome or target. This workshop will describe and demonstrate powerful unsupervised learning algorithms used for clustering (hdbscan, latent class analysis, hopach), dimensionality reduction (umap, generalized low-rank models), and anomaly detection (isolation forests). Participants will learn how to structure unsupervised learning analyses and will gain familiarity with example code that can be adapted to their own projects.

Author: Chris Kennedy

Prerequisites

This is an intermediate machine learning workshop. Participants should have significant prior experience with R and RStudio, including manipulation of data frames, installation of packages, and plotting.

Prerequisite workshops

Recommended workshops

Technology requirements

Participants should have access to a computer with the following software:

Initial steps for participants

To prepare for the workshop, please download the materials and work through the package installation in 0-install.Rmd. Please report any errors to the GitHub issue queue.

There is also an RStudio Cloud workspace that can be used.

Reporting errors or giving feedback

Please create a GitHub issue to report any errors or give feedback on this workshop.

Resources

Books

You can’t perform that action at this time.