This repository contains projects written for a data mining class. They are roughly themed as follows:
- project_1 - classifcation
- project_2 - clustering
- project_3 - recommendation systems
- project_4 - regression
Here is an example from project 2, which shows the features of a dataset of 5 flowers extracted using an autoencoder. We then perform a compression by reducing the dimensionality from 4096 features to 50 via UMAP. Below is a projection of the resulting clusters:
- Install Python 3.9 or higher.
- Install all dependencies with
pip install -r requirements.txt
. - For any project, start up a jupyter server and open the
.ipynb
file.