Skip to content

Exploration of USArrests data using unsupervised machine learning

Notifications You must be signed in to change notification settings

EstherSlabbert/Final-Capstone-Unsupervised-ML

Repository files navigation

Final Capstone - Unsupervised Machine Learning on US Arrest data

Table of Contents
  1. About The Project
  2. Getting Started
  3. Usage
  4. Contact
  5. Acknowledgments

About The Project

A description of the data in 'UsArrests.csv' is given as: “This data set contains statistics, in arrests per 100,000 residents, for assault, murder, and rape in each of the 50 US states in 1973. Also given is the percent of the population living in urban areas.” This contains a .ipynb file that generates an in-depth PCA and data clustering report of the data using unsupervised machine learning. Dataset can be found: https://www.kaggle.com/datasets/halimedogan/usarrests

(back to top)

Built With

  • Visual Studio Code

(back to top)

Getting Started

This is an example of how you may give instructions on setting up your project locally. To get a local copy up and running follow these simple example steps.

Prerequisites

This is an example of how to list things you need to use the software and how to install them.

  • npm
    npm install npm@latest -g

Installation

  1. Clone the repo
    git clone https://github.com/EstherSlabbert/finalCapstone.git
  2. Install python go to: https://www.python.org/downloads/ download and install
  3. Install sklearn packages
    py -m pip install -U scikit-learn
  4. Install pandas packages
    py -m pip install pandas
  5. Install numpy packages
    py -m pip install numpy
  6. Download the .ipynb and .csv files from this repository
  7. Run the files Project was created and run with Visual Studio Code (to download: https://code.visualstudio.com/) and can be run from there or from Jupyter Notebook.

(back to top)

Usage

Once the files have been downloaded run them with Visual Studio Code or Jupyter Notebook or your preferred choice of program to run .ipynb files. Running with Visual Studio Code one can see the PCA and cluster analysis report generated by this project. Below are some examples of what the code should reproduce:

Screenshot1 Screenshot2

(back to top)

Contact

Esther Slabbert - super.ejs@gmail.com

Project Link: https://github.com/EstherSlabbert/finalCapstone

(back to top)

Acknowledgments

  • HyperionDev Data Science Bootcamp
  • Esther Slabbert

(back to top)