The purpose of this repo is to compile (no pun intended) a knowledge base, and reference repo, for all things machine learning. The information present is not mine, and originates from my graduate work in machine learning, online classes I have completed, or are currently taking, as-well-as articles and blog post I find interesting. All attribution and citations are in the footnote where possible, with a more detailed reference in the reference section at the end.
- Getting the data
- Importing the libraries
import matplotlib.pyplot as plt
matplotlib
is the library andpyplot
is the sub-library
- Importing the dataset
- Missing data
- Categorical data
- Splitting the dataset into training and test sets
- Feature scaling
- Data preprocessing template
- Naive Bayes Classifier 1
-
Datasets
-
Anaconda with Python3.6 for specific OS:
-
R and RStudio for specific OS:
1: ODU MISM607, Machine Learning I, Machine Learning A-Z, Hands on Python & R in Data Science (Udemy 2018)