This Repo is designed to be a guide for different Machine Learning Models and Algorithms. All Models have a - Notebook - Jupyter Notebook with Step-by-Step Instructions - Script - Python files to show how to deploy models
The Repo starts of very simple with LinearRegression and progressivly gets more advanced. I'm trying explain the Algorithms as best as possible. Many of the text here have been written with help from ChatGPT.
You can use the code with the supplied datasets but I've also tried to make the code modular and unviersal applicable to other datasets. (There are some limitations with this, especially with Preprocessing the data). The only place where changes are required are in the YAML Config file. I've set it up this way so you can limit what columns you want to include as features.
I'm building this Repo as part of my portfolio but also as a contionous learning project. If you spot any errors or have suggestions for the repo or specific algorthims or notebooks please don't hesitate to reach out.
This Repo is a work in progress and I will continously update it with more Notebooks and Scripts. I'm also working on a similar Repo for DeepLearning Algorithms which I will release soon.
- install
requirements.txt
- If you using the supplied datasets you can run the scripts from the terminal and the Notebooks directly
- If you're using other datasets, you need to adjust the Features and Target variable in the YAML file.
- In the
config.yaml
file you can also adjust the hyperparamters
- LinearRegression (OLS)
- Gradient Boosting Regressor (XGBoost)
- Random Forest Regressor (Build with Streamlit as an endpoint)
- Random Forest Classifier
- Gradient Boosting Classifier (XGBoost)