simpleML

🔜

Simple ML is command-line machine learning utility written in Python3.7 wrapped over existing ML libraries.

Aim is to attain no-code ML training while still having the ability to use multiple(all) technique/model/parameters with just few clicks and output neat HTML reports(transperancy on data and models/analysis) with plots and data analysis report which can be helpful in reducing the hypothesis to insights cycle time in a ML experiment

Altough a lot of cloud providers provide this option and sometimes it may not be worth using it, and even costlier. While It's simpler over CLI.

Currently we can run linear regression & Binary classification from command line. It provides interactive terminal to configure your machine learning pipelines and preprocessing steps.

The analysis part run on a jupyter notebook which can be changed as required easily and we can still use the existing framework to provide the reports and CLI configurations.

Linear Regression

Regression Module is a supervised machine learning module that is used for estimating the relationships between a dependent variable (often called the ‘outcome variable’, or ‘target’) and one or more independent variables (often called ‘features’, ‘predictors’, or ‘covariates’). The objective of regression is to predict continuous values such as predicting sales amount, predicting quantity, predicting temperature etc. This supports several pre-processing features that prepare the data for modeling through CLI just by clicking. It has over 25 ready-to-use algorithms and several plots to analyze the performance of trained models.

Option to run on default configuration (docs 🔜).

Provides preprocessing configuration for EDA & making data ready.

Supports comparing 25 Linear regression results based on below

R2
MAE
MAPE
RMSE metrics and provides the best model, but still users have an option to override and run their model of interest.

Option for auto hyperparameter tuning based on random grid search.

Creates a details HTML report with :

residual plots
Feature importance plot
Prediction Error plot
Learning Curve plot
Cooks Distance Plot
Validation Curve Plot

SHAP plots for SHapley Additive exPlanations

Pickling model for re-use.

Binary classification

Classification Module is a supervised machine learning module which is used for classifying elements into groups. The goal is to predict the categorical class labels which are discrete and unordered. Some common use cases include predicting customer default (Yes or No), predicting customer churn (customer will leave or stay), disease found (positive or negative). This module can be used for binary and provides several pre-processing features that prepare the data for modeling through CLI. It has over 18 ready-to-use algorithms and several plots to analyze the performance of trained models.

Option to run on customized preprocessing configurations (docs 🔜).

Provides preprocessing configuration for EDA & making data ready.

Runs 18 classification and comapres the

'Accuracy'
'AUC'
'Recall'
'Precision'
'F1'
'Kappa' merics and provides the best model, but still users have an option to override and run their model of interest.

Option for auto tune hyperparameters based on random grid search.

Creates a details HTML report with below plots

Area Under the Curve
Discrimination Threshold
Precision Recall Curve
Confusion Matrix
Class Prediction Error
Classification Report
Decision Boundary
Recursive Feature Selection
Learning Curve
Manifold Learning
Calibration Curve
Validation Curve
Dimension Learning
Feature Importance
Model Hyperparameter

SHAP plots for SHapley Additive exPlanations

Pickling model for re-use.

Sample demos as of (30thMay2020) - View here>

Creating Linear regression with default configuration on boston dataset

Creating Linear regression model with customized preprocessing configurations

Creating Binary Classification model with default preprocessing configurations on credit card dataset

Creating Binary Classification with default preprocessing configurations on credit card dataset

Install & run

git clone https://github.com/iamlmn/simpleML.git
cd simpleML
pip install -r requirements.txt
python3 auto_regression/main.py

TODOs and completed work :

Name		Name	Last commit message	Last commit date
Latest commit History 35 Commits
.ipynb_checkpoints		.ipynb_checkpoints
assets		assets
demos		demos
gists		gists
packages		packages
simpleML		simpleML
.Rhistory		.Rhistory
.config_ipynb		.config_ipynb
.gitignore		.gitignore
Dockerfile		Dockerfile
README.md		README.md
_config.yml		_config.yml
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

simpleML

Linear Regression

Binary classification

Sample demos as of (30thMay2020) - View here>

Creating Linear regression with default configuration on boston dataset

Creating Linear regression model with customized preprocessing configurations

Creating Binary Classification model with default preprocessing configurations on credit card dataset

Creating Binary Classification with default preprocessing configurations on credit card dataset

Install & run

Contributions and ideas are welcome.s

About

Releases

Packages

Languages

iamlmn/simpleML

Folders and files

Latest commit

History

Repository files navigation

simpleML

Linear Regression

Binary classification

Sample demos as of (30thMay2020) - View here>

Creating Linear regression with default configuration on boston dataset

Creating Linear regression model with customized preprocessing configurations

Creating Binary Classification model with default preprocessing configurations on credit card dataset

Creating Binary Classification with default preprocessing configurations on credit card dataset

Install & run

Contributions and ideas are welcome.s

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages