SUPERVISED-ML

Supervised machine learning algorithms explained with code


Lasso (L1) Regression | code
Lasso regression solves the problem of regularization and induces sparsity in a dense feature space. It relies on the L1-norm, which is convex but non-differentiable at zero. Using algorithms proposed by researchers in recent times, we can transform the L1-norm penalty into a tractable, differentiable form; the solution to this transformed problem is the backbone of the lasso.
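
Since the tractable reformulation of the L1 penalty is the central point above, here is a minimal NumPy sketch of one standard approach, coordinate descent with the soft-thresholding operator; the function names and the choice of algorithm are illustrative assumptions, not necessarily what the linked code does.

```python
import numpy as np

def soft_threshold(rho, lam):
    # Soft-thresholding: the closed-form minimizer of the one-dimensional
    # lasso subproblem, which handles the non-differentiability at zero.
    return np.sign(rho) * max(abs(rho) - lam, 0.0)

def lasso_coordinate_descent(X, y, lam=0.1, n_iters=100):
    # Cyclic coordinate descent over the weights.
    n, d = X.shape
    w = np.zeros(d)
    for _ in range(n_iters):
        for j in range(d):
            r = y - X @ w + X[:, j] * w[j]          # residual excluding feature j
            rho = X[:, j] @ r
            w[j] = soft_threshold(rho, lam * n) / (X[:, j] @ X[:, j])
    return w
```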

Ridge (L2) Regression | code
Ridge regression is best known for its ability to reduce the complexity of a model using the L2-norm (or Euclidean norm). It does this by shrinking the coefficients of the model towards zero. Ridge regression works well for dense datasets, but because it never drives coefficients exactly to zero it learns a poor hypothesis for sparse problems.
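
For reference, a minimal NumPy sketch of the closed-form ridge solution; the function name is a placeholder rather than the repository's API.

```python
import numpy as np

def ridge_closed_form(X, y, lam=1.0):
    # Ridge solution of the regularized normal equations:
    # w = (X^T X + lam * I)^{-1} X^T y
    d = X.shape[1]
    return np.linalg.solve(X.T @ X + lam * np.eye(d), X.T @ y)
```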

Kernel Ridge Regression | code
Kernel regression is a machine learning procedure for projecting a low-dimensional feature space into an N-dimensional space. We achieve this with kernels that satisfy Mercer's theorem for inner products. Kernel ridge regression projects the low-dimensional feature space into a higher-dimensional space using the representer theorem, which expresses the solution in terms of the training points (as with support vectors).
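
A minimal sketch of kernel ridge regression in its dual form, assuming an RBF kernel (any Mercer kernel works); per the representer theorem we solve for one coefficient per training point. Names and the kernel choice are illustrative.

```python
import numpy as np

def rbf_kernel(A, B, gamma=1.0):
    # Gram matrix of the RBF kernel, a common Mercer kernel.
    sq = np.sum(A**2, axis=1)[:, None] + np.sum(B**2, axis=1)[None, :] - 2 * A @ B.T
    return np.exp(-gamma * sq)

def kernel_ridge_fit(X, y, lam=1.0, gamma=1.0):
    # Representer theorem: the solution is a combination of the training
    # points, so we solve (K + lam I) alpha = y for the dual coefficients.
    K = rbf_kernel(X, X, gamma)
    return np.linalg.solve(K + lam * np.eye(len(X)), y)

def kernel_ridge_predict(X_train, alpha, X_new, gamma=1.0):
    return rbf_kernel(X_new, X_train, gamma) @ alpha
```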

Logistic Regression | code
Here we present the solution to logistic regression using the cross-entropy loss function. This is a binary classifier for supervised classification problems, and we solve the optimization problem derived from the maximum likelihood using gradient ascent.
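
A minimal sketch of gradient ascent on the log-likelihood (equivalently, gradient descent on the cross-entropy loss); labels are assumed to be in {0, 1}, and the function name is a placeholder.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def logistic_regression_gradient_ascent(X, y, lr=0.1, n_iters=1000):
    # Maximize the log-likelihood by moving along its gradient.
    w = np.zeros(X.shape[1])
    for _ in range(n_iters):
        grad = X.T @ (y - sigmoid(X @ w)) / len(y)   # gradient of the log-likelihood
        w += lr * grad
    return w
```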

Kernel Logistic Regression | code
Logistic regression, as implemented here, is a linear binary classifier. We therefore extend this version to handle non-linear data by kernelizing it. The implementation here uses gradient descent and Iteratively Reweighted Least Squares (IRLS) to solve the kernelized logistic regression.
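
A minimal sketch of the kernelized version via gradient ascent on the dual coefficients; the IRLS route mentioned above is not shown. The Gram matrix K is assumed precomputed (e.g. with the RBF kernel sketched earlier), and labels are assumed to be in {0, 1}.

```python
import numpy as np

def kernel_logistic_gradient_ascent(K, y, lr=0.1, n_iters=500):
    # K is the (n x n) training Gram matrix.
    # The decision function is f(x) = sum_i alpha_i * k(x_i, x).
    alpha = np.zeros(len(y))
    for _ in range(n_iters):
        p = 1.0 / (1.0 + np.exp(-(K @ alpha)))
        alpha += lr * K @ (y - p) / len(y)   # ascent on the kernelized log-likelihood
    return alpha
```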

Perceptron | Cross-Entropy | code
The single-layer perceptron is the foundation upon which Artificial Neural Networks (ANNs) are built. We solve the optimization problem of the single-layer perceptron using the perceptron convergence algorithm.
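
A minimal sketch of the perceptron convergence algorithm mentioned above, assuming labels in {-1, +1}; this is an illustrative implementation, not necessarily the one in the linked code.

```python
import numpy as np

def perceptron_convergence(X, y, lr=1.0, n_epochs=100):
    # Update the weights only on misclassified points; for linearly
    # separable data this converges after a finite number of updates.
    w = np.zeros(X.shape[1])
    b = 0.0
    for _ in range(n_epochs):
        for xi, yi in zip(X, y):
            if yi * (xi @ w + b) <= 0:     # misclassified point
                w += lr * yi * xi
                b += lr * yi
    return w, b
```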

Perceptron | Convergence algo | code
The loss function here is solved using the stochastic gradient descent algorithm. Note that the perceptron convergence algorithm is also a variant of stochastic gradient descent, except that it uses the step activation function.
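
A minimal sketch of the stochastic-gradient variant, assuming a sigmoid activation trained on the cross-entropy loss with labels in {0, 1}; replacing the sigmoid with a step activation recovers the perceptron convergence update sketched above.

```python
import numpy as np

def perceptron_sgd_cross_entropy(X, y, lr=0.1, n_epochs=100):
    # Per-sample stochastic gradient descent on the cross-entropy loss.
    w = np.zeros(X.shape[1])
    b = 0.0
    for _ in range(n_epochs):
        for i in np.random.permutation(len(y)):
            p = 1.0 / (1.0 + np.exp(-(X[i] @ w + b)))
            err = p - y[i]                 # dL/d(pre-activation)
            w -= lr * err * X[i]
            b -= lr * err
    return w, b
```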

Linear Regression (Closed form) | Gradient descent | Stochastic GD | code
Linear regression is a foundational statistical model for predictive learning. Here we solve the regression problem by deriving its closed-form solution. Since the closed-form solution is computationally expensive for large datasets, especially because inverting the $d \times d$ matrix $X^TX$ costs $O(d^{3})$ for $d$ features, researchers developed the stochastic gradient descent algorithm for regression.
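
A minimal sketch of both solvers discussed above, the normal-equation (closed-form) solution and stochastic gradient descent; the function names are placeholders.

```python
import numpy as np

def linreg_closed_form(X, y):
    # Normal equations: w = (X^T X)^{-1} X^T y.
    return np.linalg.solve(X.T @ X, X.T @ y)

def linreg_sgd(X, y, lr=0.01, n_epochs=50):
    # Per-sample stochastic gradient descent on the squared error,
    # avoiding the cubic cost of the matrix solve above.
    w = np.zeros(X.shape[1])
    for _ in range(n_epochs):
        for i in np.random.permutation(len(y)):
            w -= lr * (X[i] @ w - y[i]) * X[i]
    return w
```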

Support Vector Machine (SVM) | code
The maximum-margin classifier, as it is commonly called, is a powerful discriminative classification algorithm. Its objective is to find the data points that best separate the data into distinct classes. Using these data points (the support vectors), the SVM algorithm establishes the optimum margin for classification. SVM can also accurately classify non-linear data using the kernel trick.
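
A minimal linear SVM sketch using sub-gradient descent on the regularized hinge loss (a Pegasos-style solver), assuming labels in {-1, +1}; the kernel trick and the dual formulation are omitted here, so this is an illustrative assumption rather than the linked code's method.

```python
import numpy as np

def linear_svm_sgd(X, y, lam=0.01, n_epochs=100):
    # Sub-gradient descent on  lam/2 * ||w||^2 + mean(hinge loss).
    w = np.zeros(X.shape[1])
    t = 0
    for _ in range(n_epochs):
        for i in np.random.permutation(len(y)):
            t += 1
            lr = 1.0 / (lam * t)                   # decaying step size
            if y[i] * (X[i] @ w) < 1:              # point violates the margin
                w = (1 - lr * lam) * w + lr * y[i] * X[i]
            else:
                w = (1 - lr * lam) * w
    return w
```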

K-Nearest Neighbors (KNN) | code
K-Nearest Neighbors is a non-parametric and lazy learning algorithm, meaning it does not make any assumptions about the underlying data distribution and does not build an explicit model during the training phase. The prediction for a new data point is made based on the majority class of its k nearest neighbors in the feature space. The distance metric (e.g., Euclidean, Lorentzian, Manhattan, Minkowski) is used to measure the similarity between data points. The value of k is a hyperparameter that determines the number of neighbors to consider; a small k may lead to a noisy decision boundary, while a large k may cause the algorithm to lose sensitivity to local patterns.
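
A minimal sketch of the prediction step using the Euclidean distance, assuming NumPy arrays for the training data; swapping the distance function gives the Manhattan, Minkowski, or Lorentzian variants mentioned above.

```python
import numpy as np
from collections import Counter

def knn_predict(X_train, y_train, x_new, k=5):
    # Distance to every training point, then a majority vote
    # among the k nearest neighbours.
    dists = np.linalg.norm(X_train - x_new, axis=1)
    nearest = np.argsort(dists)[:k]
    return Counter(y_train[nearest]).most_common(1)[0][0]
```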
