Omer-alt/Basic_ML_Algorithm

Implementation in Python of basic machine learning algorithms: linear and logistic regression, PCA, neural networks, Transformers, etc. 😜

This repository presents the basics of machine learning, particularly regression.

I. Linear regression

The following graphs show the results of minimizing the error function as the hyperparameters vary.

1. Gradient descent

gradient_descent

  • fig_1, fig_2, fig_3, fig_6 share the same learning rate, and their plots show that training over a larger number of epochs quickly reduces the loss function.
  • fig_1, fig_4 (and fig_5, fig_6) share the same number of epochs, and we see that learning is better with a greater learning rate (the update loop is sketched below).
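For reference, a minimal sketch of full-batch gradient descent on a 1-D linear model (assuming y = w*x + b with an MSE loss; the function and variable names are illustrative, not the repo's exact API):

  import numpy as np

  def gradient_descent(X, y, lr=0.05, n_epochs=1000):
      # Fit y ~ w*X + b by full-batch gradient descent on the MSE loss.
      w, b = 0.0, 0.0
      n = len(X)
      for _ in range(n_epochs):
          error = (w * X + b) - y
          # Gradients of MSE = (1/n) * sum(error^2) w.r.t. w and b.
          w -= lr * (2.0 / n) * np.sum(error * X)
          b -= lr * (2.0 / n) * np.sum(error)
      return w, b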

2. Stochastic gradient descent

stochastic_gradient_descent

  • fig_1, fig_4 share the same number of epochs, and we see that learning is better with a greater learning rate (the stochastic update loop is sketched below).
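The stochastic variant takes one gradient step per sample instead of per epoch; a hedged sketch under the same assumed 1-D linear model:

  import numpy as np

  def sgd(X, y, lr=0.05, n_epochs=100):
      # One update per sample, visiting samples in a new random order each epoch.
      w, b = 0.0, 0.0
      for _ in range(n_epochs):
          for i in np.random.permutation(len(X)):
              error = (w * X[i] + b) - y[i]
              w -= lr * 2.0 * error * X[i]
              b -= lr * 2.0 * error
      return w, b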

3. Stochastic gradient descent with momentum

With a fixed coefficient beta = 0.99 for computing the momentum: gradient_descent_with_momentum

The performance of stochastic gradient descent with a momentum of 0.99 is poor compared to the two previous optimizers.

Changing beta to 0.44 before computing the momentum: gradient_descent_with_momentum

If we reduce beta to 0.44, we get better convergence.
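The momentum term is an exponentially decaying average of past gradients, so a large beta (0.99) gives old gradients a long memory and can overshoot, while a smaller beta (0.44) forgets faster. A sketch of the update under the same illustrative setup as above:

  import numpy as np

  def sgd_momentum(X, y, lr=0.05, beta=0.44, n_epochs=100):
      w, b = 0.0, 0.0
      v_w, v_b = 0.0, 0.0  # velocity terms, initialised to zero
      for _ in range(n_epochs):
          for i in np.random.permutation(len(X)):
              error = (w * X[i] + b) - y[i]
              # Accumulate gradients into the velocity with decay factor beta.
              v_w = beta * v_w + 2.0 * error * X[i]
              v_b = beta * v_b + 2.0 * error
              w -= lr * v_w
              b -= lr * v_b
      return w, b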

4. Mini-batch gradient descent

With a fixed batch size of 3: minibatch_gradient_descent_with_3_as_batch

With a batch size of 1: minibatch_gradient_descent_with_1_as_batch

  • Mini-batch with a batch size of 1 does better than with 3 here, because the data is simple; try it on a more complicated dataset (the batching loop is sketched below).
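Mini-batch gradient descent sits between the two previous methods: the data is shuffled, split into batches, and one gradient step is taken per batch. A sketch (batch_size=1 reduces to SGD, batch_size=len(X) to full-batch GD; names are illustrative):

  import numpy as np

  def minibatch_gd(X, y, lr=0.05, batch_size=3, n_epochs=100):
      w, b = 0.0, 0.0
      n = len(X)
      for _ in range(n_epochs):
          idx = np.random.permutation(n)
          for start in range(0, n, batch_size):
              batch = idx[start:start + batch_size]
              error = (w * X[batch] + b) - y[batch]
              # Average the gradient over the current mini-batch.
              w -= lr * (2.0 / len(batch)) * np.sum(error * X[batch])
              b -= lr * (2.0 / len(batch)) * np.sum(error)
      return w, b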

5. Adam

With the usual fixed constants (beta1 = 0.9, beta2 = 0.999, epsilon = 1e-8): adam_gradient_descent

In fig_6 we observe rapid convergence, then oscillation around the global minimum.
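Adam combines a decaying mean of the gradients (first moment, controlled by beta1) with a decaying mean of their squares (second moment, beta2), plus bias correction. A sketch on the same illustrative 1-D problem:

  import numpy as np

  def adam(X, y, lr=0.05, beta1=0.9, beta2=0.999, eps=1e-8, n_epochs=1000):
      w, b = 0.0, 0.0
      m = np.zeros(2)  # first-moment estimate for (w, b)
      v = np.zeros(2)  # second-moment estimate for (w, b)
      n = len(X)
      for t in range(1, n_epochs + 1):
          error = (w * X + b) - y
          g = np.array([(2.0 / n) * np.sum(error * X),
                        (2.0 / n) * np.sum(error)])
          m = beta1 * m + (1 - beta1) * g
          v = beta2 * v + (1 - beta2) * g ** 2
          m_hat = m / (1 - beta1 ** t)  # bias correction
          v_hat = v / (1 - beta2 ** t)
          w -= lr * m_hat[0] / (np.sqrt(v_hat[0]) + eps)
          b -= lr * m_hat[1] / (np.sqrt(v_hat[1]) + eps)
      return w, b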

II. Logistic regression

logistic_gradient_descent

From all of these sets of plots we find it appropriate to choose the following hyperparameters: $$lr = 0.05 \qquad n_{\text{epochs}} = 5000$$
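With those hyperparameters, a minimal logistic-regression sketch (assuming an (n, d) feature matrix X and labels y in {0, 1}; names are illustrative, not the repo's exact API):

  import numpy as np

  def sigmoid(z):
      return 1.0 / (1.0 + np.exp(-z))

  def logistic_regression(X, y, lr=0.05, n_epochs=5000):
      n, d = X.shape
      w, b = np.zeros(d), 0.0
      for _ in range(n_epochs):
          p = sigmoid(X @ w + b)  # predicted probabilities
          # Gradient of the mean binary cross-entropy loss.
          w -= lr * (X.T @ (p - y)) / n
          b -= lr * np.mean(p - y)
      return w, b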

Optimizations

What about optimizations in this code? You can notice the usage of:

  • the OOP paradigm
  • the single responsibility principle

III. Neural network for Classification

1 - Problem to solve

In this section, the task is to classify data distributed like the XOR logic gate (see the data plot below).

xor_data_set

2 - Resolution approach

To solve this classification problem, we propose a neural network with a single hidden layer, as follows:

Neural_network

The activation function used is the sigmoid, and the loss is minimized with gradient descent. Below are the resulting losses and decision boundary (for the training and test sets), followed by a sketch of such a network.

Losses: Train_test_loss

Decision boundary: Train_test_loss
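A self-contained sketch of such a one-hidden-layer network trained on XOR (the layer sizes, learning rate and MSE loss here are assumptions for illustration, not necessarily the repo's choices):

  import numpy as np

  def sigmoid(z):
      return 1.0 / (1.0 + np.exp(-z))

  def train_xor(X, y, n_hidden=2, lr=0.5, n_epochs=10000, seed=0):
      rng = np.random.default_rng(seed)
      W1 = rng.normal(size=(X.shape[1], n_hidden)); b1 = np.zeros(n_hidden)
      W2 = rng.normal(size=(n_hidden, 1)); b2 = np.zeros(1)
      for _ in range(n_epochs):
          # Forward pass: sigmoid activations on both layers.
          h = sigmoid(X @ W1 + b1)
          out = sigmoid(h @ W2 + b2)
          # Backward pass: chain rule through both sigmoid layers (MSE loss).
          d_out = (out - y) * out * (1 - out)
          d_h = (d_out @ W2.T) * h * (1 - h)
          W2 -= lr * h.T @ d_out; b2 -= lr * d_out.sum(axis=0)
          W1 -= lr * X.T @ d_h;   b1 -= lr * d_h.sum(axis=0)
      return W1, b1, W2, b2

  # XOR data: X = [[0,0],[0,1],[1,0],[1,1]], y = [[0],[1],[1],[0]]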

IV. Transformer

The Transformer model architecture: Transformer

Main paper: Attention Is All You Need, https://arxiv.org/abs/1706.03762

In this section, the task is to implement the essential concepts present in the Transformer architecture shown above.

1 - Attention

- Self-Attention

- Cross-Attention

- Layer Normalization

- Positional Encoding

- Multi-Head Attention

Why Multi-Head Attention? It allows the model to jointly attend to information from different representation subspaces at different positions. A compact sketch is given below.
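A compact PyTorch sketch of multi-head scaled dot-product attention (dimensions and names are illustrative and may differ from the repo's implementation; self- vs cross-attention only changes where q, k, v come from):

  import torch
  import torch.nn as nn

  class MultiHeadAttention(nn.Module):
      def __init__(self, d_model=512, n_heads=8):
          super().__init__()
          assert d_model % n_heads == 0
          self.n_heads, self.d_head = n_heads, d_model // n_heads
          self.w_q = nn.Linear(d_model, d_model)
          self.w_k = nn.Linear(d_model, d_model)
          self.w_v = nn.Linear(d_model, d_model)
          self.w_o = nn.Linear(d_model, d_model)

      def forward(self, q, k, v):
          # Self-attention: q = k = v. Cross-attention: q comes from the
          # decoder, while k and v come from the encoder output.
          B, T, _ = q.shape
          def split(x):  # (B, T, d_model) -> (B, n_heads, T, d_head)
              return x.view(B, -1, self.n_heads, self.d_head).transpose(1, 2)
          Q, K, V = split(self.w_q(q)), split(self.w_k(k)), split(self.w_v(v))
          scores = Q @ K.transpose(-2, -1) / self.d_head ** 0.5
          attn = torch.softmax(scores, dim=-1)  # one distribution per head
          out = (attn @ V).transpose(1, 2).reshape(B, T, -1)
          return self.w_o(out)  # concatenate heads and project back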

Tech Stack

Language: Python

Framework: PyTorch

Packages: NumPy, scikit-learn, Matplotlib, pandas, ipywidgets

Run Locally

Clone the project

  git clone https://github.com/Omer-alt/Basic_ML_Algorithm.git

Go to the project directory

  cd Basic_ML_Algorithm

Run the main file

  python main.py

Authors

  • @Omer-alt

License

MIT
