A Neural Network, From Scratch

Hello!

The purpose of this project, was to better understand machine learning, and neural networks in general.

So, to achieve this, I've coded a neural network (specifically, a multi-layer perceptron), from scratch in Python.

To help keep the code tidy and presentable, I've created a class for the neural network, and made methods for all of the common calculations (such as calculating gradient, updating weights/biases, calculating output, etc).

I've also gone through the underlying mathematics of a neural network, which mainly involves multivariable chain rule, matrix calculus, and vectorisation.

Once the neural network was created, it was applied to the MNIST problem, and achieved an accuracy of about 95%, after 10 epochs of training (took around 1 minute on my bog standard computer).

Thus, overall, there are definitely improvements that could be made to the network that I've built, as there are many others out there that are more accurate/efficient/elaborate, but since the main goal was just to improve my understanding of simple neural networks, by rigorously going through the underlying mathematics, I've decided to leave it as is, and move onto other projects.

How the code works:

"main.py" is a simple script file, that first imports "mnist_loader.py" and "neuralnet.py". It then creates a neural network using "neuralnet.py", which is then trained, and the accuracy of the network is printed out at the end of every epoch of training, to help demonstrate that the network is learning.

"mnist_loader.py", is a simple program, that loads and transforms the MNIST data.

"neuralnet.py", is where the neural network class has been created, as well as all the methods needed for training and evaulation.

Walkthrough of underlying mathematics

An explanation/walkthrough of the underlying mathematics can be found here here:
https://github.com/peterw-github/NeuralNetwork_FromScratch/blob/main/Explanations/Math%20Explanation.pdf

Credit

Credit to Credit to 3Blue1Brown, for the intuitive explanation of a Neural Network, found in the playlist here:
https://www.youtube.com/watch?v=aircAruvnKk

And to Michael Nielsen, for a more indepth explanation on the underlying mathematics of a Neural Network, and recommendations for future improvements, such as changing the cost function (currently mean-squared-error method) to a logarithm based one (cross-entropy), as well as exploring other activation functions, such as tanh, softmax, and rectified linear unit (ReLU).

Dataset was provided via Kaggle, here:
https://www.kaggle.com/datasets/oddrationale/mnist-in-csv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

A Neural Network, From Scratch

How the code works:

Walkthrough of underlying mathematics

Credit

About

Uh oh!

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
Data		Data
Explanations		Explanations
README.md		README.md
main.py		main.py
mnist_loader.py		mnist_loader.py
neuralnet.py		neuralnet.py

peterw-github/NeuralNetwork_FromScratch

Folders and files

Latest commit

History

Repository files navigation

A Neural Network, From Scratch

How the code works:

Walkthrough of underlying mathematics

Credit

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages