Fully Connected Neural Network

MLP (fully connected neural network) from scratch using numpy on the MNIST dataset (Digit recognition).

General

Background

Implementation of a neural network for digit classification, on the MNIST dataset, which takes as an input a 28*28 grayscale image (784 floating point values of pixels between 0-255).

Net Structure

The network has one hidden layer in size 128 (default) and it performs multiple epochs (20 by default) and trains the model by minimizing the Negative Log Likelihood (NLL) with the Sigmoid and Softmax activation functions.

During learning, the network verifies its accuracy on an independent set of data (about 10% of the training set) on which learning is not performed. This group is called a validation set. After all the epochs, the network saves its best condition, the weights that resulted the maximum accuracy on the validation set, to prevent overfitting.

Finally, the network exports a graph of the accuracy on the validation and the training sets, by the number of epochs, and verifies the accuracy on the testing set.

Running Instructions

The program gets several arguments, and this can be seen with the -h flag when running. A total of about seven arguments can be sent:

flag -train_x TRAIN_X_PATH for the training images file path (file that contains 784 values in each row).
flag -train_y TRAIN_Y_PATH for the training labels file path (file that contains one value between 0-9 in each row and has the same rows number as the train_x file).
flag -test_x TEST_X_PATH for the testing images file path (file that contains 784 values in each row).
flag -test_y TEST_Y_PATH for the testing labels file path (file that contains one value between 0-9 in each row and has the same rows number as the train_x file).
flag -lr LEARNING_RATE for the learning rate that will be used while training the model (default value = 0.1).
flag -e EPOCHS for the number of epochs (default value = 20).
flag -size HIDDEN_LAYER_SIZE for the hidden layer size (default value = 128).

running example:

	$ python3 neural_net.py -train_x train_x -train_y train_y -test_x test_x -test_y test_y -lr 0.03

Note that for using the dataset given in this repo, you need to unzip the dataset.zip folder (for example, using 7-zip)

Dependencies

Installation

Open the terminal.

Clone the project by:

$ git clone https://github.com/tomershay100/Fully-Connected-Neural-Network.git

Enter the project folder:
```
$ cd Fully-Connected-Neural-Network
```
unzip the dataset.zip:
```
$ unzip dataset.zip
```
Run the neural_net.py file with your favorite arguments:
```
$ python3 neural_net.py -lr 0.03 -e 10 -size 100
```

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
README.md		README.md
dataset.zip		dataset.zip
neural_net.py		neural_net.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Fully Connected Neural Network

General

Background

Net Structure

Running Instructions

Dependencies

Installation

About

Releases

Packages

Languages

tomershay100/Fully-Connected-Neural-Network

Folders and files

Latest commit

History

Repository files navigation

Fully Connected Neural Network

General

Background

Net Structure

Running Instructions

Dependencies

Installation

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages