Number Classifier

PyTorch program for classifying handwritten digits 0-9 with 97% accuracy.

This project trains a convolutional neural network to classify handwritten digits 0-9 with 97% accuracy. Users can see the model in action when executing run.py. This file displays a GUI that allows users to draw their own numbers. When the user finishes drawing the number, the GUI displays a bar chart of the neural network’s predicted output probabilities (as seen above).
My goal for this project was to get comfortable with the PyTorch framework, which I used to build, train, and test the machine learning model. I am using this framework for my undergraduate research, but I never used PyTorch before starting this project.
I trained the model using an Adam Optimizer and a cross-entropy loss function. I also used a learning rate of 0.000005 and a weight decay of 0.0009 over 13 epochs after experimentation. I used PyTorch's deep learning tutorial as a framework for my convolutional neural network. The model guesses with 97% accuracy on the testing set after training.
I collected the data for this project’s training and testing set by executing add_data.py. This file displays a GUI that allows a user to draw a number and add its corresponding label. After the user finishes drawing, the program saves the image and label to either trainset.hdf5 or testset.hdf5. The training and testing sets contain 800 and 200 images, respectively.
The project saves the images as 300x300 pixel grayscale images. I computed a bounding box for the unedited drawn digit from the user and created a new cropped image containing only the contents of the bounding box. I then rescaled this new image to fit within a 200x200 pixel box and generated the final output by pasting this rescaled image into the center of a 300x300 pixel all-black image.
This project taught me the importance of collecting data for a machine learning model in a consistent manner throughout a project. This project initially used training and testing sets from "The MNIST Database of Handwritten Digits" by Yann LeCun, Corinna Cortes, and Christopher H.C. Burges. My model performed well on the MNIST testing set, with about 91% accuracy. However, the model's performance was poor when using the digits I had drawn in the run.py GUI. I realized there was likely a significant difference between the method I used to save the handwritten digits in run.py and the method the MNIST database used. Subsequently, I decided to create my own training and testing sets and saw the performance of my model with handwritten digits in run.py increase dramatically.

Name		Name	Last commit message	Last commit date
Latest commit History 25 Commits
__pycache__		__pycache__
data-set		data-set
README.md		README.md
favicon.ico		favicon.ico
model.pth		model.pth
neural_net.py		neural_net.py
run.py		run.py
test_model.py		test_model.py
train_model.py		train_model.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Number Classifier

About

Uh oh!

Releases

Packages

Languages

Jakeahuber/number-classifier

Folders and files

Latest commit

History

Repository files navigation

Number Classifier

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages