minhthangdang/SignLanguageRecognitionCNN


Sign Language Recognition using CNN

A convolutional neural network for sign language recognition using the Sign Language MNIST image dataset

Problem Statement

This is my own exercise, following on from Coursera's Convolutional Neural Networks course.

This is my second deep learning model for sign language recognition using the Sign Language MNIST image dataset. It differs from the first one (https://github.com/minhthangdang/SignLanguageRecognition) in that it uses a convolutional neural network.

The Dataset

The dataset is obtained from Kaggle (https://www.kaggle.com/datamunge/sign-language-mnist). The training set has 27,455 examples and the test set has 7,172 examples. Each example is a vector of 784 pixels (a flattened 28x28 image) with grayscale values from 0 to 255. There are 24 classes, one for each letter of American Sign Language except J and Z.
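As a rough illustration of the data layout described above, here is a minimal sketch that reshapes one flat 784-value example into a 28x28 grid and maps an integer label to a letter. The helper names are hypothetical and not taken from this repository. The label convention (A = 0 through Z = 25, with 9 and 25 unused) and the scaling of pixels to the 0-1 range are assumptions rather than something stated here.

```python
import string

IMAGE_SIDE = 28  # each example is a 28x28 image stored as 784 values


def label_to_letter(label):
    """Map an integer label to a letter, assuming A=0 ... Z=25 (assumption)."""
    letter = string.ascii_uppercase[label]
    if letter in ("J", "Z"):
        # J and Z are excluded from the dataset's 24 classes.
        raise ValueError(f"label {label} ({letter}) is not part of the dataset")
    return letter


def to_image(flat_pixels):
    """Reshape a flat 784-value vector into 28 rows, scaling 0-255 to 0.0-1.0."""
    if len(flat_pixels) != IMAGE_SIDE * IMAGE_SIDE:
        raise ValueError("expected 784 pixel values")
    scaled = [p / 255.0 for p in flat_pixels]  # assumed preprocessing step
    return [scaled[r * IMAGE_SIDE:(r + 1) * IMAGE_SIDE] for r in range(IMAGE_SIDE)]


# Synthetic pixel values for demonstration only, not real data.
flat = [i % 256 for i in range(IMAGE_SIDE * IMAGE_SIDE)]
image = to_image(flat)
print(len(image), len(image[0]))
print(label_to_letter(0), label_to_letter(24))
```

A CNN would normally consume the reshaped grid (with an extra channel dimension of size 1) rather than the flat vector.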

An illustration of the sign language is shown here (image courtesy of Kaggle):

Sign Language

Grayscale images with (0-255) pixel values:

Sign Language

One example in the MNIST dataset:

Sign Language

Convolutional Neural Network Architecture

My network architecture borrows ideas from the LeNet-5 model (http://yann.lecun.com/exdb/publis/pdf/lecun-01a.pdf), chosen because it is relatively simple and easy to train. The architecture is as follows:

CONV2D -> RELU -> MAXPOOL -> CONV2D -> RELU -> MAXPOOL -> FLATTEN -> FULLYCONNECTED -> FULLYCONNECTED -> FULLYCONNECTED

The architecture is depicted below:

Convolutional Network Architecture
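To make the layer sequence more concrete, the sketch below traces tensor shapes through a pipeline of the same form. The filter counts (6 and 16), the 5x5 kernels with no padding, the 2x2 pooling, and the 120 and 84 hidden units are assumptions borrowed from the classic LeNet-5 design. They are not necessarily the values used in this repository. Only the 28x28 input and the 24 output classes come from the text.

```python
def conv_output_size(size, kernel, stride=1, padding=0):
    """Spatial size after a convolution along one dimension."""
    return (size + 2 * padding - kernel) // stride + 1


def pool_output_size(size, window, stride=None):
    """Spatial size after pooling; stride defaults to the window size."""
    stride = stride or window
    return (size - window) // stride + 1


def trace_shapes(input_side=28, conv_filters=(6, 16), kernel=5, pool=2,
                 dense_units=(120, 84, 24)):
    """Return (layer name, shape) pairs for a LeNet-5-style stack.

    All defaults except input_side=28 and the final 24 units are
    hypothetical values chosen for illustration.
    """
    side, channels = input_side, 1
    shapes = [("INPUT", (side, side, channels))]
    for n_filters in conv_filters:
        side = conv_output_size(side, kernel)
        channels = n_filters
        shapes.append(("CONV2D+RELU", (side, side, channels)))
        side = pool_output_size(side, pool)
        shapes.append(("MAXPOOL", (side, side, channels)))
    shapes.append(("FLATTEN", (side * side * channels,)))
    for units in dense_units:
        shapes.append(("FULLYCONNECTED", (units,)))
    return shapes


for name, shape in trace_shapes():
    print(f"{name:15s} {shape}")
```

Under these assumed settings the 28x28x1 input shrinks to 4x4x16 before flattening, giving 256 features for the first fully connected layer. The last fully connected layer has one unit per class.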

The hyperparameter values are learning_rate = 0.0001, num_epochs = 30, minibatch_size = 64, and optimizer = AdamOptimizer.
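For a sense of what these settings imply, the following sketch splits the 27,455 training examples into shuffled minibatches of 64 and counts the resulting parameter updates. It assumes the final, smaller minibatch is kept and that each minibatch triggers exactly one optimizer step. The function name is hypothetical, and the actual training loop in this repository may differ.

```python
import random

NUM_TRAIN = 27455      # training examples, from the dataset description
MINIBATCH_SIZE = 64
NUM_EPOCHS = 30


def minibatch_indices(num_examples, minibatch_size, seed=0):
    """Shuffle example indices and split them into consecutive minibatches."""
    order = list(range(num_examples))
    random.Random(seed).shuffle(order)
    return [order[start:start + minibatch_size]
            for start in range(0, num_examples, minibatch_size)]


batches = minibatch_indices(NUM_TRAIN, MINIBATCH_SIZE)
updates_per_epoch = len(batches)
total_updates = updates_per_epoch * NUM_EPOCHS
print(updates_per_epoch, len(batches[-1]), total_updates)
```

Under these assumptions there are 429 minibatches per epoch (428 full ones and a final one of 63 examples), so 30 epochs correspond to 12,870 updates.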

The program is written in Python and TensorFlow 1.x.

The result is:

Cost Function Plot

Train Accuracy: 1.0
Test Accuracy: 0.89445066
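Accuracy figures like these are usually the fraction of examples whose highest-scoring class matches the true label. The sketch below shows that computation on made-up scores. It is an assumption that the numbers above were computed this way, and the function names are hypothetical.

```python
def argmax(values):
    """Index of the largest value in a list."""
    return max(range(len(values)), key=values.__getitem__)


def accuracy(scores, labels):
    """Fraction of rows whose highest-scoring class equals the true label."""
    predictions = [argmax(row) for row in scores]
    correct = sum(p == y for p, y in zip(predictions, labels))
    return correct / len(labels)


# Toy scores for 4 examples and 3 classes (illustrative only, not model output).
toy_scores = [
    [0.1, 0.7, 0.2],
    [0.8, 0.1, 0.1],
    [0.3, 0.3, 0.4],
    [0.2, 0.5, 0.3],
]
toy_labels = [1, 0, 2, 0]
print(accuracy(toy_scores, toy_labels))  # 0.75
```

Read this way, a test accuracy of 0.89445066 is consistent with roughly 6,415 of the 7,172 test images being classified correctly. That count is inferred from the arithmetic, not reported by the program.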

Even though the network is relatively simple, it reaches about 89.4% accuracy on the test set. Compared with my first deep learning model for the same task (https://github.com/minhthangdang/SignLanguageRecognition), this model is not only more accurate but also faster to train, needing far fewer epochs.

This is one of my repositories in a series of deep learning exercises. Please check out my other repositories to see more.

Should you have any questions, please contact me via LinkedIn: https://www.linkedin.com/in/minh-thang-dang/
