OCR-Envision

Training computers to read and understand handwritten English characters and digits.

Mentors
Nishant Nayak
Harshwardan Singh Rathore

Mentees
Sana Azmiya
Madhav Kumar
Darshan S
Akhilesh P

NOTE: The file OCR_Envision.ipynb conatins all the code along with documentation and outputs/images.

Datasets Used

MNIST Dataset
A-Z Handwritten Dataset
We compile these 2 datasets into a single dataset and later uploaded to Kaggle, which might prove useful to others with similar projects.

Models Used
The model used contains 3 Convolutional layers followed by 4 Dense layers. In the first layer, there are 32 filters of size 3x3 with ReLU activation function with same padding followed by MaxPool layer of 2x2. In the second layer, the number of filters is increased to 64 with everything else is same as layer 1. In the third layer, the number of filters is further increased to 128. These are followed by 4 dense layers, the first three of which have ReLU activation function and the last one having softmax function which actually gives the probability of the image being in different classes of numbers and alphabets.

Upon training the model for 8 epochs with a batch size of 128, using the Adam optimizer to optimize the learning rate, we were able to obtain 99.18% accuracy on the training data and a validation accuracy of 98.51% accuracy. The values of the loss function and accuracy vs number of epochs are plotted below:

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
images		images
OCR_Envision.ipynb		OCR_Envision.ipynb
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

images

images

OCR_Envision.ipynb

OCR_Envision.ipynb

README.md

README.md

Repository files navigation

OCR-Envision

About

Releases

Packages

Languages

IEEE-NITK/OCR-Envision

Folders and files

Latest commit

History

Repository files navigation

OCR-Envision

About

Resources

Stars

Watchers

Forks

Languages