Optical Character Recognition

Digit Recognition

This notebook contains the source code for my submissions to a Kaggle competition. The goal of the competition is to classify images from the MNIST handwritten digit database. The solution reached 99.928% accuracy and placed 60th (top 3%).

I started with an Exploratory Data Analysis, which showed that the data was not noisy but that not all pixels in the images were useful, so some Dimensionality Reduction was possible. I then built a simple Convolutional Neural Network (CNN) using Keras. The steps I followed (as described in the Jupyter Notebook) are normalization, reshaping, data augmentation, and training with an Adam optimizer and a ReduceLROnPlateau callback.
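The normalization and reshaping steps can be sketched roughly as follows. This is a minimal illustration and not the notebook's actual code: it assumes each image arrives as a flat row of 784 grayscale values in the range 0 to 255, and the function names `preprocess` and `constant_pixel_mask` are hypothetical. The mask is one simple way to spot pixels that carry no information, in the spirit of the observation made during the Exploratory Data Analysis.

```python
import numpy as np


def preprocess(pixels: np.ndarray) -> np.ndarray:
    """Scale flat 784-pixel rows to [0, 1] and reshape to (n, 28, 28, 1)."""
    scaled = pixels.astype("float32") / 255.0
    # A trailing channel axis is what a 2D convolutional layer expects.
    return scaled.reshape(-1, 28, 28, 1)


def constant_pixel_mask(pixels: np.ndarray) -> np.ndarray:
    """Return a boolean mask of pixel columns whose value never changes."""
    return pixels.max(axis=0) == pixels.min(axis=0)


# Synthetic stand-in for the competition data (assumption: 784 values per image).
rng = np.random.default_rng(0)
fake_pixels = rng.integers(0, 256, size=(4, 784), dtype=np.uint8)
fake_pixels[:, :28] = 0  # pretend the top row of every image is always blank

images = preprocess(fake_pixels)
mask = constant_pixel_mask(fake_pixels)
print(images.shape, int(mask.sum()))
```

Pixels flagged by the mask are identical in every image, so they cannot help a classifier distinguish digits. With a CNN the full 28x28 grid is usually kept anyway, since convolutions rely on the spatial layout.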

CAPTCHA

This directory is an attempt at recognizing CAPTCHA (Completely Automated Public Turing test to tell Computers and Humans Apart) images. CAPTCHAs were built in 1997 as a way to identify and block bots (in order to prevent spam, DDoS attacks, etc.). They have since been replaced by reCAPTCHA because they can be broken using Artificial Intelligence, as we will see.

To make classifying the images as hard as possible for computers, CAPTCHA creators distort the letters: the characters are noisy and extra lines cross the words. To solve this, I built a conventional CNN with a twist: at the deeper level of the model, I stacked 5 branching convolutional layers so that each branch is trained to classify a single letter.
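Since each branch is responsible for one character position, the training label has to be split into one target per branch, and the branch outputs have to be recombined into a string at prediction time. The sketch below illustrates that bookkeeping only, not the network itself. It assumes 5-character CAPTCHAs drawn from lowercase letters and digits; the alphabet and the names `encode_label` and `decode_prediction` are hypothetical and may differ from what the repository uses.

```python
import string

# Assumptions for illustration: 5 characters per CAPTCHA, lowercase letters and digits.
ALPHABET = string.ascii_lowercase + string.digits
CAPTCHA_LENGTH = 5


def encode_label(text):
    """Turn a CAPTCHA string into one one-hot vector per character position."""
    if len(text) != CAPTCHA_LENGTH:
        raise ValueError(f"expected {CAPTCHA_LENGTH} characters, got {len(text)}")
    targets = []
    for char in text:
        one_hot = [0] * len(ALPHABET)
        one_hot[ALPHABET.index(char)] = 1
        targets.append(one_hot)
    return targets


def decode_prediction(branch_outputs):
    """Pick the highest-scoring character from each branch and join them."""
    chars = []
    for scores in branch_outputs:
        best = max(range(len(scores)), key=scores.__getitem__)
        chars.append(ALPHABET[best])
    return "".join(chars)


targets = encode_label("a7k2m")      # one target vector per branch
decoded = decode_prediction(targets)  # perfect scores decode back to the label
print(len(targets), decoded)
```

In a multi-output model, each of the 5 target vectors would be paired with the loss of the corresponding branch, so an error on one character does not directly penalize the branches that predicted the other positions correctly.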
