Skip to content

RubenBSB/deepOCR

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

49 Commits
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

deepOCR

This is a school project in deep learning I am currently working on.

It consists in building a handwritten text recognition system using a CNC-LSTM-CTC architecture.

I have planned to use a language model later to analyse recognized words and improve the accuracy.

This article was really helpful to understand the concept of Convolutional Recurrent Neural Network (CRNN).

Data

I am using the IAM Dataset which includes about 115,000 labelled images of English words from more than 1500 handwritten letters.

Use

You have to register to download the dataset. Once it is done, unzip it and place the 'words' directory and 'words.txt' file in the project repository as following :

deepOCR repository
β”œβ”€β”€ data
β”‚   β”œβ”€β”€ words
β”‚   β”‚   β”œβ”€β”€ a01
β”‚   β”‚   β”œβ”€β”€ a02
β”‚   β”‚   β”œβ”€β”€ ...
β”‚   β”œβ”€β”€ words.txt
β”œβ”€β”€ src
β”œβ”€β”€ ...

Releases

No releases published

Packages

No packages published