CNN_LSTM_CTC_Tensorflow

The images are first processed by a CNN to extract features, and the extracted features are then fed into an LSTM for character recognition.
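As a rough illustration of how CNN features can become an LSTM input sequence, the sketch below turns one feature map into a list of time steps, one per image column. It uses plain nested lists so it runs without any dependencies. The function name `feature_map_to_sequence` and the `[height][width][channels]` layout are assumptions made for this example, not names taken from this repository.

```python
def feature_map_to_sequence(feature_map):
    """Convert a feature map indexed as [height][width][channels]
    into a sequence indexed as [width][height * channels].

    Each column of the image becomes one time step, so the sequence
    runs along the width of the image.
    """
    height = len(feature_map)
    width = len(feature_map[0])
    sequence = []
    for w in range(width):
        step = []
        for h in range(height):
            # Append all channel values found at position (h, w).
            step.extend(feature_map[h][w])
        sequence.append(step)
    return sequence


# Toy feature map: height=2, width=3, channels=2.
# Each value encodes its own position as h*100 + w*10 + c.
toy_map = [[[h * 100 + w * 10 + c for c in range(2)]
            for w in range(3)]
           for h in range(2)]

toy_sequence = feature_map_to_sequence(toy_map)
print(len(toy_sequence), len(toy_sequence[0]))  # 3 4
print(toy_sequence[1])  # [10, 11, 110, 111]
```

With real tensors the same idea is usually expressed as a transpose followed by a reshape (for example with `tf.transpose` and `tf.reshape`), so that the number of time steps equals the feature map width.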

CNN+LSTM+CTC based OCR (Optical Character Recognition) implemented using TensorFlow.

I trained a model on 80k images using this code and got 99.98% accuracy on the test dataset (20k images). The images in both datasets:

Overview

This project is based on the great work from here

The following improvements were made:

correct the time step direction
Previously the time step direction was the channel dimension, which is incorrect. It has now been corrected to the width direction. See here for more discussion on this issue.
optimize training scripts
Previously all training images were loaded into memory; now a simple image generator is used to generate training batches.
metrics implementation
Character and word accuracy are implemented in TensorFlow.
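The image generator mentioned in the list above could look roughly like the following minimal sketch, which loads only one batch of images at a time instead of the whole training set. The names `batch_generator` and `load_image` and the argument layout are hypothetical; the actual training script may be organized differently.

```python
import random


def batch_generator(image_paths, labels, batch_size, load_image, shuffle=True):
    """Yield (images, labels) batches forever, loading images lazily.

    `load_image` is any callable that maps a path to image data
    (hypothetical here; pass in whatever reader you use).
    """
    indices = list(range(len(image_paths)))
    while True:  # each pass over `indices` is one epoch
        if shuffle:
            random.shuffle(indices)
        for start in range(0, len(indices), batch_size):
            batch_indices = indices[start:start + batch_size]
            # Only the images of the current batch are read here.
            images = [load_image(image_paths[i]) for i in batch_indices]
            batch_labels = [labels[i] for i in batch_indices]
            yield images, batch_labels


# Usage with a stand-in loader that just upper-cases the path.
demo = batch_generator(["a.png", "b.png", "c.png"], ["1", "2", "3"],
                       batch_size=2, load_image=str.upper, shuffle=False)
first_batch = next(demo)
second_batch = next(demo)
print(first_batch)   # (['A.PNG', 'B.PNG'], ['1', '2'])
print(second_batch)  # (['C.PNG'], ['3'])
```

Memory use then depends on the batch size rather than on the size of the dataset. The last batch of an epoch may be smaller than `batch_size`.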
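To make the two metrics named in the list above concrete, here is a small pure-Python sketch. It assumes that word accuracy is the fraction of predictions matching their label exactly, and that character accuracy is one minus the total edit distance divided by the total number of label characters. These definitions and the function names are assumptions for illustration; the TensorFlow implementation in this repository may define the metrics differently.

```python
def edit_distance(a, b):
    """Levenshtein distance between two strings."""
    previous = list(range(len(b) + 1))
    for i, char_a in enumerate(a, start=1):
        current = [i]
        for j, char_b in enumerate(b, start=1):
            cost = 0 if char_a == char_b else 1
            current.append(min(previous[j] + 1,          # deletion
                               current[j - 1] + 1,       # insertion
                               previous[j - 1] + cost))  # substitution
        previous = current
    return previous[-1]


def word_accuracy(predictions, labels):
    """Fraction of predictions that match their label exactly."""
    correct = sum(p == l for p, l in zip(predictions, labels))
    return correct / len(labels)


def character_accuracy(predictions, labels):
    """One minus total edit distance over total label characters."""
    total_chars = sum(len(l) for l in labels)
    total_errors = sum(edit_distance(p, l)
                       for p, l in zip(predictions, labels))
    return 1.0 - total_errors / total_chars


predictions = ["hello", "w0rld"]
labels = ["hello", "world"]
print(word_accuracy(predictions, labels))       # 0.5
print(character_accuracy(predictions, labels))
```

In this example one of the two words is wrong, and there is one substituted character across ten label characters. Both functions assume a non-empty list of non-empty labels.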

For the dataset, please see this issue. After extracting the .tar.gz file, the label file (a .txt file) is in the same folder as the images.

python ./train_model.py

python ./eval_model.py
