Text Recogniton in News Video frames using EAST text detection and CRNN recognition

Text is everywhere as we know it, some places we encounter it mostly are for example books, newspapers, invoices, hoardings/billboards, in movie subtitles,etc.

For many of these types of texts such as invoices the bills are black & white, have common fonts, and have problems such as tilt in the picture, maybe 2-3 degree or maybe 90/180/270 degree flip, have some kind of blurring, etc. These problems can be solved by detecting tilt by hough transform, using gaussian blurring, sharpening, etc and then using the majorly important tool tesseract.

But, for images in the wild like in stop signs billboard hoardings, for text-recognition, we have to resort to deep learning based approaches.

Aim

I aim to build a pipeline of a text-detection algorithm followed by text-recognition based on deep learning and hope to accomplish the following tasks, also wish to compare to other paid solutions, in the long term. The task list is below, please comment & give feedback as to any features I should add, and also any critical feedback!

Python version used - 3.6

Task List:

Text Detection using EAST algorithm ( Got to use maybe a better algo) on all video frames by sampling.
Text recognition using CRNN on the segments detected from the frames & then printing text recognized frame by frame.
Add text datagenerator script for people looking to do transfer learning on CRNN model.
Try training part, if possible with Mysynth dataset (10 Gb (--)) and our custom data.
Make a viable flask app.
Add the app on docker and also on heroku to host it ( I guess ).
Update to tensorflow 2.x versions.
Create a tflite-app
Aim to increase accuracy, since presently accuracy doesn't strike me as good enough 😓

<----- Installation of Tensorflow 1.8.0 ------>

Please follow the instructions at Install older versions of tensorflow not available directly through pip

Name		Name	Last commit message	Last commit date
Latest commit History 19 Commits
crnn_ctc_ocr.Tensorflow		crnn_ctc_ocr.Tensorflow
.gitignore		.gitignore
README.md		README.md
requirements.txt		requirements.txt
text_detection.py		text_detection.py
text_detection_in_news.ipynb		text_detection_in_news.ipynb
text_detection_wrapper.py		text_detection_wrapper.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

crnn_ctc_ocr.Tensorflow

crnn_ctc_ocr.Tensorflow

.gitignore

.gitignore

README.md

README.md

requirements.txt

requirements.txt

text_detection.py

text_detection.py

text_detection_in_news.ipynb

text_detection_in_news.ipynb

text_detection_wrapper.py

text_detection_wrapper.py

Repository files navigation

Text Recogniton in News Video frames using EAST text detection and CRNN recognition

Aim

<----- Installation of Tensorflow 1.8.0 ------>

About

Releases

Packages

Languages

sirius0503/text_recognition_in_videos

Folders and files

Latest commit

History

Repository files navigation

Text Recogniton in News Video frames using EAST text detection and CRNN recognition

Aim

<----- Installation of Tensorflow 1.8.0 ------>

About

Resources

Stars

Watchers

Forks

Languages