A simple program to extract the text from an image before performing OCR



I am not actively supporting this script. It was just an experiment.

Processes an image to extract the portions that contain text. Primarily intended as a pre-processing step before running OCR.

Implemented in Python using OpenCV.

Based on the paper "Font and Background Color Independent Text Binarization" by T. Kasar, J. Kumar and A. G. Ramakrishnan: http://www.m.cs.osakafu-u.ac.jp/cbdar2007/proceedings/papers/O1-1.pdf
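
The snippet below is a minimal sketch of what "font and background color independent" can mean in practice. It is not taken from `extract_text`, and it simplifies the approach described in the paper. The function name `binarize_box`, its parameters, and the choice of estimators (mean of edge pixels for the foreground, median of the four corner pixels for the background) are assumptions made for illustration. It uses plain Python lists rather than OpenCV images so that it runs without extra dependencies.

```python
from statistics import mean, median


def binarize_box(gray, edge_pixels):
    """Binarize one candidate text box so text is black (0) on white (255).

    gray: 2D list of grayscale intensities (0-255) covering the box.
    edge_pixels: list of (row, col) positions assumed to lie on character edges.
    Hypothetical helper; not part of the extract_text script.
    """
    height, width = len(gray), len(gray[0])

    # Foreground estimate: mean intensity at the edge pixels.
    foreground = mean(gray[r][c] for r, c in edge_pixels)

    # Background estimate: median of the four corner pixels of the box.
    corners = [
        gray[0][0],
        gray[0][width - 1],
        gray[height - 1][0],
        gray[height - 1][width - 1],
    ]
    background = median(corners)

    # Decide the polarity per box, so both dark-on-light and
    # light-on-dark text end up as black text on a white background.
    dark_text = foreground < background

    out = []
    for row in gray:
        if dark_text:
            out.append([0 if v <= foreground else 255 for v in row])
        else:
            out.append([0 if v >= foreground else 255 for v in row])
    return out


# Light text on a dark background and dark text on a light background
# should produce the same binarized result.
edges = [(1, 1), (1, 2)]
light_on_dark = [[30, 30, 30, 30], [30, 200, 200, 30], [30, 30, 30, 30]]
dark_on_light = [[220, 220, 220, 220], [220, 40, 40, 220], [220, 220, 220, 220]]
print(binarize_box(light_on_dark, edges) == binarize_box(dark_on_light, edges))
```

Because the polarity is decided for each box separately, a single image containing both light and dark text can be normalized to one convention before being handed to an OCR engine.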

Copyright (c) 2012, Jason Funk jasonlfunk@gmail.com