GitHub - dimifili5/txt-localisation: Detecting text areas in mixed digital documents using Python's OpenCV, Haar-like features, morphological features, integral images, and a SOM neural network.

Detecting text areas in mixed digital documents using Python's OpenCV, Haar-like features, morphological features, integral images, and a SOM (self-organizing map) neural network. First, an image is uploaded and preprocessed. Haar-like features and morphological features are then computed on the preprocessed image using its integral image, which greatly reduces the program's execution time. The resulting feature values are saved and passed to the pretrained SOM neural network. The SOM model has a 4x4 topology in which some neurons correspond to text areas. The SOM network is trained on documents published in scientific magazines (IEEE) that contain both text and graphics such as images, diagrams, and tables.
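
The pipeline described above can be illustrated with a minimal sketch. It is not the code from `_12masks.py`: the function names, the toy image, the two-rectangle mask, the random SOM weights, and the `TEXT_NEURONS` set are all assumptions made for illustration. It shows why an integral image makes rectangular features cheap (any rectangle sum costs four lookups) and how a feature vector could be matched to a neuron on a 4x4 map.

```python
import numpy as np

def integral_image(gray):
    """Summed-area table padded with a zero first row and column,
    so that ii[y, x] equals the sum of gray[:y, :x]."""
    ii = np.zeros((gray.shape[0] + 1, gray.shape[1] + 1), dtype=np.int64)
    ii[1:, 1:] = gray.astype(np.int64).cumsum(axis=0).cumsum(axis=1)
    return ii

def rect_sum(ii, x, y, w, h):
    """Sum of the w-by-h rectangle with top-left corner (x, y), using four lookups."""
    return int(ii[y + h, x + w] - ii[y, x + w] - ii[y + h, x] + ii[y, x])

def haar_two_rect_horizontal(ii, x, y, w, h):
    """Two-rectangle Haar-like feature: left half minus right half (w must be even)."""
    half = w // 2
    return rect_sum(ii, x, y, half, h) - rect_sum(ii, x + half, y, half, h)

def best_matching_unit(weights, features):
    """Return (row, col) of the SOM neuron whose weight vector is closest to features.
    weights has shape (rows, cols, n_features)."""
    dists = np.linalg.norm(weights - features, axis=2)
    r, c = np.unravel_index(np.argmin(dists), dists.shape)
    return int(r), int(c)

# Toy 4x4 "image": bright left half, dark right half.
img = np.array([[9, 9, 1, 1]] * 4, dtype=np.uint8)
ii = integral_image(img)
total = rect_sum(ii, 0, 0, 4, 4)                 # 80
edge = haar_two_rect_horizontal(ii, 0, 0, 4, 4)  # 72 - 8 = 64

# Hypothetical 4x4 SOM with random (untrained) weights and two features per neuron.
rng = np.random.default_rng(0)
som_weights = rng.random((4, 4, 2))
TEXT_NEURONS = {(0, 0), (0, 1)}  # assumed labels; a real map would be labeled after training
bmu = best_matching_unit(som_weights, np.array([0.5, 0.5]))
is_text = bmu in TEXT_NEURONS
```

With OpenCV, `cv2.integral` returns the same zero-padded table, so `rect_sum` would work on its output unchanged. In a real run the SOM weights would come from the pretrained model rather than from a random generator.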

Files (8 commits):
README.md
_12masks.py
final_img.jpg
initial_img.jpg
