Skip to content

Algorithms to understand structure of tables in images and perform OCR

License

Notifications You must be signed in to change notification settings

ideaplexus/structured-OCR

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Grid Layout Structured Text Reader

Images containing text with a two column grid structure will have it's text recognized using Tesseract OCR and it's grid structure preserved. Text and structure is saved to a comma delimited file, output.csv.

image

Install

Linux:

chmod +x ./build.sh
./build.sh

Usage

./a.out [image file]

About

Algorithms to understand structure of tables in images and perform OCR

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • C++ 95.3%
  • Shell 4.7%