Skip to content

Python based OCR program with deep text structure analysing and style detecting

License

Notifications You must be signed in to change notification settings

BalazsNyiro/deepcopy

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

deepcopy

Python based OCR program with deep text structure analysing and style detecting

Features:

  • Rolling screen reading:

    • screenshot -> OCR current screen
    • keyboard/mouse simulation -> next page
    • OCR again, until we reach end page
  • Wawed paper/text recognition

  • Phone screenshots OCR

  • Windows/Linux desktop screenshot OCR

  • image process from multipage PDF

  • color detection to separate keywords and different text elements from source

  • multiline text processing

  • detect images and text blocks on the page

  • structured json export of text, as a database

  • user learning function: the user can teach special chars to the program

  • client/server architecture: you can reach it from command line or from different guis

  • use only basic Python3 libs if it's possible

ROADMAP:

  • 2019 december: program planning, api definitions,

INSTALL:

  • important python3 modules (Ubuntu package names)
    • tkinter
    • python3-pil
    • python3-pil.imagetk

TODO:

  • program planning in doc dir

About

Python based OCR program with deep text structure analysing and style detecting

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages