This project is a reimplementation of Michele Moglia's project on Launchpad; see https://launchpad.net/pysesh
Our goal is to gradually implement some of the functions of JSesh for bulk processing of encoded Egyptian texts. The milestones are currently being discussed with M. Serge Rosmorduc, the developer of JSesh.
On the OCR side, the following goals are being established:
- The input is a scanned page and the output is an MdC (Manuel de Codage) file.
- Implement a Generative Adversarial Network to separate the sections of a book that contain hieroglyphic characters
- Implement a line-separation algorithm to split hieroglyphic texts into lines and columns
- Implement a character/sequence recognizer
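
The line/column separation goal has no chosen algorithm yet. As a rough illustration only, here is a minimal sketch of one common baseline, the projection profile: count the ink pixels in each row and cut wherever the count drops to zero. It assumes the page has already been binarized into a grid of 0 (background) and 1 (ink). The function names `split_lines` and `split_columns` and the `min_ink` parameter are hypothetical and not part of this project or of JSesh.

```python
from typing import List, Tuple

Page = List[List[int]]  # assumed format: rows of 0 (background) / 1 (ink)


def split_lines(page: Page, min_ink: int = 1) -> List[Tuple[int, int]]:
    """Return (start, end) row ranges, end exclusive, for each band of rows
    whose ink count is at least min_ink."""
    # Horizontal projection profile: ink pixels per row.
    profile = [sum(row) for row in page]
    bands = []
    start = None
    for y, ink in enumerate(profile):
        if ink >= min_ink and start is None:
            start = y                  # a band of text begins
        elif ink < min_ink and start is not None:
            bands.append((start, y))   # a blank row closes the band
            start = None
    if start is not None:
        bands.append((start, len(profile)))  # band runs to the page edge
    return bands


def split_columns(page: Page, min_ink: int = 1) -> List[Tuple[int, int]]:
    """Same idea for column-oriented texts: transpose, then split."""
    transposed = [list(col) for col in zip(*page)]
    return split_lines(transposed, min_ink)


page = [
    [0, 0, 0, 0],
    [1, 1, 0, 1],
    [1, 0, 0, 1],
    [0, 0, 0, 0],
    [0, 1, 0, 0],
]
lines = split_lines(page)      # [(1, 3), (4, 5)]
columns = split_columns(page)  # [(0, 2), (3, 4)]
```

A plain projection profile only works when the scan is straight and the lines are cleanly separated by blank space; skewed scans or touching signs would need deskewing or a more robust method.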