Converts ABBYY FineReader XML (for example, from archive.org) to a dsed file, which can be embedded into a DjVu file with 'djvused -u -s -f file.dsed file.djvu'

l-4-l/abbyxml2djvu


abby2dsed.py - Converts ABBYY FineReader XML* to a dsed file that can be embedded into a DjVu file.

  • For Public Domain scans, for example, you can get the ABBYY XML for free from archive.org. Otherwise, use the free trial version of FineReader or buy FineReader.

Example of an ABBYY XML file (under DOWNLOAD OPTIONS, ABBYY GZ): https://archive.org/details/maupassant_selected_by_tolstoi_transl_lazareva_1894_rus
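The ABBYY GZ download is gzip-compressed, while the usage shown in this README passes a plain .xml file. A minimal sketch of unpacking it first, assuming abby2dsed.py expects uncompressed XML (not confirmed here); the function name and file names are hypothetical:

```python
import gzip
import shutil
from pathlib import Path


def gunzip_to_xml(gz_path, xml_path):
    """Decompress a gzip-compressed ABBYY file into a plain XML file.

    Hypothetical helper, not part of abby2dsed.py.
    """
    gz_path, xml_path = Path(gz_path), Path(xml_path)
    # Stream the data so large scans do not have to fit in memory.
    with gzip.open(gz_path, "rb") as src, open(xml_path, "wb") as dst:
        shutil.copyfileobj(src, dst)
    return xml_path


if __name__ == "__main__":
    import sys

    # Example: python gunzip_abbyy.py book_abbyy.gz book.xml
    if len(sys.argv) == 3:
        print(gunzip_to_xml(sys.argv[1], sys.argv[2]))
```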

Usage:

  abby2dsed.py filename.xml

This produces the file filename.dsed in the same folder. The file contains only Unicode text in a bracket-like structure, so it is human-readable and editable. All coordinates are measured from the lower-left corner of each page.
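To illustrate the coordinate convention, here is a minimal sketch of flipping a box from a top-left origin to the lower-left origin used in the dsed file. It assumes the ABBYY XML gives each box as left/top/right/bottom measured from the top-left corner, and that a word entry looks like (word xmin ymin xmax ymax "text"). Both function names are hypothetical, and this is not necessarily how abby2dsed.py does it:

```python
def flip_box(left, top, right, bottom, page_height):
    """Convert a top-left-origin box to (xmin, ymin, xmax, ymax)
    measured from the lower-left corner of the page."""
    # x values are unchanged; y values are mirrored against the page height.
    return (left, page_height - bottom, right, page_height - top)


def format_word(text, box):
    """Render one word as a bracketed entry (assumed dsed-style syntax)."""
    # Escape backslashes first, then double quotes, so the text stays
    # a valid quoted string.
    escaped = text.replace("\\", "\\\\").replace('"', '\\"')
    xmin, ymin, xmax, ymax = box
    return f'(word {xmin} {ymin} {xmax} {ymax} "{escaped}")'


if __name__ == "__main__":
    box = flip_box(10, 20, 110, 50, page_height=1000)
    print(format_word("Hello", box))
```

Keeping the flip in one small function makes it easy to check by hand: a box near the top of the page ends up with large y values after conversion.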

To embed the dsed file into a DjVu file, use:

djvused -u -s -f filename.dsed filename.djvu

(djvused is part of the DjVuLibre package on Linux; Windows and macOS versions also exist.)
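If you prefer to run the embedding step from Python, the command above can be wrapped as in the sketch below. The flags are exactly those shown in this README; the function names are hypothetical, and it assumes djvused is on your PATH:

```python
import shutil
import subprocess


def build_djvused_command(dsed_path, djvu_path):
    """Return the argument list for the djvused call shown in this README."""
    return ["djvused", "-u", "-s", "-f", str(dsed_path), str(djvu_path)]


def embed_text(dsed_path, djvu_path):
    """Run djvused to embed the dsed file; raises if the tool is missing
    or exits with an error."""
    if shutil.which("djvused") is None:
        raise FileNotFoundError("djvused not found on PATH; install DjVuLibre")
    subprocess.run(build_djvused_command(dsed_path, djvu_path), check=True)


if __name__ == "__main__":
    # Only prints the command; call embed_text(...) to actually run it.
    print(" ".join(build_djvused_command("filename.dsed", "filename.djvu")))
```

Because -s saves the result into the DjVu file, you may want to work on a copy.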

xml2dsed.py - The script I used as an example. It was published on the Italian Wikisource; you'll find a link inside the file.
