Latin OCR Training for Tesseract
================================

Produces: lat.traineddata

You need wget, unzip, and the Tesseract training tools to build this
training data.
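
As a rough pre-flight check, the sketch below reports which of the required
programs are not on PATH. wget and unzip come from the requirement above. The
list of Tesseract training executables is an assumption based on the usual
Tesseract 3.x training tools; the Makefile in this repository may call a
different subset. The helper name missing_tools is hypothetical.

```python
import shutil

# Download and unpack tools named in this README.
BASIC_TOOLS = ["wget", "unzip"]

# Assumption: common Tesseract 3.x training executables. The Makefile in
# this repository may use a different subset, so adjust the list to match.
TRAINING_TOOLS = [
    "tesseract",
    "text2image",
    "unicharset_extractor",
    "mftraining",
    "cntraining",
    "wordlist2dawg",
    "combine_tessdata",
]


def missing_tools(names, which=shutil.which):
    """Return the names that cannot be found on PATH, in input order."""
    return [name for name in names if which(name) is None]


if __name__ == "__main__":
    missing = missing_tools(BASIC_TOOLS + TRAINING_TOOLS)
    if missing:
        print("Missing tools: " + ", ".join(missing))
    else:
        print("All listed tools were found on PATH.")
```

The lookup function is passed in as a parameter so the check can be exercised
without the tools being installed.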

The following files have been automatically generated using the
tools in the lattraining git repository located at
  https://github.com/ryanfb/latinocr-lattraining

- training_text.txt
- lat.word.txt
- lat.freq.txt
- lat.unicharambigs

You can see the exact process for generating them in the lattraining
Makefile.

The Latin.unicharset file has been copied from Tesseract's
tesseract-ocr.langdata git repository.

About

The 'lat' repository, forked from https://github.com/ryanfb/ancientgreekocr-grc, holds the final training process for lat.traineddata.
