'lat' repository, forked from https://github.com/ryanfb/ancientgreekocr-grc. The final training process for lat.traineddata
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Type Name Latest commit message Commit time
Failed to load latest commit information.
latinocr-lattraining @ 24a812b
tools
unicharambigs
.gitignore
.gitmodules
LICENSE
Makefile
README
font_properties
lat.config
lat.numbers.txt
lat.punc.txt
training_text.txt

README

Latin OCR Training for Tesseract
================================

Produces: lat.traineddata

You need wget, unzip and the Tesseract training tools to make this
training.

The following files have been automatically generated using the
tools in the lattraining git repository located at
  https://github.com/ryanfb/latinocr-lattraining

- training_text.txt
- lat.word.txt
- lat.freq.txt
- lat.unicharambigs

You can see the exact process for generating them in the lattraining
Makefile.

The Latin.unicharset file has been copied from Tesseract's
tesseract-ocr.langdata git repository.