Tesseract Language Trained Data
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Type Name Latest commit message Commit time
Failed to load latest commit information.
3.02 adding eng0 which lacks a language model Jul 15, 2015
4.0.0
.gitignore
CNAME
README.md
get-langs.sh
gzip-traineddata.sh
langs.txt

README.md

tessdata - Tesseract Language Trained Data

Accessible URLs

4.0.0

3.02

Sources

4.0.0

For 4.0.0, the trained data is forked from TessData Github, and gzip by following command:

$ sh gzip-traineddata.sh 4.0.0

3.02

Most of these were downloaded and extracted from Tesseract's Google Code page. This repository also includes John Lin's meme-ocr traineddata. It also includes the latest (2014-05-01) release of the Ancient Greek traineddata.