Docker for building lat.traineddata from scratch
Switch branches/tags
Clone or download
ryanfb Merge pull request #1 from ryanfb/tesstrain
Merge tesstrain back to master
Latest commit 0dd0f0b Jan 13, 2016
Type Name Latest commit message Commit time
Failed to load latest commit information.
latinocr-lat @ b6885bc Update latinocr-lat submodule Jan 7, 2016
.gitmodules Remove latinocr-lattraining top-level submodule Nov 18, 2015
Dockerfile Add notes on how we clone/build latinocr-lat Jan 13, 2016
LICENSE Add LICENSE Jan 12, 2015 Tesseract 3.04.00 Aug 26, 2015


A Dockerfile for building and installing Tesseract 3.04.00 and the training tools onto an Ubuntu image, then building lat.traineddata from scratch.

  • Run git submodule update --init --recursive to clone the submodules
  • Test the build with docker build .
  • Tag the build with docker build -t tesseract_latinocr .
  • Run the build with docker run -d --name latinocr_build tesseract_latinocr (which you can monitor with docker attach latinocr_build)
  • Copy the finished build artifact out with docker cp latinocr_build:/home/latinocr-lat/lat.traineddata .