tesseractshadow/tesseract4re with a few additional languages and handy aliases
How to use:
ocr: the following command converts my_document.pdf to text files pages_000.txt, pages_001.txt, etc., using the deu+ces+eng language combination
docker run --rm -it --network none -v "$PWD":/home/work mhlavac/docker-tesseract ocr my_document.pdf deu+ces+eng
ocr-deu: same as above, but with the language fixed to deu (German)
docker run --rm -it --network none -v "$PWD":/home/work mhlavac/docker-tesseract ocr-deu my_document.pdf
img2txt-deu: same as above, but for an image; this generates output.txt
docker run --rm -it --network none -v "$PWD":/home/work mhlavac/docker-tesseract img2txt-deu my_document.png output
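The ocr commands write one text file per page. If you want a single text file, the pages can be concatenated afterwards. This is a minimal sketch that assumes the output files follow the pages_000.txt, pages_001.txt naming described above and sit in the current directory; merge_pages and the default name my_document.txt are hypothetical and not part of the image.

```shell
# Hypothetical helper (not shipped with the image): merge per-page text files.
# Assumes an ocr run has already written pages_000.txt, pages_001.txt, ...
# into the current directory.
merge_pages() {
    out="${1:-my_document.txt}"   # output name is an assumption; pass your own
    # The glob expands in sorted order, so zero-padded page numbers
    # are concatenated in page order.
    cat pages_*.txt > "$out"
}
```

Usage: run `merge_pages my_document.txt` in the directory that contains the page files.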
You can work even faster by defining aliases on your own machine:
alias ocr-deu='docker run --rm -it --network none -v "$PWD":/home/work mhlavac/docker-tesseract ocr-deu'
alias ocr-ces='docker run --rm -it --network none -v "$PWD":/home/work mhlavac/docker-tesseract ocr-ces'
alias ocr-eng='docker run --rm -it --network none -v "$PWD":/home/work mhlavac/docker-tesseract ocr-eng'
alias img2txt-deu='docker run --rm -it --network none -v "$PWD":/home/work mhlavac/docker-tesseract img2txt-deu'
alias img2txt-ces='docker run --rm -it --network none -v "$PWD":/home/work mhlavac/docker-tesseract img2txt-ces'
alias img2txt-eng='docker run --rm -it --network none -v "$PWD":/home/work mhlavac/docker-tesseract img2txt-eng'
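Aliases are normally expanded only in interactive shells, so a shell function may be more convenient if you also want to call the image from scripts. This is a sketch, not something the image provides: the function name tess and the DOCKER_BIN variable are assumptions introduced here. DOCKER_BIN only lets you substitute another compatible binary for docker.

```shell
# Hypothetical wrapper: forwards any command of the image (ocr, ocr-deu,
# img2txt-deu, ...) together with its arguments, using the same docker
# options as the aliases above.
tess() {
    # DOCKER_BIN is an assumption of this sketch; it defaults to docker.
    "${DOCKER_BIN:-docker}" run --rm -it --network none \
        -v "$PWD":/home/work mhlavac/docker-tesseract "$@"
}
```

Usage: `tess ocr-deu my_document.pdf` or `tess img2txt-eng my_document.png output`.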