./pdf2sandwich-pdf <file.pdf>
Inspired by
- https://quaintproject.wordpress.com/2015/09/30/searchable-pdf-from-scan-under-linux/
- https://diging.atlassian.net/wiki/spaces/DCH/pages/5275668/Tutorial%3A+Text+Extraction+and+OCR+with+Tesseract+and+ImageMagick
There's no need to solve problems if someone already did that. This project seems to be promising: OCRmyPDF