add a cmd line switch to generate a txt file to go along with the pdf #11

OCRmyPDF-issuebot · 2015-09-14T01:15:48Z

Issue by fritz-hh
Sun Sep 28 20:27:47 2014
Originally opened as fritz-hh/OCRmyPDF#93

OCRmyPDF-issuebot · 2015-09-14T01:15:48Z

Comment by zorglups
Tue Mar 10 21:27:05 2015

I had planned to do this, to allow post-processing based on the OCRed content.

jbarlow83 · 2016-01-12T01:29:39Z

In the spirit of Unix - a command should do one thing well - I'm going to leave this out.

Getting text out is not trivial and everyone will have different requirements.

Use pdftotext from poppler-utils or extractText() in PyPDF2 to get the text out.
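
As a rough sketch of the pdftotext route, the following Python wrapper writes a text file next to an already-OCRed PDF. The function names (pdftotext_command, write_sidecar_text) and the "same name with .txt" convention are assumptions made for illustration; they are not part of OCRmyPDF. It also assumes poppler-utils is installed and pdftotext is on the PATH.

```python
import shutil
import subprocess
from pathlib import Path


def pdftotext_command(pdf_path, txt_path=None, keep_layout=False):
    """Build the argument list for poppler's pdftotext (hypothetical helper)."""
    pdf_path = Path(pdf_path)
    # Assumed convention: the text file shares the PDF's name, with .txt.
    txt_path = Path(txt_path) if txt_path else pdf_path.with_suffix(".txt")
    cmd = ["pdftotext"]
    if keep_layout:
        # -layout asks pdftotext to keep the physical layout of the page.
        cmd.append("-layout")
    cmd += [str(pdf_path), str(txt_path)]
    return cmd


def write_sidecar_text(pdf_path, txt_path=None, keep_layout=False):
    """Run pdftotext on an OCRed PDF and return the path of the text file."""
    if shutil.which("pdftotext") is None:
        raise RuntimeError("pdftotext not found; install poppler-utils")
    cmd = pdftotext_command(pdf_path, txt_path, keep_layout)
    # check=True raises CalledProcessError if pdftotext exits non-zero.
    subprocess.run(cmd, check=True)
    return Path(cmd[-1])
```

Calling write_sidecar_text("scan_ocr.pdf") after OCR would then produce scan_ocr.txt, which any post-processing step can read. Keeping extraction in a separate step like this leaves the choice of tool and layout options to each user.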

OCRmyPDF-issuebot added the enhancement label Sep 14, 2015

jbarlow83 added the wontfix label Jan 12, 2016

jbarlow83 closed this as completed Jan 12, 2016
