This tool is PoC (Proof of Concept).
Jisui is a helper tool to create e-book.
Ordinary the scanned book have not text information, so you cannot search text from the PDF.
Jisui extract texts from a scanned book (PDF) and merge the text to PDF.
This tool is depending on Google Cloud Vision API to extract texts.
So you need GCP account & own project.
Jisui (自炊) is Japanese slung which means that scanning a book to make e-book.
- GCS bucket
- GCP credential file
- Font file e.g. https://moji.or.jp/ipafont/
$ go get github.com/sachaos/jisui
$ jisui -bucket [your GCS bucket] -font [Downloaded font] -output result.pdf [scanned PDF file]
You can see example PDF file.
Please download and open it in PDF viewer.
You can recongnize the difference when you search text.