Skip to content
Mario Giannini edited this page Dec 27, 2017 · 5 revisions

The Library Pi project was a demonstration project for the Raspberry Pi computer. It is a web interface that allows you to upload a ZIP file of JPG images, which it then OCR’s using tesseract and stores into a MySQL database. Once processing is done, you can then read and perform text-searches against the uploaded book. The following tools are used:

  • Apache and PHP for the front-end
  • MySQL for the database back end
  • Tesseract for the OCR’ing of jpg files
  • Internet Archive Book Reader for display of books.

Note that full documentation on the development of Library Pi is available at www.librarypi.com The Internet Archive Book Reader is included, but original source is at https://archive.org/details/BookReader

Installation
Copy the Site PHP files to your web server. The setup of the database connection can be done manually, or you must give the Apache server write access to the sites root folder.

Run the Stage3.sql script against your MySQL database. It assumes a database named ‘lp’ will be created and used.

Tesseract needs to be installed, and configured with your desired language, including files like osd.traineddata and/or eng.traineddata, etc. Tesseract must be able to be invoked using the exec command in PHP.

Clone this wiki locally