This short script should install a complete Document Management System on a Raspberry Pi ( Wheezy ) or other Debian based Systems. This script takes only care of the installation Process from other tools, At the End, i hope, you can access all your scanned Documents via Webbrwoser and you can search for words and strings.
Switch branches/tags
Nothing to show
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Failed to load latest commit information.



This scirpts works only with Debian Wheezy at the moment (10.04.2016)

Based on a Problem with OCRmyPDF in Jessie. I try to fix the code soon as possible. Work in Progress. (10.04.2016)

Here is a full image (tested with Raspberry B).

#Wheezy complete image (16gb):



User: pi PW: sdm

samba: pi pw: sdm

direct link:

via Bittorrent Sync (6mbit upload max)

What is SimpleDocumentManagement?

SimpleDocumentManagement is a lightweight solution for digital document management with OCR, which works on the Raspberry Pi (tested with Raspbian). It based on other projects, create a structure and is an installer for the document management. You will have access to the documents by using a webinterface, an desktop application and samba network share.

It is designed to turn a clean, fresh installed raspberry Pi to a document managment system, with easy functions.

Know issues

  • German umlauts (äöu etc.) does not work.
  • Use old versions of some scripts because of compatibility.

Install instructions (detail)

  1. if github is not installed (sudo apt-get install git)
  2. Clone or download the project from GitHub (command: git clone )
  3. Chnage the folder (command: cd SimpleDocumentManagement)
  4. Change the configurations at ~/contrib/ (command: nano contrib/
  5. Change the permission for the installer. (command: sudo chmod +x
  6. Run the installer (command: sudo ./
  7. sudo reboot

recommend optional commandes (if you have trouble with recoll GUI)

  1. sudo chmod +x
  2. sudo ./

Install instructions (just commands)

  1. git clone
  2. cd SimpleDocumentManagement
  3. OPTIONAL - nano contrib/ (no need, if you want change anything)
  4. sudo chmod +x
  5. sudo ./
  6. sudo reboot
  7. cd SimpleDocumentManagement
  8. sudo chmod +x
  9. sudo ./


This script installs a Samba server for user "pi". If you have Samba already installed, you can have some trouble.

If you do not want the Samba, please disable it in the config (contrib/


This script should install a complete document management systeme.g. on a Raspberry Pi (Raspbian) or other Debian based systems. It takes only care of the installation process from other tools. At the end you can access all your scanned documents via webbrowser on http://IP:8080. All your documents should now have fulltext search possibility.

How does it work

If you scan a document (.tiff or .pdf) put it into the RAW folder. A script converts .tiff to .pdf (if the file is a .pdf nothings happened) and starts the OCRmyPDF script for this file (each file in the RAW Folder). After this part, you get a searchable PDF.

The search engine Recoll looks for new files at the handled folder. If there is a new file, Recoll will add this file to his database. Now you can use Recoll (desktop app and webinterface) to search your documents. For instance searching for "billing" will show you all the Documents with this word.

If you want to search your documents with your e.g. tablet you can access the Recoll Webui via http://IP:8080.


  • Take care that you never release your Stuff to the Internet!

In some cases it can be possible, that you cant see any files ( located at /home/USER/.recoll ). So the script cant added the right link to your files.

First reboot your device, than use the ( sudo chmod +x and then sudo ./ )

Know issues

  • German umlauts (äöu etc.) does not work well.
  • Use old versions of some scripts because of compatibility.

Upcomming feature (hopefull)

  • Send documents via Mail to the SDM-Pi
  • Dockr Container
  • Torrent to a full working Image


At this point I want to say thanks to Jan Karres ( for the scripts (,, and the This script based on his work (article in German:

Also I want to say thanks to the developer of Recoll and the Github user Koniu for his Webrecoll Projekt ( and Github user fritz-hh for his work on OCRmyPDF (