Skip to content
No description, website, or topics provided.
Python C XSLT JavaScript HTML CSS Other
Branch: master
Clone or download

Latest commit

Fetching latest commit…
Cannot retrieve the latest commit at this time.

Files

Permalink
Type Name Latest commit message Commit time
Failed to load latest commit information.
app
virtual
.DS_Store
.gitattributes
README.md

README.md

PythonPDFScraper

Description

A local webserver giving you the ability to upload PDF files. Take the PDF and OCR it and/or dump the text.

Installation

First, download the repository.

On a Mac, install XCode:

xcode-select --install

Then, install ocrmypdf: https://github.com/jbarlow83/OCRmyPDF

Lastly, install the requirements (optional - or run the virtual enviornment):

sudo pip3 install -r requirements.txt

To Run

source virtual/bin/activate

python3 app.py runserver

A new web page should open up in your default web browser.

Issues

The progress bar for OCRing documents is only visible in the Terminal for now. So, be patient while it works!

Thanks To

jbarlow83 for pikepdf and ocrmypdf https://github.com/blueimp/jQuery-File-Upload

You can’t perform that action at this time.