match submission PDF to student based on ID
Switch branches/tags
Nothing to show
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Failed to load latest commit information.
.gitignore
LICENSE
README.md
requirements.txt
sid.jpeg
sid.pdf
test.py

README.md

sidocr

Matches submission PDF to student based on ID by aligning submission with the template, cropping the SID, and running OCR to extract the SID.

#Installation

Clone the repository.

git clone git@github.com:alvinwan/sidocr.git

If Homebrew is not already installed, install package manager Homebrew

/usr/bin/ruby -e "$(curl -fsSL https://raw.githubusercontent.com/Homebrew/install/master/install)"

Using brew, install Tesseract

brew install tesseract

Setup your virtual environment. The following will create a new environment called sidocr.

conda create -n sidocr python=3.4

Activate your virtual environment, and install all dependencies from requirements.txt.

source activate sidocr
pip install -r requirements.txt

At this point, you may exit your virtual environment

source deactivate

Installation complete. See "How to Use" to get started.

How to Use

Make sure to activate your virtual environment, if you haven't already. (If you are in the environment, your prompt will be prefixed by (sidocr))

source activate sidocr