GitHub - hariharanpalani/document-classification

This project used OCR Engine - Tesseract which can be found - here

Steps to do,

Install the OS specific tesseract package.
Run pip install defined in requirements.txt file
Currently all the files in input folder has been processed to output type of document.

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
input		input
.gitignore		.gitignore
README.md		README.md
crop_morphology.py		crop_morphology.py
main.py		main.py
patterns.py		patterns.py
requirements.txt		requirements.txt