Document-Scanner

Document-Scanner is open-source python package to scan, segment and tranform images of documents as if the documents is scanned by a scanner. It includes predefined pipelines on preprocessing, frame detection, transformation and post processing to add styles.

Pipeline

Convert to HSV color space

The following pipelines is applied first on intensity slice , or the Value phase, of the original image. If failed to find frame in the intensity image, apply exactly the same processes to saturation image.
Preprocessing
1. Blur with Median filter
2. Histogram equalization
3. Morphological operation (Opening)
4. (Optional) Threshold based segmentation.
  
  Here we assume that the document of interest is mainly white while background is darker. Then we can extract document from background with a proper threshold. After histogram, maybe we can just assume the document lays in the half brighter part on histogram.
5. Canny edge detector
6. Contour detection
7. Morphological Erosion
8. Morphological Dilation
  
  This step is to dilate the contour to reduce the impact of non-linear edge when calculating connectivity.
Hough Transform
Intersection 1. Find the cartesian coordination of intersection points 1. Calculate connectivity on every intersections on four direction: up, right, bottom, left. 1, Corner Compute the possiblity on every intersection points to decide the orientation of corner.
Frame detection
1. Find possible frames
2. Select the most possible frame
Warp
(TODO) Post process

Demo

Use /scripts/scan_demo.py to see what's happen.

Put images under /data/images and run the scripts.

Usage

Dependencies

The minimum required dependencies to run document-scanner are:

Python>=3.6
OpenCV4
scikit-image
pandas
numpy

Use the following command to install dependencies with pip:

pip install -r requirements.txt

Name		Name	Last commit message	Last commit date
Latest commit History 119 Commits
.github		.github
data/images/segment		data/images/segment
doc_scanner		doc_scanner
scripts		scripts
tests		tests
.gitignore		.gitignore
.travis.yml		.travis.yml
LICENSE		LICENSE
README.md		README.md
requirements.txt		requirements.txt
setup.cfg		setup.cfg
setup.py		setup.py
tasks.py		tasks.py
tox.ini		tox.ini

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Document-Scanner

Pipeline

Demo

Usage

Dependencies

Contribution

About

Releases

Packages

Contributors 3

Languages

License

fMeow/document-scanner

Folders and files

Latest commit

History

Repository files navigation

Document-Scanner

Pipeline

Demo

Usage

Dependencies

Contribution

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Languages

Packages