Skip to content

Latest commit

 

History

History
22 lines (16 loc) · 1.21 KB

separation.md

File metadata and controls

22 lines (16 loc) · 1.21 KB
layout title description parent grand_parent nav_order permalink github-path
newocr
Separation
How character separation works during scanning in NewOCR.
Scanning
Explanation
1
/explanation/scanning/separation
explanation/scanning/separation.md

Character Separation

Character separation is a simple process in scanning an image in NewOCR. Since each piece of a character (The dot of an I, the top part of an equals sign, the two separate circles of a percent, etc.) are defined as completely separate characters, no character merging is required yet.

Line Separation

{% include line-separation.md %}

Character Separation

The first thing the OCR does it go through all black pixels of the input image (after image binarization), and for every black pixel it gets, it gets all touching pixels recursively. These are then compiled into a single character object.