layout | title | description | parent | grand_parent | nav_order | permalink | github-path |
---|---|---|---|---|---|---|---|
newocr | Separation | How character separation works during scanning in NewOCR. | Scanning | Explanation | 1 | /explanation/scanning/separation | explanation/scanning/separation.md |
Character separation is a simple step in scanning an image with NewOCR. At this stage, each disconnected piece of a character (the dot of an i, the top bar of an equals sign, the two separate circles of a percent sign, etc.) is treated as a completely separate character, so no character merging is required yet.
{% include line-separation.md %}
The first thing the OCR does is go through every black pixel of the input image (after image binarization). For each black pixel that has not been visited yet, it recursively collects all black pixels touching it. The resulting group of connected pixels is then compiled into a single character object.
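
The pixel-grouping step described above can be sketched as a flood fill over the binarized image. This is a minimal illustration and not NewOCR's actual code: the function name `separate_characters`, the representation of the image as a list of rows of booleans (`True` meaning black), and the choice to count diagonal neighbors as "touching" are all assumptions made for the example. It also uses an explicit stack instead of recursion, which gives the same grouping while avoiding recursion-depth limits on large blobs.

```python
from typing import List, Set, Tuple

Pixel = Tuple[int, int]  # (row, col); hypothetical representation


def separate_characters(image: List[List[bool]]) -> List[Set[Pixel]]:
    """Group black pixels (True) into sets of mutually touching pixels."""
    height = len(image)
    width = len(image[0]) if image else 0
    visited = [[False] * width for _ in range(height)]
    components: List[Set[Pixel]] = []

    for row in range(height):
        for col in range(width):
            # Skip white pixels and pixels already assigned to a group.
            if not image[row][col] or visited[row][col]:
                continue

            # Flood fill from this pixel using an explicit stack.
            component: Set[Pixel] = set()
            stack = [(row, col)]
            visited[row][col] = True
            while stack:
                r, c = stack.pop()
                component.add((r, c))
                # Assumption: diagonal neighbors count as touching.
                for dr in (-1, 0, 1):
                    for dc in (-1, 0, 1):
                        nr, nc = r + dr, c + dc
                        if (0 <= nr < height and 0 <= nc < width
                                and image[nr][nc] and not visited[nr][nc]):
                            visited[nr][nc] = True
                            stack.append((nr, nc))
            components.append(component)

    return components


# A tiny "i": the dot and the stem are not connected.
letter_i = [
    [False, True, False],
    [False, False, False],
    [False, True, False],
    [False, True, False],
]
pieces = separate_characters(letter_i)
print(len(pieces))  # 2
```

In this sketch the dot and the stem of the "i" come out as two separate groups, which matches the behavior described earlier: pieces of a character stay separate at this stage and are only merged later.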