Skip to content

Latest commit

 

History

History
28 lines (19 loc) · 1.36 KB

calculation.md

File metadata and controls

28 lines (19 loc) · 1.36 KB
layout title description keywords parent grand_parent nav_order permalink github-path
newocr
Calculation
How character separation works during training in NewOCR.
Calculation, Sorting, Character Segmentation
Training
Explanation
3
/explanation/training/calculation
explanation/training/calculation.md

Character Calculation

Arguably the most important step in the OCR, the system next needs to derive the data it's going to store in the database.

Character Segmentation

{% include calculation.md %}

Storing The Data

After the data is calculated for each character, the 16 data points are separately averaged with all other characters, and then the resulting points are stored in the database.

Font Sizes

If the option is enabled, font sizes may be stored in the database as well to detect the size of the scanned font during scanning. When a character is iterated over during training, the current line's font size is divided by the height of the character. All of each character's results of this are averaged and the single number per character us stored in the database. This way, it can be multiplied by the scanned character's height and the result will be the font size.