layout | title | description | keywords | parent | grand_parent | nav_order | permalink | github-path |
---|---|---|---|---|---|---|---|---|
newocr |
Calculation |
How character separation works during training in NewOCR. |
Calculation, Sorting, Character Segmentation |
Training |
Explanation |
3 |
/explanation/training/calculation |
explanation/training/calculation.md |
Arguably the most important step in the OCR, the system next needs to derive the data it's going to store in the database.
{% include calculation.md %}
After the data is calculated for each character, the 16 data points are separately averaged with all other characters, and then the resulting points are stored in the database.
If the option is enabled, font sizes may be stored in the database as well to detect the size of the scanned font during scanning. When a character is iterated over during training, the current line's font size is divided by the height of the character. All of each character's results of this are averaged and the single number per character us stored in the database. This way, it can be multiplied by the scanned character's height and the result will be the font size.