Skip to content

NastyBoget/hrtr

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

60 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Handwritten Russian text recognition

Datasets

Name Size
Cyrillic Handwriting Dataset train=72286, test=1544
HKR train=45470, val=9359, test1=5057, test2=5057
HKR (reality) train=45559, val=9375, test1=4966, test2=5043

HKR: val 9133 words in train, 242 words not in train

Metrics

  • Accuracy: $$accuracy=\frac{1}{N}\sum_{i=j}c_{ij}$$

  • For characters: $$CER(prediction,real)=\frac{substitutions+insertions+deletes}{len(real)}$$

  • For words: $$WER(prediction,real)=\frac{substitutions+insertions+deletes}{len(real)}$$

Existing solutions

Metric HKR test1 HKR test2 HKR all
CER 4.13 6.31 4.5
WER 18.91 23.69 19.2
Metric HKR all
Accuracy 82.0
CER 3.49
WER 13.0