Skip to content

haoyunjixiang/STR

Repository files navigation

STR

Scene Text Det and Rec Resource Basic Knowledge Strategy

  • Papers are sorted by published date.
  • IC is shorts for ICDAR.
  • Score is word-accuracy for recognition task.
    • For results on IC03, IC13, and IC15 dataset, papers used different numbers of samples per paper,
      but we did not distinguish between them
  • *CODE means official code and CODE(M) means that trained model is provided.
Conf. Date Title SVT IIIT5k IC03 IC13 Resources
'15-ICLR 14/12/18 Deep structured output learning for unconstrained text recognition 0.717 0.896 0.818 TF
SLIDE
VIDEO
'16-IJCV 15/05/07 Reading text in the wild with convolutional neural networks 0.807 0.933 0.908 KERAS
'16-AAAI 15/06/14 Reading Scene Text in Deep Convolutional Sequences
'17-TPAMI 15/07/21 An end-to-end trainable neural network for image-based sequence recognition and its application to scene text recognition 0.808 0.782 0.894 0.867 TORCH(M)
TF
TF
TF
TF
PYTORCH
PYTORCH(M)
BLOG(KR)
'16-CVPR 16/03/09 Recursive Recurrent Nets with Attention Modeling for OCR in the Wild 0.807 0.784 0.887 0.9
'16-CVPR 16/03/12 Robust scene text recognition with automatic rectification 0.819 0.819 0.901 0.886 PYTORCH
PYTORCH
'16-CVPR 16/06/27 CNN-N-Gram for Handwriting Word Recognition 0.8362 VIDEO
'16-BMVC 16/09/19 STAR-Net: A SpaTial Attention Residue Network for Scene Text Recognition 0.836 0.833 0.899 0.891
'17-arXiv 17/07/27 STN-OCR: A single Neural Network for Text Detection and Text Recognition 0.798 0.86 0.903 *MXNET(M)
PRJ
BLOG
'17-IJCAI 17/08/19 Learning to Read Irregular Text with Attention Mechanisms
'17-arXiv 17/09/06 Scene Text Recognition with Sliding Convolutional Character Models 0.765 0.816 0.845 0.852
'17-ICCV 17/09/07 Focusing Attention: Towards Accurate Text Recognition in Natural Images 0.859 0.874 0.942 0.933
'18-CVPR 17/11/12 AON: Towards Arbitrarily-Oriented Text Recognition 0.828 0.87 0.915 TF
'17-NIPS 17/12/04 Gated Recurrent Convolution Neural Network for OCR 0.815 0.808 0.978 *TORCH(M)
'18-AAAI 18/01/04 Char-Net: A Character-Aware Neural Network for Distorted Scene Text Recognition 0.844 0.836 0.915 0.908
'18-AAAI 18/01/04 SqueezedText: A Real-time Scene Text Recognition by Binary Convolutional Encoder-decoder Network 0.87 0.931 0.929
'18-CVPR 18/05/09 Edit Probability for Scene Text Recognition 0.875 0.883 0.946 0.944
'18-TPAMI 18/06/25 ASTER: An Attentional Scene Text Recognizer with Flexible Rectification 0.936 0.934 0.945 0.918 *TF(M)
PYTORCH
'18-ECCV 18/09/08 Synthetically Supervised Feature Learning for Scene Text Recognition 0.871 0.894 0.947 0.94
'19-AAAI 18/09/18 Scene Text Recognition from Two-Dimensional Perspective 0.821 0.92 0.914
'19-AAAI 18/11/02 Show, Attend and Read: A Simple and Strong Baseline for Irregular Text Recognition 0.845 0.915 0.91 *TORCH(M)
'19-CVPR 18/12/14 ESIR: End-to-end Scene Text Recognition via Iterative Image Rectification 0.902 0.933 0.913 PRJ
'19-PR 19/01/10 MORAN: A Multi-Object Rectified Attention Network for Scene Text Recognition 0.883 0.912 0.950 0.924 *PYTORCH(M)
'19-ICCV 19/04/03 What is wrong with scene text recognition model comparisons? dataset and model analysis 0.875 0.949 0.936 *PYTORCH(M)
BLOG_KR
'19-CVPR 19/04/18 Aggregation Cross-Entropy for Sequence Recognition 0.826 0.823 0.921 0.897 *PYTORCH
'19-CVPR 19/06/16 Sequence-to-Sequence Domain Adaptation Network for Robust Text Image Recognition 0.845 0.838 0.921 0.918
'19-ICCV 19/08/06 Symmetry-constrained Rectification Network for Scene Text Recognition 0.889 0.944 0.95 0.939
'20-AAAI 19/12/28 TextScanner: Reading Characters in Order for Robust Scene Text Recognition 0.895 0.926 0.925
'20-AAAI 19/12/21 Decoupled Attention Network for Text Recognition 0.892 0.943 0.95 0.939 *PYTORCH(M)
'20-AAAI 20/02/04 GTC: Guided Training of CTC 0.929 0.955 0.952 0.943 Detail
'20-ECCV 20/07/04 PlugNet: Degradation Aware Scene Text Recognition Supervised by a Pluggable Super-Resolution Unit 0.923 0.944 0.957 0.950 *PYTORCH
Detail
'21-CVPR 20/12/10 Sequence-to-Sequence Contrastive Learning for Text Recognition 0.913 0.948 0.940 *PYTORCH(M)
'21-CVPR 21/03/04 Toward Scene Text Recognition With Fewer Labels 0.913 0.948 0.940 *PYTORCH(M)
'21-CVPR 21/03/06 Read Like Humans: Autonomous, Bidirectional and Iterative Language Modeling for Scene Text Recognition 0.955 0.972 0.977 *PYTORCH(M)
Detail
'21-CVPR 21/05/04 Primitive Representation Learning for Scene Text Recognition 0.940 0.956 0.958 0.964 *PYTORCH(M)
BLOG
'21-ICCV 21/07/12 Joint Visual Semantic Reasoning: Multi-Stage Decoder for Text Recognition 0.922 0.952 0.955 Detail
'21-ICCV 21/08/09 From Two to One: A New Scene Text Recognizer with Visual Language Modeling Network 0.917 0.958 0.957 *PYTORCH(M)
Detail

Text Recognition with Semantic information

Conf. Date Title SVT IIIT5k IC13 IC15 Resources
'20-CVPR 20/05/10 SEED: Semantics Enhanced Encoder-Decoder Framework for Scene Text Recognition 89.6 93.8 92.8 80.0 Detail
*PYTORCH(M)
'20-CVPR 20/03/04 SRN: Towards Accurate Scene Text Recognition with Semantic Reasoning Networks 91.5 94.8 95.5 82.7 Detail
*PYTORCH(M)
'21-CVPR 21/03/06 ABInet: Read Like Humans: Autonomous, Bidirectional and Iterative Language Modeling for Scene Text Recognition 0.955 0.972 0.977 0.869 *PYTORCH(M)
Detail
‘22-AAAI 21/12/12 GTR: Visual Semantics Allow for Textual Reasoning Better in Scene Text Recognition 95.8 97.5 97.5 87.3 Detail
*PYTORCH(M)

Text Recognition base Classification

Conf. Date Title SVT IIIT5k IC13 IC15 Resources
... 21/02/12 Revisiting Classification Perspective on Scene Text Recognition 92.3 94.5 94.1 82.0 Detail
*PYTORCH(M)

Layout-Parse OCR

method collect

Conf. Date Title SROIE FUNSD Resources
21-ACM 2021/08/02 StrucTexT: Structured Text Understanding with Multi-ModalTransformers 96.88 83.09 code
video
point

About

Scene Text Det and Rec Resource

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages