OpenOCR aims to establish a unified training and evaluation benchmark for scene text detection and recognition algorithms, at the same time, serves as the official code repository for the OCR team from the FVL Laboratory, Fudan University.
We are actively developing and refining it and expect to release the first version as soon as possible.
We sincerely welcome the researcher to recommend OCR or relevant algorithms and point out any potential factual errors or bugs. Upon receiving the suggestions, we will promptly evaluate and critically reproduce them. We look forward to collaborating with you to advance the development of OpenOCR and continuously contribute to the OCR community!
- DPTR (Shuai Zhao, Yongkun Du, Zhineng Chen*, Yu-Gang Jiang. Decoder Pre-Training with only Text for Scene Text Recognition, ACM MM 2024. paper)
- IGTR (Yongkun Du, Zhineng Chen*, Yuchen Su, Caiyan Jia, Yu-Gang Jiang. Instruction-Guided Scene Text Recognition, 2024. Doc, Under TPAMI second round 2024. paper)
- SVTRv2 (Yongkun Du, Zhineng Chen*, Caiyan Jia, Yu-Gang Jiang. SVTRv2: Towards Arbitrary-Shaped Text Recognition with a Single Visual Model, 2024. paper coming soon)
- SMTR&FocalSVTR (Yongkun Du, Zhineng Chen*, Caiyan Jia, Xieping Gao, Yu-Gang Jiang. Out of Length Text Recognition with Sub-String Matching, 2024. paper)
- CDistNet (Tianlun Zheng, Zhineng Chen*, Shancheng Fang, Hongtao Xie, Yu-Gang Jiang. CDistNet: Perceiving Multi-Domain Character Distance for Robust Text Recognition, IJCV 2024. paper)
- MRN (Tianlun Zheng, Zhineng Chen*, Bingchen Huang, Wei Zhang, Yu-Gang Jiang. MRN: Multiplexed routing network for incremental multilingual text recognition, ICCV 2023. paper)
- TPS++ (Tianlun Zheng, Zhineng Chen*, Jinfeng Bai, Hongtao Xie, Yu-Gang Jiang. TPS++: Attention-Enhanced Thin-Plate Spline for Scene Text Recognition, IJCAI 2023. paper)
- CPPD (Yongkun Du, Zhineng Chen*, Caiyan Jia, Xiaoting Yin, Chenxia Li, Yuning Du, Yu-Gang Jiang. Context Perception Parallel Decoder for Scene Text Recognition, Under TPAMI minor revisons 2023. PaddleOCR Doc, paper)
- SVTR (Yongkun Du, Zhineng Chen*, Caiyan Jia, Xiaoting Yin, Tianlun Zheng, Chenxia Li, Yuning Du, Yu-Gang Jiang. SVTR: Scene Text Recognition with a Single Visual Model, IJCAI 2022 (Long). PaddleOCR Doc, paper)
- NRTR (Fenfen Sheng, Zhineng Chen*, Bo Xu. NRTR: A No-Recurrence Sequence-to-Sequence Model For Scene Text Recognition, ICDAR 2019. paper)
Reproduction schedule:
Method | Venue | Training | Evaluation | Contributor |
---|---|---|---|---|
CRNN | TPAMI 2016 | ✅ | ✅ | |
ASTER | TPAMI 2019 | ✅ | ✅ | pretto0 |
NRTR | ICDAR 2019 | ✅ | ✅ | |
SAR | AAAI 2019 | ✅ | ✅ | pretto0 |
MORAN | PR 2019 | ✅ | ✅ | Debug |
DAN | AAAI 2020 | ✅ | ✅ | |
RobustScanner | ECCV 2020 | ✅ | ✅ | pretto0 |
AutoSTR | ECCV 2020 | ✅ | ✅ | |
SRN | CVPR 2020 | ✅ | ✅ | pretto0 |
SEED | CVPR 2020 | ✅ | ✅ | |
ABINet | CVPR 2021 | ✅ | ✅ | YesianRohn |
VisionLAN | ICCV 2021 | ✅ | ✅ | YesianRohn |
SVTR | IJCAI 2022 | ✅ | ✅ | |
PARSeq | ECCV 2022 | ✅ | ✅ | |
MATRN | ECCV 2022 | ✅ | ✅ | |
MGP-STR | ECCV 2022 | ✅ | ✅ | |
CPPD | 2023 | ✅ | ✅ | |
LPV | IJCAI 2023 | ✅ | ✅ | |
MAERec(Union14M) | ICCV 2023 | ✅ | ✅ | |
LISTER | ICCV 2023 | ✅ | ✅ | |
CDistNet | IJCV 2024 | ✅ | ✅ | YesianRohn |
BUSNet | AAAI 2024 | ✅ | ✅ | |
DCTC | AAAI 2024 | TODO | ||
CAM | PR 2024 | ✅ | ✅ | |
OTE | CVPR 2024 | ✅ | ✅ | |
CFF | IJCAI 2024 | TODO | ||
DPTR | ACM MM 2024 | TODO | ||
VIPTR | ACM CIKM 2024 | TODO | ||
IGTR | 2024 | ✅ | ✅ | |
SMTR | 2024 | ✅ | ✅ | |
FocalSVTR-CTC | 2024 | ✅ | ✅ | |
SVTRv2 | 2024 | ✅ | ✅ | |
ResNet+Trans-CTC | ✅ | ✅ | ||
ViT-CTC | ✅ | ✅ |
Yiming Lei (pretto0) and Xingsong Ye (YesianRohn) from the FVL Laboratory, Fudan University, under the guidance of Professor Zhineng Chen, completed the majority of the algorithm reproduction work. Grateful for their outstanding contributions.
This codebase is built based on the PaddleOCR, PytorchOCR, and MMOCR. Thanks for their awesome work!