You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
For text detection, there are 127k training images and 200 validation images. The training images consist of 68K real scene images and 59K synthetic images. The real scene images are collected from Baidu image search and public datasets, including LSVT (Sun et al. 2019), RCTW-17 (Shi et al. 2017), MTWI 2018 (He and Yang 2018), CASIA-10K (He et al. 2018), SROIE (Huang et al. 2019), MLT 2019 (Nayef et al. 2019), BDI (Karatzas et al. 2011), MSRA TD500 (Yao et al. 2012) and CCPD 2019 (Xu et al. 2018). The synthetic images mainly focus on the scenarios for long texts, multi-direction texts and texts in table. The validation images are all from real scenes.
For text recognition, there are 18.5M training im ages and 18.7K validation images. Among the train ing images, 7M images are real scene images, which come from some public datasets and Baidu image search. The public datasets include LSVT, RCTW-17, MTWI 2018, CCPD 2019, openimages https://github.com/ openimages/dataset and InvoiceDatasets https://github.com/ FuxiJia/InvoiceDatasets. Besides, we scraped 750k finan cial report images from the web. We get 810k images from LSVT unlabeled data by using UIM strategy. We also ob tain about 3M croped images from Pubtabnet https://github. com/ibm-aur-nlp/PubTabNet. The remaining 11.5M syn thetic images mainly focus on scenarios for different back grounds, rotation, perspective transformation, noising, verti cal text, etc. The corpus of synthetic images comes from the real scene images. All the validation images also come from the real scenes.
请提供下述完整信息以便快速定位问题/Please provide the following information to quickly locate the problem
The text was updated successfully, but these errors were encountered: