Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

超轻量模型训练数据 #67

Closed
ltm920716 opened this issue May 19, 2020 · 5 comments
Closed

超轻量模型训练数据 #67

ltm920716 opened this issue May 19, 2020 · 5 comments

Comments

@ltm920716
Copy link

请问超轻量模型的训练数据(中英文数字)是公开的数据集么,还是私有数据集,可以共享么,谢谢

@LDOUBLEV
Copy link
Collaborator

感谢关注,训练数据有lsvt数据集和我们自己合成的数据,我们自己合成的数据暂时不会开源

@DuckJ
Copy link

DuckJ commented May 26, 2020

你好,识别部分能说明一下数据集有哪些和量级么?特别是中文数据集 @LDOUBLEV

@dyning
Copy link
Collaborator

dyning commented May 27, 2020

检测:
英文数据集,ICDAR2015
中文数据集,LSVT(https://rrc.cvc.uab.es/?ch=16) 街景数据集训练数据3w张图片
识别:
英文数据集,MJSynth和SynthText合成数据,数据量上千万。
中文数据集,LSVT(https://rrc.cvc.uab.es/?ch=16) 街景数据集根据真值将图crop出来,并进行位置校准,总共30w张图像。此外基于LSVT的语料,合成数据500w。

@DuckJ
Copy link

DuckJ commented May 28, 2020

@dyning 非常感谢回复告知。

@ltm920716
Copy link
Author

@LDOUBLEV @dyning 感谢回复

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants