Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

How to use SynthText for training? #6

Closed
lufo816 opened this issue Feb 17, 2017 · 14 comments
Closed

How to use SynthText for training? #6

lufo816 opened this issue Feb 17, 2017 · 14 comments

Comments

@lufo816
Copy link

lufo816 commented Feb 17, 2017

Text in SynthText is oriented and its label has 4 points, how to use these data for training?

@MhLiao
Copy link
Owner

MhLiao commented Feb 17, 2017

@lufo816 use the maximum rectangle(xmin,ymin,xmax,ymax) as the bounding box

@lufo816
Copy link
Author

lufo816 commented Feb 17, 2017

@MhLiao Understood, thanks a lot!

@vrbala
Copy link

vrbala commented Mar 8, 2017

@MhLiao I am trying to use SynthText to fine tune the model. However, I am not sure how to preprocess the data such that it is suitable to use with TextBoxes. Specifically, I couldn't figure out the schema of data in lmdb files. Could you kindly point me at some relevant references? Thanks in advance.

@MhLiao
Copy link
Owner

MhLiao commented Mar 14, 2017

@vrbala The schema of data in lmdb files is similar to the SSD. You can use the scripts of SSD to generate the lmdb data.

@vrbala
Copy link

vrbala commented Mar 15, 2017

@MhLiao Thanks so much.

@HelloTobe
Copy link

@MhLiao Hi, the SynthText datas are in decimal form, should i use the round function to make them into integers?
What pictures in SynthText did you use for training, validation or testing respectively? (I mean i don't know what pictures should be categorized into training, also is validation and testing.)
Thanks in advance.

@MhLiao
Copy link
Owner

MhLiao commented Apr 28, 2017

@HelloTobe yes. I used 800k images for training, 58750 images for validation.

@HelloTobe
Copy link

Thanks for your reply!
By the way, i'm wondering how long you finish your training.

@HelloTobe
Copy link

@MhLiao Hi, thanks for your reply!

Another problem is that some bounding boxes exceed image boundary(some coordinates are too large or negative), could you give me some adviece to solve this problem?

By the way, i'm wondering how long you finish your training.

@lomograb
Copy link

@HelloTobe You can enable CUDA or cuDNN (GPU MODE) for quick result of training time

@offlaner
Copy link

@MhLiao @lufo816 For training on synthText are we supposed to generate txt files in the form of :
xmin ymin xmax ymax "text"
instead of :
Xcentre Ycentre w h "text"

@MhLiao MhLiao closed this as completed Nov 27, 2018
@github14343
Copy link

@HelloTobe Have you solved the problem " bounding box exceeds image boundary." Thanks for help

@HelloTobe
Copy link

@github14343 I probably just clip the coordinate x such that 0 <= x <= H (or W).

@github14343
Copy link

@HelloTobe Thanks!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

7 participants