Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

gt_image.txt should contain which type of information. #39

Closed
indiefool opened this issue Dec 20, 2019 · 6 comments
Closed

gt_image.txt should contain which type of information. #39

indiefool opened this issue Dec 20, 2019 · 6 comments

Comments

@indiefool
Copy link

No description provided.

@indiefool
Copy link
Author

No description provided.

cause i just converted the xml file directly into txt but it when i started training it ,it got stuck in one epoch.........
any help.... what i am thinking is should it only contain coordinates or what?
please reply fast i have a project to complete by 22nd ...........

@indiefool
Copy link
Author

will be thankful if someone provide me with an example

@janzd
Copy link
Owner

janzd commented Dec 20, 2019

This is an example of how an annotation file of one image should look like.

377,117,463,117,465,130,378,130,Genaxis Theatre
493,115,519,115,519,131,493,131,[06]
374,155,409,155,409,170,374,170,###
492,151,551,151,551,170,492,170,62-03
376,198,422,198,422,212,376,212,Carpark
494,190,539,189,539,205,494,206,###
374,1,494,0,492,85,372,86,###

There are 8 numbers seperated by commas that represent the coordinates of bounding boxes and the last value on each row is the text in that bounding box. Illegible text is substituted by three hashes and isn't considered when calculating loss during training. If you only have bounding box coordinates, you either have to modify the annotation files to add some random text (the text itself isn't used so it can be literally anything) or you have to modify the code that prepares data and add some random text there or even change it so that it doesn't require text values.

Replacing line[-1] here with a random string such as "ABC" should be enough.
https://github.com/kurapan/EAST/blob/2de0cfb98898c3a8c83b1f38835a15ea8d264d43/data_processor.py#L43

@indiefool
Copy link
Author

one more thing when i annotated my dataset i only got 4 coordinates,
how to get 8 coordinates,should i have to annotate it again?
i used labelimg for annotation

@indiefool
Copy link
Author

those 4 coordinates include xmax , xmin,ymax,ymin

@indiefool
Copy link
Author

ohkk. got the solution.Thanks.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants