Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Format of the training and validation data that is used for training. #1

Closed
AhmadShaik opened this issue Sep 9, 2020 · 2 comments
Closed

Comments

@AhmadShaik
Copy link

Thanks for sharing such a great work. I read the base paper its quite fascinating. I am trying to reproduce the results with my own custom training data.
I would like to know the detail format and folder structure of the train/validation data.

@asmekal
Copy link
Owner

asmekal commented Sep 9, 2020

Hi, thanks for your interest in our work

Our training data has 2-folder structure: "Image" with image data and "Markup" with xml markup data. You can find examples here These examples are public datasets which we publish in our other work.

One way is to structure your data in the same format as we have. When you just run python train.py -t TRAIN_PATH -v VALID_PATH

Alternatively, you may write data reader for your data structure similarly to XMLBarcodeMarkupReader, then specify option -mt YOUR_MARKUP_TYPE when running train.py

@AhmadShaik
Copy link
Author

Hi asmekal. Thank for the response. I am planning to pursue the first path.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants