Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

The data orgnization #5

Open
JCZ404 opened this issue Dec 1, 2021 · 0 comments
Open

The data orgnization #5

JCZ404 opened this issue Dec 1, 2021 · 0 comments

Comments

@JCZ404
Copy link

JCZ404 commented Dec 1, 2021

Hi,Thank you for your such great work! I feels a little bit confused about the training data used in your code. The data orgnization you mentioned in https://github.com/BryanPlummer/cite/tree/master/data_processing_example
is in h5 form right? I don't understand the meaning of <pair identifier> in data['pair'] in the h5 file, I guess the later element in the pair means whether this phrase is the ground truth phrase of the image, beacause in your code, you said we can use the augmented phrase for training, but what the meaning of the first element in the pair? Besides, when you count the ground truth phrase of the image, it seems worry in your code:
image
you count the num of the ground truth phrase before putting the current gt phrase into list. By the way, how did you generate the augmented phrase? can you explain a little bit about that? Is the result in your paper trained with these augmented phrase?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant