Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Regarding training process #50

Closed
wonderingalways opened this issue Jun 12, 2021 · 4 comments
Closed

Regarding training process #50

wonderingalways opened this issue Jun 12, 2021 · 4 comments

Comments

@wonderingalways
Copy link

Hello Omar,
Thanks for open sourcing the code for amazing work.

This is not really a issue more like a doubt.

  1. It was mentioned in the paper for MSMACRO that it was trained for 200k iterations with batch size of 32 to approximately reproduce the results., so effectively trained for 6.4 million triplets. So this means it was not trained on the full triples.small.tsv (39 mill points). Is my understanding on this correct?

  2. I am trying to training on MSMACRO triples. During training individual current batch loss is decreasing only for the initial few steps and oscillating for later iterations. Did u face the same issue while training ? Should it be viewed as model is not getting trained or since it seeing the new examples every batch it is expected this way?

image

@okhat
Copy link
Collaborator

okhat commented Jun 12, 2021

Yes, no one trains on the full triples! They just continue to sample more negatives.

It doesn't matter what the per-batch loss is like. How is the averaged or smoothed loss? That should continue to decrease in a slowing manner.

@wonderingalways
Copy link
Author

Hey Thanks for the reply.

  1. okay that's good to know. So for the results mentioned in the paper what kind of strategy is used for sampling negatives during training?

2.Yes the average loss is decreasing in a slowing manner

@okhat
Copy link
Collaborator

okhat commented Jun 13, 2021

For the paper, we simply train on the first N triples in the small triples file.

@wonderingalways
Copy link
Author

ohh okay. Thats clears the doubts.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants