About reproducing performance #2
Comments
Should I do something extra to reproduce the results achieved by the pretrained models?
It seems that you only ran the first stage of training (cross-entropy loss). You should continue with the second stage of training (self-critical loss).
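For readers unfamiliar with the second stage, here is a minimal sketch of the general self-critical idea, not this repository's actual code. It assumes the usual formulation: the reward of a sampled caption is compared against the reward of the greedy-decoded caption, and the difference weights the sampled caption's log-probability. The function name `self_critical_loss` and its arguments are hypothetical. In a real training loop the log-probabilities would be framework tensors so that gradients flow through them, while the rewards are treated as constants.

```python
from typing import Sequence


def self_critical_loss(
    sample_logprobs: Sequence[float],
    sample_reward: float,
    greedy_reward: float,
) -> float:
    """Self-critical loss for a single sampled caption (illustrative only).

    sample_logprobs: per-token log-probabilities of the sampled caption.
    sample_reward:   reward (e.g. a caption metric) of the sampled caption.
    greedy_reward:   reward of the greedy-decoded caption, used as baseline.
    """
    # Advantage: how much better the sample scored than the greedy baseline.
    advantage = sample_reward - greedy_reward
    # REINFORCE-style loss: minimizing it raises the probability of samples
    # that beat the baseline and lowers it for samples that fall short.
    return -advantage * sum(sample_logprobs)
```

With this formulation, a sample that scores the same as the greedy caption contributes zero loss, so only captions that differ from the baseline in reward affect the update.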
Strange! Maybe some hyperparameters were not set correctly in the cleaned version. I will check it later.
Thanks for replying.
Thanks for open-sourcing the code.
The provided pretrained model achieves the results claimed in your paper, but when I train a model on the Flickr dataset myself, I fail to reproduce those results.