about reproduce performance #2

Closed
wanboyang opened this issue Oct 24, 2020 · 7 comments

Comments

@wanboyang

Thanks for open-sourcing the code.
The provided pretrained model achieves the results claimed in your paper, but when I train the model myself on the Flickr dataset, I fail to reproduce them.

@wanboyang
Author

I used the default settings and the features provided in this project. The result on the Flickr test split is:
[image: scan_flicker]
SCST training starts at about 30000 iterations. Interestingly, the CIDEr score drops after that and always stays below 0.60.

@wanboyang
Author

Should I do something else to reproduce the results achieved by the pretrained models?

@YuanEZhou
Owner

It seems that you only did the first stage of training (cross-entropy loss). You should continue with the second stage of training (self-critical loss).
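
For readers unfamiliar with the two stages, here is a minimal sketch of how the two losses differ for a single caption. It is not this repository's code: the function names and arguments are hypothetical, and the rewards are assumed to be sentence-level scores such as CIDEr, with the greedy-decoded caption used as the baseline.

```python
import math


def cross_entropy_loss(target_logprobs):
    """Stage 1 (illustrative): mean negative log-likelihood of the
    ground-truth caption tokens under the model."""
    return -sum(target_logprobs) / len(target_logprobs)


def self_critical_loss(sample_logprobs, sample_reward, greedy_reward):
    """Stage 2 (illustrative): REINFORCE-style loss with the greedy
    caption's reward as the baseline.

    sample_logprobs: log-probabilities of the tokens in a sampled caption
    sample_reward:   score (e.g. CIDEr) of the sampled caption
    greedy_reward:   score of the greedy-decoded caption (the baseline)
    """
    advantage = sample_reward - greedy_reward
    # Minimizing this raises the probability of samples that beat the
    # greedy baseline and lowers it for samples that score worse.
    return -advantage * sum(sample_logprobs)


if __name__ == "__main__":
    logprobs = [math.log(0.5), math.log(0.5)]
    print(cross_entropy_loss(logprobs))
    print(self_critical_loss(logprobs, sample_reward=0.8, greedy_reward=0.6))
```

In a real training loop the second loss would only be switched on after the cross-entropy stage has produced a reasonable model, which matches the "SCST starts at about 30000 iterations" setting mentioned above.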

@wanboyang
Author

[image: QQ图片20201026141506]
I completed the whole training process, and the CIDEr score is always below 0.62.

@YuanEZhou
Owner

Strange! Maybe some hyperparameters were not set correctly in the cleaned version. I will check it later.

@YuanEZhou
Owner

Sorry for the late reply. For better performance on CIDEr, you should comment out line 12 and uncomment line 11.

@wanboyang
Author

Thanks for replying.
