Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Would you please share the checkpoint and its corresponding hyperparameters with me? #2

Closed
mengfeidu opened this issue May 1, 2023 · 1 comment

Comments

@mengfeidu
Copy link

Hello, thank you for your excellent work. I am currently attempting to implement your model in my local environment. However, I am facing difficulties in achieving the performance of the R2R test dataset displayed in your paper. While the performance in Val Unseen is similar to what you reported, it is notably lower in the test dataset. I was wondering if it would be possible for you to share the checkpoint of your best model and its corresponding hyperparameters with me. Thank you in advance!

@cwhao98
Copy link
Owner

cwhao98 commented May 6, 2023

Hi,

Thanks for your interest. Previously we conducted the experiments in a foreign server which is not available now. Sorry for the inconvenience. But we have two suggestions for your issue.

  • The performance maybe affected by different local environments. We use a language auxiliary loss to regularize the attention weight. Empirically, we find the loss weight (--LWeight and --AWeight) will make the performance a bit change when ranging from 0.5 to 2.0.

  • The agent takes a long time to converge, and sufficient training steps may be helpful. The machine we used has the limitation of two days in one trial, so we often train the model in a "resume" manner.

Thanks again.

@cwhao98 cwhao98 closed this as completed Jun 14, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants