Would you please share the checkpoint and its corresponding hyperparameters with me? #2

mengfeidu · 2023-05-01T12:29:21Z

Hello, thank you for your excellent work. I am currently attempting to implement your model in my local environment. However, I am facing difficulties in achieving the performance of the R2R test dataset displayed in your paper. While the performance in Val Unseen is similar to what you reported, it is notably lower in the test dataset. I was wondering if it would be possible for you to share the checkpoint of your best model and its corresponding hyperparameters with me. Thank you in advance!

cwhao98 · 2023-05-06T09:11:15Z

Hi,

Thanks for your interest. Previously we conducted the experiments in a foreign server which is not available now. Sorry for the inconvenience. But we have two suggestions for your issue.

The performance maybe affected by different local environments. We use a language auxiliary loss to regularize the attention weight. Empirically, we find the loss weight (--LWeight and --AWeight) will make the performance a bit change when ranging from 0.5 to 2.0.
The agent takes a long time to converge, and sufficient training steps may be helpful. The machine we used has the limitation of two days in one trial, so we often train the model in a "resume" manner.

Thanks again.

cwhao98 closed this as completed Jun 14, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Would you please share the checkpoint and its corresponding hyperparameters with me? #2

Would you please share the checkpoint and its corresponding hyperparameters with me? #2

mengfeidu commented May 1, 2023

cwhao98 commented May 6, 2023

Would you please share the checkpoint and its corresponding hyperparameters with me? #2

Would you please share the checkpoint and its corresponding hyperparameters with me? #2

Comments

mengfeidu commented May 1, 2023

cwhao98 commented May 6, 2023