About the reproduction of VCR experiment results #8

Tclz · 2021-09-11T06:26:19Z

Hi，
Thanks for your great work!
When i use the following command to train a model, it seems can't reach the expected results in the paper.
horovodrun -np 1 python train_vcr_adv.py --config config/train-vcr-base-4gpu-adv.json \ --output_dir vcr/output_base
Only use one GPU，I got these results
100%|##########| 8000/8000 [4:58:12<00:00, 1.98s/it][1,0]<stderr>:09/10/2021 08:48:59 - INFO - __main__ - ============Step 8000============= [1,0]<stderr>:09/10/2021 08:48:59 - INFO - __main__ - 1280000 examples trained at 71 ex/s [1,0]<stderr>:09/10/2021 08:48:59 - INFO - __main__ - =========================================== [1,0]<stderr>: [1,0]<stderr>:09/10/2021 08:48:59 - INFO - __main__ - start running validation... [1,0]<stderr>: [[[[1,0]<stderr>:09/10/2021 08:54:06 - INFO - __main__ - validation finished in 307 seconds, score_qa: 72.28 score_qar: 75.06 score: 54.35

I am confused that this result is a few percentage points different from the one mentioned in the paper.
What should i do? Thanks in advance!!!

The text was updated successfully, but these errors were encountered:

zhegan27 · 2021-09-13T23:57:01Z

Thanks for your interest in the code. In the leaderboard (https://visualcommonsense.com/leaderboard/), if you see Entry #18, there seems to be a third party running this code to reproduce the results.

Since you only have 1 gpu, please change the hyper-parameter "gradient_accumulation_steps" from 5 to 20, and have a try again. Let me know how this goes, and hopefully the performance can catch up.

Tclz · 2021-09-14T02:39:22Z

Thanks for your interest in the code. In the leaderboard (https://visualcommonsense.com/leaderboard/), if you see Entry #18, there seems to be a third party running this code to reproduce the results.

Since you only have 1 gpu, please change the hyper-parameter "gradient_accumulation_steps" from 5 to 20, and have a try again. Let me know how this goes, and hopefully the performance can catch up.

Thanks for your reply! I am trying to modify the configuration file and re-experiment, after which I will feed back the results.

Tclz · 2021-09-16T07:30:38Z

Hi, this strategy worked. After I adjusted the hyper-parameter, when the training level reached 90%, I got the following results.
88%|########7 | 7000/8000 [15:35:25<2:10:08, 7.81s/it][1,0]:09/14/2021 18:04:58 - INFO - main - ============Step 7000=============
[1,0]:09/14/2021 18:04:58 - INFO - main - 4481024 examples trained at 79 ex/s
[1,0]:09/14/2021 18:04:58 - INFO - main - ===========================================
[1,0]: [1,0]:09/14/2021 18:04:58 - INFO - main - start running validation...
[1,0]: [[[[1,0]:09/14/2021 18:10:05 - INFO - main - validation finished in 307 seconds, score_qa: 75.16 score_qar: 78.46 score: 59.18 1,0]:
89%|########8 | 7100/8000 [15:53:25<1:57:14, 7.82s/it][1,0]:09/14/2021 18:22:58 - INFO - main - ============Step 7100=============
Thanks for your solution!

zhegan27 closed this as completed Sep 17, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

About the reproduction of VCR experiment results #8

About the reproduction of VCR experiment results #8

Tclz commented Sep 11, 2021 •

edited

zhegan27 commented Sep 13, 2021

Tclz commented Sep 14, 2021

Tclz commented Sep 16, 2021

About the reproduction of VCR experiment results #8

About the reproduction of VCR experiment results #8

Comments

Tclz commented Sep 11, 2021 • edited

zhegan27 commented Sep 13, 2021

Tclz commented Sep 14, 2021

Tclz commented Sep 16, 2021

Tclz commented Sep 11, 2021 •

edited