Some questions for the paper #11

Closed
wanng-ide opened this issue Jan 19, 2022 · 17 comments
@wanng-ide

What is the difference between the scores in Table 5 and Table 8?
Table 5 reports 77.19 on the VQAv2 test-dev set, while Table 8 reports 77.68 on the same set.

@zdou0830
Owner

See #6. Thanks!

@wanng-ide
Author

@zdou0830 Thanks!

I have another question about the code.

I tried fine-tuning your pretrained model on VQA v2 with the default settings. However, the val score is only around 72.55.

It should be more than 80.

Could you share your experiment settings for fine-tuning VQA v2 and for the pretraining tasks?

@zdou0830
Owner

The command should be

```
python run.py with data_root=$DATA_DIR num_gpus=8 num_nodes=1 task_finetune_vqa_clip_bert per_gpu_batchsize=4 clip16 text_roberta image_size=576 clip_randaug load_path=meter_clip16_288_roberta_pretrain.ckpt
```

Hope this thread also helps with #7.

@wanng-ide
Author

@zdou0830

This setting is the same as mine ...

@wanng-ide
Author

If I use your finetuned model to run the test-only task on VQA v2, the result is 77.66, so that model works.

@wanng-ide
Author

(screenshot attached)

@zdou0830
Owner

You can try testing the last checkpoint and submitting the resulting JSON file to EvalAI.
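
(Side note, not from this thread: the VQA v2 track on EvalAI expects a JSON list of {"question_id": int, "answer": str} entries. A quick sanity check of the generated file before submitting could look like the sketch below; the filename vqa_submit.json is a hypothetical placeholder, not the name METER necessarily writes.)

```python
import json

# Hypothetical output filename; use whatever the test-only run actually writes.
with open("vqa_submit.json") as f:
    preds = json.load(f)

# EvalAI's VQA v2 track expects a list of {"question_id": int, "answer": str}.
assert isinstance(preds, list) and len(preds) > 0
for p in preds[:5]:
    assert set(p) == {"question_id", "answer"}
    print(p["question_id"], "->", p["answer"])
print(f"{len(preds)} predictions total")
```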

@wanng-ide
Author

The result from EvalAI is 71.53.
Should I fine-tune the model for more steps?

@zdou0830
Owner

The VQA dataset can be downloaded here: https://drive.google.com/file/d/1qT7YWHpLg-fAL43daKlOsYx2EbbQk--d/view?usp=sharing.

The training command is

```
python run.py with data_root=$DATA_DIR num_gpus=8 num_nodes=1 task_finetune_vqa_clip_bert per_gpu_batchsize=4 clip16 text_roberta image_size=576 clip_randaug load_path=meter_clip16_288_roberta_pretrain.ckpt
```

The testing command is

```
python run.py with data_root=$DATA_DIR num_gpus=8 num_nodes=1 test_only=True task_finetune_vqa_clip_bert per_gpu_batchsize=4 clip16 text_roberta image_size=576 load_path=last.ckpt
```

The provided VQA-finetuned checkpoint was trained this way, so if you follow these steps correctly, you should be able to get a score of ~77.6 on test-dev. I didn't look at the dev scores, and the number of training epochs was set to 10 as in config.py. For reference, the fine-tuning took about 2 days on 8 V100s.
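
One thing worth double-checking when comparing runs on different node counts is the effective batch size. In ViLT-style codebases (which METER builds on), the config usually specifies a target batch_size and the gradient accumulation steps are derived from per_gpu_batchsize, num_gpus, and num_nodes. The sketch below assumes that scheme and uses a hypothetical batch_size of 512, so treat the numbers as illustrative rather than METER's actual config.

```python
# Sketch of the usual ViLT-style derivation of gradient accumulation steps.
# batch_size=512 is a hypothetical target, not read from METER's config.py.
def grad_accum_steps(batch_size, per_gpu_batchsize, num_gpus, num_nodes):
    # The trainer accumulates gradients until
    # per_gpu_batchsize * num_gpus * num_nodes * steps reaches the target batch_size.
    return max(batch_size // (per_gpu_batchsize * num_gpus * num_nodes), 1)

for num_nodes in (1, 2):
    steps = grad_accum_steps(512, per_gpu_batchsize=4, num_gpus=8, num_nodes=num_nodes)
    effective = 4 * 8 * num_nodes * steps
    print(f"{num_nodes} node(s): accumulate {steps} steps -> effective batch {effective}")
```

If the effective batch size comes out identical for one and two nodes, the gap reported later in this thread is more likely a launcher or communication issue than a hyperparameter difference.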

@wanng-ide
Author

OK, I will give it a try! Thank you for your patience.

@wanng-ide
Author

I found a problem.
If I fine-tune the pretrained model with only one node, the result is better than with two nodes (by around 5% on VQA v2 val).

That might be the reason.

Could you share your pretraining log?
I will pretrain the model on two nodes, and I want to understand the difference between one node and two nodes.

@zdou0830
Owner

I didn't save the logs, but I did pre-train the models with 1/2/4 nodes and there were no significant differences, so I'd suggest you debug your multi-node training settings.
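
(A generic way to debug this, not part of METER: run a bare torch.distributed all-reduce check with the same launcher and node allocation used for training. If the world size or the reduced value comes out wrong, the problem is in the cluster setup rather than in the training code. The script name ddp_check.py and the torchrun invocation in the comment are placeholders.)

```python
import os
import socket
import torch
import torch.distributed as dist

# Generic multi-node sanity check, unrelated to METER's own training code.
# Launch with the same launcher/allocation used for training, e.g.:
#   torchrun --nnodes=2 --nproc_per_node=8 \
#            --rdzv_backend=c10d --rdzv_endpoint=$MASTER_ADDR:29500 ddp_check.py
dist.init_process_group(backend="nccl" if torch.cuda.is_available() else "gloo")
rank, world_size = dist.get_rank(), dist.get_world_size()

device = "cpu"
if torch.cuda.is_available():
    local_rank = int(os.environ.get("LOCAL_RANK", 0))
    torch.cuda.set_device(local_rank)
    device = f"cuda:{local_rank}"

# Each rank contributes 1; after all_reduce every rank should print world_size.
x = torch.ones(1, device=device)
dist.all_reduce(x, op=dist.ReduceOp.SUM)
print(f"rank {rank}/{world_size} on {socket.gethostname()}: all_reduce -> {x.item()}")

dist.destroy_process_group()
```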

@jiyt17

jiyt17 commented Jan 22, 2022

I also ran into the same problem, which probably results from the multi-node training settings. I use Slurm for multi-node training. Could you share your training bash file, if you also use Slurm?

@zdou0830
Owner

I didn't use Slurm, but I uploaded the running file for distributed training on Microsoft machines (https://github.com/zdou0830/METER/blob/main/azure_distributed_run.py). Not sure if this is helpful.
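
(For the Slurm case specifically, the usual pattern is to translate Slurm's environment variables into the MASTER_ADDR / MASTER_PORT / NODE_RANK variables that PyTorch Lightning-style launchers read before starting run.py. The sketch below is that generic plumbing, not METER's launcher; the port number and the srun invocation in the final comment are assumptions.)

```python
import os
import subprocess

# Generic Slurm-to-DDP env plumbing; this is not METER's own launcher.
# Slurm exposes the allocated nodes and this process's node index:
nodelist = os.environ["SLURM_JOB_NODELIST"]    # e.g. "node[01-02]"
node_rank = os.environ.get("SLURM_NODEID", "0")

# Expand the compressed nodelist and use the first host as the rendezvous point.
master = subprocess.check_output(
    ["scontrol", "show", "hostnames", nodelist], text=True
).splitlines()[0]

os.environ["MASTER_ADDR"] = master
os.environ.setdefault("MASTER_PORT", "29500")  # arbitrary free port (assumption)
os.environ["NODE_RANK"] = node_rank
print(f"MASTER_ADDR={master} NODE_RANK={node_rank}")

# The training command from this thread would then be launched from here,
# one task per node, e.g. via: srun --ntasks-per-node=1 python run.py with ...
```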

@jiyt17

jiyt17 commented Jan 23, 2022

OK, thank you!

@mactavish91

> I also ran into the same problem, which probably results from the multi-node training settings. I use Slurm for multi-node training. Could you share your training bash file, if you also use Slurm?

@jiyt17 Hello, have you solved the problems you encountered before?

@mactavish91

> @zdou0830 Thanks!
>
> I have another question about the code.
>
> I tried fine-tuning your pretrained model on VQA v2 with the default settings. However, the val score is only around 72.55.
>
> It should be more than 80.
>
> Could you share your experiment settings for fine-tuning VQA v2 and for the pretraining tasks?

@wanng-ide Hello, have you solved the problems you encountered before?
