
Code breaking when training TASK16 (FOIL) #67

Open
iacercalixto opened this issue Sep 22, 2020 · 2 comments

@iacercalixto

Hi,

I am trying to use this codebase to fine-tune any of the available pretrained models on the FOIL dataset, but I am running into problems. Is the code supposed to work when running the example from the README (the train_tasks.py script) with TASK16?

To reproduce the error, you can run:

python3 train_tasks.py --bert_model bert-base-uncased --from_pretrained pretrained_model.bin --config_file config/bert_base_6layer_6conect.json --tasks 16 --lr_scheduler 'warmup_linear' --train_iter_gap 4 --task_specific_tokens --save_name finetune_from_gcc

The error happens at line 363 in vilbert/task_utils.py: the sizes of the vil_binary_prediction and target tensors do not match. After much digging, I can make it work when training from scratch by changing a few files, but I would like to know whether this head (vil_binary_prediction) is supposed to work out-of-the-box.
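For concreteness, here is a minimal standalone sketch of the kind of size mismatch I mean (hypothetical shapes, not the repository's actual code):

```python
# Hypothetical shapes: the binary head emits two logits per example while
# the targets arrive as a flat column of 0/1 labels.
import torch
import torch.nn as nn

batch_size = 4
vil_binary_prediction = torch.randn(batch_size, 2)     # [batch, 2] logits
target = torch.randint(0, 2, (batch_size, 1)).float()  # [batch, 1] labels

# nn.BCEWithLogitsLoss()(vil_binary_prediction, target)  # ValueError: size mismatch

# With two logits per example, CrossEntropyLoss over integer class labels
# lines the shapes up:
loss = nn.CrossEntropyLoss()(vil_binary_prediction, target.squeeze(1).long())
print(loss.item())
```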

Thanks!

@vedanuj
Contributor

vedanuj commented Sep 25, 2020

I don't think we have tested Task16 support properly. Please feel free to open a PR with your changes.

@zongshenmu

How did you resolve the problem? I ran into the same situation with the NLVR task and suspect the problem is in vilbert.py.
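One guess at a shared cause (an assumption, not verified against vilbert.py): the binary head may score concatenated pooled features from pairs of inputs, halving the batch dimension, so a task that feeds single examples produces half as many predictions as targets. A sketch of that failure mode:

```python
import torch
import torch.nn as nn

hidden, batch = 8, 4
pooled = torch.randn(batch, hidden)  # one pooled vector per example

# Hypothetical pair-style binary head: scores concatenated feature pairs,
# so the output batch dimension is half the input batch dimension.
binary_head = nn.Linear(hidden * 2, 2)
vil_binary_prediction = binary_head(pooled.view(-1, hidden * 2))  # [2, 2]

target = torch.randint(0, 2, (batch,))  # [4]
print(vil_binary_prediction.size(0), target.size(0))  # 2 vs 4 -> loss size mismatch
```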
