
How to train OFA for VQA in open-ended? #123

Closed
qyc-98 opened this issue Jun 10, 2022 · 10 comments

qyc-98 commented Jun 10, 2022

Dear authors:
Thanks for the great work! During VQA validation, I would like the model to predict the most likely next token (i.e., generate one token of the answer) from the output logits, append that token to the input, and repeat until the model predicts ⟨EOS⟩. How can I achieve this? Thanks a lot!
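For reference, the loop described above (greedy, token-by-token decoding until ⟨EOS⟩) can be sketched in plain Python. Here `model_logits` is a toy stand-in for a real OFA forward pass, and the token ids are made up for illustration; this is a sketch of the idea, not OFA's actual inference code.

```python
EOS = 9  # assumed end-of-sequence token id (hypothetical)

def model_logits(tokens):
    # Toy stand-in for a model forward pass: given the tokens so far,
    # return one score per vocabulary entry. This fake model prefers
    # (last_token + 1) for three steps, then prefers EOS.
    vocab_size = 10
    logits = [0.0] * vocab_size
    if len(tokens) >= 4:
        logits[EOS] = 1.0
    else:
        logits[(tokens[-1] + 1) % vocab_size] = 1.0
    return logits

def greedy_decode(prompt_tokens, max_len=20):
    # Repeatedly take the argmax token, append it to the input,
    # and stop once EOS is produced (or max_len is reached).
    tokens = list(prompt_tokens)
    for _ in range(max_len):
        logits = model_logits(tokens)
        next_token = max(range(len(logits)), key=logits.__getitem__)
        tokens.append(next_token)
        if next_token == EOS:
            break
    return tokens
```

In a real setup, `model_logits` would be one forward pass of the finetuned model over the image features plus the question and the partial answer, and the loop would run on token ids from the model's own vocabulary.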


qyc-98 commented Jun 11, 2022

I would like to both train and validate in this manner. Thanks for your precious time!

yangapku (Member) commented

Hi, currently the VQA task code supports beam-search inference during validation and testing (as opposed to all-candidate inference; please refer to the README), but the finetuning objective is still constrained to a pre-defined candidate answer set stored in the trainval_ans2label.pkl file. We are working on adding a new config to support unconstrained finetuning, which does not need a pre-defined candidate answer set. The code update is still under testing and will be merged this week.
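To illustrate the distinction (this is not OFA's actual code): constrained inference scores only the answers in a pre-defined candidate set, such as the one stored in trainval_ans2label.pkl, while open-ended generation may produce any token sequence. A minimal sketch with a hypothetical two-answer candidate set:

```python
import math

def softmax(xs):
    # Numerically stable softmax over a list of logits.
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]

def constrained_answer(logits, ans2label):
    # All-candidate style selection: only pre-defined answers are
    # considered, and the highest-probability candidate wins.
    probs = softmax(logits)
    return max(ans2label, key=lambda ans: probs[ans2label[ans]])

logits = [0.1, 2.0, 0.5, 3.0]          # toy model output
ans2label = {"yes": 0, "no": 2}        # hypothetical candidate set
```

Note that even though index 3 has the highest logit overall, constrained selection ignores it because it is not in the candidate set; removing that constraint is exactly what the unconstrained finetuning config is for.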


ilovecv commented Jul 7, 2022

@yangapku Hi, any updates on this? Thanks!

yangapku (Member) commented

Hi, pull request #124, which adds a new config to enable unconstrained finetuning, has been proposed recently. However, bugs remain in this PR that result in a zero score during evaluation. We are still working on making it function correctly and will merge it as soon as possible.


qyc-98 commented Jul 26, 2022

Hi,
Thanks for your great work!

RishabhMaheshwary commented

Any update on this?


yangapku commented Sep 21, 2022

@qyc-98 @RishabhMaheshwary @ilovecv Hi, we have found the bug and fixed it! The latest codebase now supports open-ended (unconstrained) VQA finetuning and evaluation. Please pull the latest code and refer to PR #124 and run_scripts/vqa/train_vqa_distributed.sh (lines 62-68) for how to activate it!

leng-yue commented

Hi, are there any performance data for the open-ended VQA fine-tuning?

yangapku (Member) commented

@leng-yue We have tested open-ended VQA finetuning on OFA-base (without using EMA). It achieves a score of 76.4 on our VQA validation set. This can likely be improved further by using EMA and additional hyperparameter tuning.
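For readers unfamiliar with the EMA trick mentioned above: it keeps an exponentially decayed running average of the model weights and evaluates with the averaged copy, which often smooths out noise from the last few optimizer steps. A minimal sketch with plain Python floats standing in for parameter tensors (not OFA's actual implementation):

```python
def ema_update(ema_params, params, decay=0.999):
    # One EMA step: blend the running average toward the current
    # weights. With decay close to 1, the average changes slowly.
    return [decay * e + (1 - decay) * p for e, p in zip(ema_params, params)]

# Typical usage: after each training step, refresh the shadow copy,
# then evaluate/checkpoint using ema_weights instead of weights.
ema_weights = [1.0, -2.0]            # shadow copy (toy values)
weights = [0.0, 0.0]                 # current model weights (toy values)
ema_weights = ema_update(ema_weights, weights, decay=0.9)
```

In a real PyTorch training loop the same update would run over every parameter tensor after each optimizer step.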

leng-yue commented

Thanks for your response, the result looks good :)
