You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I would greatly value your assistance in offering guidance for initiating pre-training/fine-tuning on the Visual Question Answering (VQA) task, specifically in the following aspects:
The necessary format for the required dataset.
Minimum hardware requirements for its execution.
Please note that while this question might be straightforward and potentially addressed by reviewing the model documentation, I am seeking an expert opinion on this matter.
Thank you sincerely.
The text was updated successfully, but these errors were encountered:
Hello, we have released VQA checkpoints in this repo, you can try it out first to see if it works within your needs. Otherwise, you should just follow the instructions in the documentation, i.e. getting the expert labels ready and modify the training config scripts.
I would greatly value your assistance in offering guidance for initiating pre-training/fine-tuning on the Visual Question Answering (VQA) task, specifically in the following aspects:
Please note that while this question might be straightforward and potentially addressed by reviewing the model documentation, I am seeking an expert opinion on this matter.
Thank you sincerely.
The text was updated successfully, but these errors were encountered: