Visual features: For the off-the-shelf image feature extractor, please follow this repo: https://github.com/USC-MCL/Project_Demo
Given the visual features, you can load them in the BertHop notebook and run training and evaluation.
The code was originally built upon the following GitHub repo: https://github.com/YIKUAN8/Transformers-VQA.git