ReRe

Official code for ReRe: Retrieval-Augmented Natural Language Reasoning for Explainable Visual Question Answering

Accepted at the IEEE ICIP 2024 workshop: Integrating Image Processing with Large-Scale Language/Vision Models for Advanced Visual Understanding

Paper: https://arxiv.org/abs/2408.17006

Requirements

PyTorch 2.1.2
CLIP (install with: pip install git+https://github.com/openai/CLIP.git)
transformers (install with: pip install transformers)
accelerate==0.26.1
evaluate==0.4.1
torchvision==0.16.2
torchmetrics==1.3.0
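
Before training, it can help to confirm that the installed packages match the pins above. The following is a minimal sketch, not part of the repository: the helper name `check_versions` is hypothetical, and it assumes "PyTorch 2.1.2" corresponds to the pip distribution `torch`. CLIP and transformers are left out because no version is pinned for them.

```python
from importlib import metadata

# Pins copied from the requirements list; "torch" is assumed to be the
# distribution name behind "PyTorch 2.1.2".
PINNED = {
    "torch": "2.1.2",
    "torchvision": "0.16.2",
    "torchmetrics": "1.3.0",
    "accelerate": "0.26.1",
    "evaluate": "0.4.1",
}


def check_versions(pinned, get_version=metadata.version):
    """Return {package: (expected, found)} for each missing or mismatched package."""
    problems = {}
    for name, expected in pinned.items():
        try:
            found = get_version(name)
        except metadata.PackageNotFoundError:
            found = None
        # CUDA builds report versions such as "2.1.2+cu121";
        # compare only the part before the "+" local tag.
        if found is None or found.split("+")[0] != expected:
            problems[name] = (expected, found)
    return problems


if __name__ == "__main__":
    for name, (expected, found) in check_versions(PINNED).items():
        print(f"{name}: expected {expected}, found {found or 'not installed'}")
```

An empty result means every pinned package is present at the listed version.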

Datasets Download

Download the images into /local_datasets/vqax.

Images for VQA-X: COCO train2014 and val2014 images
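
After downloading, a quick check that the images landed where expected can save a failed training run. This is only a sketch under stated assumptions: the helper `count_images` is hypothetical, and the subfolder names `train2014` and `val2014` follow the usual COCO archive naming, which the repository's data loader may or may not expect.

```python
from pathlib import Path


def count_images(root, subfolders=("train2014", "val2014"), suffix=".jpg"):
    """Return {subfolder: image count}, with None for a missing subfolder.

    The subfolder names are an assumption based on the COCO archive names.
    """
    root = Path(root)
    counts = {}
    for name in subfolders:
        folder = root / name
        if folder.is_dir():
            # Count only files whose extension matches, ignoring case.
            counts[name] = sum(
                1 for p in folder.iterdir() if p.suffix.lower() == suffix
            )
        else:
            counts[name] = None
    return counts


if __name__ == "__main__":
    for name, n in count_images("/local_datasets/vqax").items():
        print(f"{name}: {'missing' if n is None else f'{n} images'}")
```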

Pretrained Model Download

Download the distilled GPT-2 model to '/local_datasets/vqax/'. This model was pretrained on image captioning.

The model and tokenizer are in the drive.

Code

Train the model with the command below. The model's results are saved to the 'result' folder after every epoch.

python ReRe.py

Evaluation

To evaluate ReRe, we use CIDEr, BLEU, METEOR, ROUGE, and BERTScore to measure the quality of the explanations the model generates. These metrics are widely used in the NLE task. For accuracy, an answer is counted as correct if the output answer is among the ground-truth (GT) answers. To see the fine-tuned model's scores, run the command below.

python evaluation.py
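
The accuracy rule described above (a prediction is correct when it appears among the GT answers) can be illustrated with a short sketch. This is not the code in evaluation.py: the function names are hypothetical, and the lowercase/whitespace normalization is an assumption added so that trivially different strings compare equal.

```python
def normalize(answer):
    """Assumed normalization: strip surrounding whitespace and lowercase."""
    return answer.strip().lower()


def answer_accuracy(predictions, gt_answers):
    """Fraction of predictions found among their ground-truth answers.

    predictions: list of predicted answer strings.
    gt_answers:  list of lists of GT answer strings, aligned by index.
    """
    if len(predictions) != len(gt_answers):
        raise ValueError("predictions and gt_answers must have the same length")
    if not predictions:
        return 0.0
    correct = 0
    for pred, gts in zip(predictions, gt_answers):
        # A prediction counts as correct if it matches any GT answer.
        if normalize(pred) in {normalize(g) for g in gts}:
            correct += 1
    return correct / len(predictions)


if __name__ == "__main__":
    preds = ["yes", "2", "Red"]
    gts = [["yes", "yes", "no"], ["3", "two"], ["red", "maroon"]]
    print(f"accuracy: {answer_accuracy(preds, gts):.3f}")
```

In this example two of the three predictions appear among their GT answers, so the accuracy is 2/3.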
