We recommend the following dependencies.
```python
import nltk
nltk.download('punkt')  # Punkt tokenizer models
```
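As a quick check that `punkt` is available, you can tokenize a sample caption with NLTK's word tokenizer, which relies on the Punkt models; the sentence below is only an example, not taken from the dataset:

```python
from nltk.tokenize import word_tokenize

# word_tokenize uses the Punkt models downloaded above
caption = "A man riding a wave on top of a surfboard."
print(word_tokenize(caption.lower()))
# ['a', 'man', 'riding', 'a', 'wave', 'on', 'top', 'of', 'a', 'surfboard', '.']
```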
Download the dataset files. We use the splits produced by Andrej Karpathy. The raw images can be downloaded from their original sources here, here, and here.
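If you work from the raw images, the Karpathy splits are commonly distributed as one JSON file per dataset. The filename and fields below (e.g. `dataset_coco.json`, `split`, `sentences`) are assumptions based on that common format, not files shipped with this repository:

```python
import json

# Hypothetical path and layout of the Karpathy split file.
with open("dataset_coco.json") as f:
    data = json.load(f)

train_imgs = [img for img in data["images"] if img["split"] == "train"]
print(len(train_imgs), "training images")
print(train_imgs[0]["filename"], "|", train_imgs[0]["sentences"][0]["raw"])
```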
The precomputed image features are extracted from the raw images with the bottom-up attention model from here. The image features for the training, validation, and test sets should each be merged, in order, into a single `.npy` file. More details about the image feature extraction can also be found in SCAN (https://github.com/kuanghuei/SCAN).
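A minimal sketch of that merge step, assuming the bottom-up attention extractor wrote one feature array per image (the per-image filenames, directory names, and output path below are hypothetical):

```python
import numpy as np

def merge_features(image_ids, feature_dir, out_path):
    # Stack per-image region features, in the split's image order, into one
    # array of shape (num_images, num_regions, feat_dim) and save it as .npy.
    feats = [np.load(f"{feature_dir}/{img_id}.npy") for img_id in image_ids]
    np.save(out_path, np.stack(feats, axis=0))

# One merged file per split, e.g.:
# merge_features(train_ids, "bottom_up_features", "train_ims.npy")
```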
The data files can be found in SCAN (we use the same dataset splits as theirs):
```bash
wget https://scanproject.blob.core.windows.net/scan-data/data_no_feature.zip
```
Place `data_no_feature.zip` in the `data` directory.
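If the archive also needs to be unpacked under `data` (an assumption; adjust to the layout your setup expects), a Python equivalent of the usual `unzip` step is:

```python
import os
import zipfile

# Extract data_no_feature.zip into the data directory (assumed layout).
os.makedirs("data", exist_ok=True)
with zipfile.ZipFile("data_no_feature.zip") as zf:
    zf.extractall("data")
print(sorted(os.listdir("data")))
```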
Then run the tuning script for MS-COCO:

```bash
./script/tune_coco.sh
```