CMCAN

Implementation of our AAAI2022 paper, Show Your Faith: Cross-Modal Confidence-Aware Network for Image-Text Matching.

Data

vocab is available here: vocab.

ims_bbx.npy, ims_size.npy, precaps_stan.txt for Flickr30K: f30k_precomp.

ims_bbx.npy, ims_size.npy, precaps_stan.txt for MSCOCO: coco_precomp.

ims_dir_selfadj{4, 8, 12}.npy can be created by running the adj.py under directory ./context_extractor/.

Trained Model

The model trained on the Flickr30K dataset is available here: checkpoint_f30k_c499.3.pth.

Note

Any questions please contact huatianzhang@mail.ustc.edu.cn for immediate reply. Thanks.

Name		Name	Last commit message	Last commit date
Latest commit History 41 Commits
__pycache__		__pycache__
context_extractor		context_extractor
envs		envs
log		log
README.md		README.md
data.py		data.py
evaluation.py		evaluation.py
model.py		model.py
opts.py		opts.py
train.py		train.py
train.sh		train.sh
vocab.py		vocab.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

pycache

pycache

context_extractor

context_extractor

envs

envs

log

log

README.md

README.md

data.py

data.py

evaluation.py

evaluation.py

model.py

model.py

opts.py

opts.py

train.py

train.py

train.sh

train.sh

vocab.py

vocab.py

Repository files navigation

CMCAN

Data

Trained Model

Note

About

Releases

Packages

Contributors 2

Languages

CrossmodalGroup/CMCAN

Folders and files

Latest commit

History

Repository files navigation

CMCAN

Data

Trained Model

Note

About

Resources

Stars

Watchers

Forks

Languages