This repository contains the reference code for the paper *Partial Non-Autoregressive Image Captioning*.
- Python 3
- PyTorch (>1.0)
- torchvision
Annotations and detection features for the COCO dataset are required. Please download the annotations file `annotations.zip` and extract it.
Detection features are computed with the code provided by [1]. Please download the COCO features file `coco_detections.hdf5` (~53.5 GB), in which the detections of each image are stored under the `<image_id>_features` key. `<image_id>` is the id of each COCO image, without leading zeros (e.g. the `<image_id>` for `COCO_val2014_000000037209.jpg` is `37209`), and each value is a `(N, 2048)` tensor, where `N` is the number of detections.
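As a minimal sketch of the key layout described above, the features for a given image can be read with `h5py` along these lines (the helper name `load_detections` is hypothetical, not part of this repository):

```python
import h5py
import numpy as np
import torch

def load_detections(h5_path, image_id):
    """Return the (N, 2048) detection features for a COCO image id.

    Assumes the file stores each image's detections under the
    "<image_id>_features" key, with image_id written without leading zeros.
    """
    with h5py.File(h5_path, "r") as f:
        feats = f["%d_features" % image_id][()]  # NumPy array, shape (N, 2048)
    return torch.from_numpy(np.asarray(feats, dtype=np.float32))

# e.g. features for COCO_val2014_000000037209.jpg:
# feats = load_detections("coco_detections.hdf5", 37209)  # shape (N, 2048)
```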
This repository is built on top of the Meshed-Memory Transformer codebase.
[1] P. Anderson, X. He, C. Buehler, D. Teney, M. Johnson, S. Gould, and L. Zhang. Bottom-up and top-down attention for image captioning and visual question answering. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2018.