GitHub - MicaTeo/knowingWhereToLook: Image Captioning using Adaptive Attention with PyTorch

PyTorch Implementation of Knowing When to Look: Adaptive Attention via a Visual Sentinal for Image Captioning Paper

Original Torch Implementation by Lu. et al can be found here

Dataset

I'm using the Flickr30k Dataset. You may download the images from here. If you wish to use the COCO Dataset, you will need to comment out 2 lines in the code.
I'm also using Karpathy's Train/Val/Test Split. You may download it from here.
You may also use the WORMAP.json file in the directory if you don't wish to create it again.

Files

preprocess.py Creates the WORDMAP.json file and the .h5 files
dataset.py Creates the custom dataset
util.py Functions to be used throught the code
models.py Defines the architectures
train_eval For Training and Evaluation
visualization.ipynb For Testing and Visualization

Testing

It's very simple! Place the test image in your directory, and name it as test.jpg, and then run the visualization.ipynbjupyter notebook file to get the results.

Results

The results of some validation and testing images of the Flickr30k from Karpathy's Split is shown below.

References

Thanks to @https://github.com/sgrvinod/a-PyTorch-Tutorial-to-Image-Captioning

Name		Name	Last commit message	Last commit date
Latest commit History 70 Commits
scripts		scripts
.gitignore		.gitignore
MLP_Final.pdf		MLP_Final.pdf
README.md		README.md
dataset.py		dataset.py
models.py		models.py
preprocess.py		preprocess.py
train_eval.py		train_eval.py
util.py		util.py
visualize.ipynb		visualize.ipynb
xception.py		xception.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

PyTorch Implementation of Knowing When to Look: Adaptive Attention via a Visual Sentinal for Image Captioning Paper

Dataset

Files

Testing

Results

References

About

Releases

Packages

Languages

MicaTeo/knowingWhereToLook

Folders and files

Latest commit

History

Repository files navigation

PyTorch Implementation of Knowing When to Look: Adaptive Attention via a Visual Sentinal for Image Captioning Paper

Dataset

Files

Testing

Results

References

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages