Name		Name	Last commit message	Last commit date
parent directory ..
png		png
README.md		README.md
create_input_files.py		create_input_files.py
dataset.py		dataset.py
download.sh		download.sh
model.py		model.py
sample.py		sample.py
train.py		train.py

README.md

Image Captioning

The goal of image captioning is to convert a given input image into a natural language description. The encoder-decoder framework is widely used for this task. The image encoder is a convolutional neural network (CNN). In this tutorial, we used resnet-152 model pretrained on the ILSVRC-2012-CLS image classification dataset. The decoder is a long short-term memory (LSTM) network.

Usage

1. Download the COCO-2014 dataset

2. Preprocessing

python create_input_files

3. Train the model

python train.py

4. Test the model

python sample.py --image='png/example.png'

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

image_captioning

image_captioning

README.md

Image Captioning

Usage

1. Download the COCO-2014 dataset

2. Preprocessing

3. Train the model

4. Test the model

Files

image_captioning

Directory actions

More options

Directory actions

More options

Latest commit

History

image_captioning

Folders and files

parent directory

README.md

Image Captioning

Usage

1. Download the COCO-2014 dataset

2. Preprocessing

3. Train the model

4. Test the model