This repository has been archived by the owner on Dec 29, 2022. It is now read-only.

Support Image Captioning (and other tasks) #12

Closed
dennybritz opened this issue Mar 4, 2017 · 1 comment

dennybritz commented Mar 4, 2017

In theory, supporting Image Captioning should be as easy as swapping out the encoder for an image network such as ResNet or Inception (e.g. tensorflow.contrib.slim.python.slim.nets.inception_v3). In practice, a few things need to happen to support problems other than text-to-text.
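For concreteness, here is a rough sketch of what such an encoder swap might look like; the `encode_images` function and the flattening of the InceptionV3 feature grid into a 64-step "sequence" are illustrative assumptions, not existing code in this repo:

```python
# A sketch only: encode_images and the reshape convention are assumptions.
import tensorflow as tf
from tensorflow.contrib import slim
from tensorflow.contrib.slim.python.slim.nets import inception_v3


def encode_images(images):
  """Encodes a batch of images into feature vectors a decoder can attend to.

  Args:
    images: float tensor of shape [batch, 299, 299, 3].
  Returns:
    A [batch, 64, 2048] tensor: the 8x8 InceptionV3 feature grid,
    flattened so each spatial position acts like one encoder time step.
  """
  with slim.arg_scope(inception_v3.inception_v3_arg_scope()):
    net, _ = inception_v3.inception_v3_base(images)  # [batch, 8, 8, 2048]
  batch_size = tf.shape(images)[0]
  return tf.reshape(net, [batch_size, 64, 2048])
```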

  • Currently, the parameters to the train/inference scripts are specific to text sequence-to-sequence, e.g. source_vocabulary, source_delimiter, etc. We probably need another abstraction layer that defines what kind of task the user is solving and adjusts flags/parameters based on it. For example, I could imagine a Task class with TextToText, ImageToText, ... subclasses. The user would then pass the task type as part of the config, and the task class would be responsible for setting the appropriate parameters and creating the model (see the first sketch after this list).
  • Support for pre-trained networks. For example, when training image captioning models one typically initializes the encoder with the weights of a pre-trained image classification network. This can probably be done through some kind of SessionRunHook that loads a subset of the variables (see the second sketch below). In other words, the hooks used in the training script must be configurable.
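A minimal sketch of the proposed Task abstraction; every class, method, and config key below is hypothetical design discussion, not an existing seq2seq API:

```python
# All names below are hypothetical; this is a design sketch, not repo code.

class Task(object):
  """A task knows which parameters it needs and how to build its model."""

  def default_params(self):
    raise NotImplementedError()

  def create_model(self, params, mode):
    raise NotImplementedError()


class TextToText(Task):
  def default_params(self):
    # Text-specific flags move out of the training script and in here.
    return {"source_vocabulary": None, "source_delimiter": " "}


class ImageToText(Task):
  def default_params(self):
    # No source vocabulary or delimiter; image options instead.
    return {"image_size": 299, "encoder": "inception_v3"}


# The training script resolves the task from the user's config.
TASKS = {"text_to_text": TextToText, "image_to_text": ImageToText}

def make_task(config):
  return TASKS[config["task"]]()
```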
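And a sketch of the pre-trained-weights hook, built on the standard tf.train.SessionRunHook and tf.train.Saver APIs; the class name and the InceptionV3 scope are assumptions:

```python
import tensorflow as tf


class RestorePretrainedHook(tf.train.SessionRunHook):
  """Hypothetical hook that restores a subset of variables (e.g. the
  image encoder) from a pre-trained checkpoint after session creation."""

  def __init__(self, checkpoint_path, scope="InceptionV3"):
    self._checkpoint_path = checkpoint_path
    self._scope = scope
    self._saver = None

  def begin(self):
    # Only variables under the encoder scope are restored; everything
    # else keeps its default initialization.
    var_list = tf.get_collection(
        tf.GraphKeys.GLOBAL_VARIABLES, scope=self._scope)
    self._saver = tf.train.Saver(var_list=var_list)

  def after_create_session(self, session, coord):
    self._saver.restore(session, self._checkpoint_path)
```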
ayushidalmia commented

@dennybritz What is the status of this one?
