Skip to content

Think and Tell - image captioning from brain data

Notifications You must be signed in to change notification settings

seang123/Masters-Thesis

Repository files navigation

Masters Thesis

Think and Tell - image captioning from brain data


Links

  • Im2txt

  • NeuralTalk2

  • Another implementation

  • Caffe Model Zoo

    • Provides ResNet pre-trained networks that "won the 1st places in: ImageNet classification, ImageNet detection, ImageNet localization, COCO detection, and COCO segmentation", which could be good for the CNN layer.

Plan

  1. Implement the CNN-RNN architecture from the "Show and Tell" paper, download the pre-trained weights so save time if possible.
  • test it on the mscoc dataset
  • convert it to work with the fMRI data
  1. Add an attention mechanism
  2. Use a transformer network

TODO:

CNN-RNN network

  1. Create separate test set, to check for overfitting

About

Think and Tell - image captioning from brain data

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published