This project extends the SENT2IMG application: an attention mechanism is introduced to generate more precise captions, and the Okapi BM25 algorithm is used to rank the captions.
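To illustrate the ranking step, here is a minimal sketch of Okapi BM25 scoring over a set of stored captions. It is not taken from the repository: the function names `tokenize` and `bm25_rank`, the whitespace tokenizer, the parameter defaults `k1=1.5` and `b=0.75`, and the non-negative IDF variant are all assumptions chosen for illustration.

```python
import math
from collections import Counter


def tokenize(text):
    # Assumption: simple lowercase whitespace tokenization.
    return text.lower().split()


def bm25_rank(query, captions, k1=1.5, b=0.75):
    """Rank captions against a query with Okapi BM25.

    captions maps an image name to its caption string (hypothetical layout).
    Returns a list of (image_name, score) pairs, best match first.
    """
    if not captions:
        return []

    docs = {name: tokenize(text) for name, text in captions.items()}
    n_docs = len(docs)
    avg_len = sum(len(tokens) for tokens in docs.values()) / n_docs

    # Document frequency: in how many captions each term appears.
    doc_freq = Counter()
    for tokens in docs.values():
        doc_freq.update(set(tokens))

    query_terms = tokenize(query)
    scores = {}
    for name, tokens in docs.items():
        term_freq = Counter(tokens)
        score = 0.0
        for term in query_terms:
            if term not in term_freq:
                continue
            # Assumption: the "+ 1" IDF variant, which never goes negative.
            idf = math.log((n_docs - doc_freq[term] + 0.5) / (doc_freq[term] + 0.5) + 1)
            tf = term_freq[term]
            length_norm = 1 - b + b * len(tokens) / avg_len
            score += idf * tf * (k1 + 1) / (tf + k1 * length_norm)
        scores[name] = score

    return sorted(scores.items(), key=lambda item: item[1], reverse=True)


if __name__ == "__main__":
    example = {
        "dog.jpg": "a dog runs across the grass",
        "cat.jpg": "a cat sleeps on a sofa",
        "beach.jpg": "two children play on the beach",
    }
    for image_name, score in bm25_rank("dog on grass", example):
        print(image_name, round(score, 3))
```

Captions that share rarer query terms score higher, and the `b` parameter reduces the advantage of longer captions.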

ashwathkris/Text-based-Image-retrieval-using-Image-Captioning

Steps to run the application:

  1. Download the Flickr8k dataset and store the images in the 'Flicker8k_Dataset' folder.
  2. Run each cell of the .ipynb file to view the output generated at every step and to generate checkpoints. When the model is trained, a new checkpoint folder is created and the 5 most recent checkpoints are saved.
  3. Run the 'app_flask_new.py' file to execute the application.

When images are uploaded to the application, each generated caption is saved together with the image name as a JSON object, and the image is stored in a 'gallery' folder.
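The storage step described above could look roughly like the following sketch. It is an assumption about the layout, not the code in 'app_flask_new.py': the function name `save_upload`, the index file name `captions.json`, and the choice of a single JSON object that maps image names to captions are all hypothetical.

```python
import json
import shutil
from pathlib import Path


def save_upload(image_path, caption, gallery_dir="gallery", index_path="captions.json"):
    """Copy an uploaded image into the gallery and record its caption.

    Assumption: captions are kept in one JSON object keyed by image name.
    Returns the updated mapping.
    """
    image_path = Path(image_path)
    gallery = Path(gallery_dir)
    gallery.mkdir(parents=True, exist_ok=True)

    # Store the image itself in the gallery folder.
    shutil.copy(image_path, gallery / image_path.name)

    # Load the existing caption index if there is one, then add the new entry.
    index_file = Path(index_path)
    index = json.loads(index_file.read_text()) if index_file.exists() else {}
    index[image_path.name] = caption
    index_file.write_text(json.dumps(index, indent=2))
    return index
```

Keying the index by image name means that uploading the same file name again overwrites its caption; a different layout would be needed if duplicate names must be kept.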
