This deep learning model is trained on the flickr_8k dataset. It uses n-gram model for text prediction and InceptionV3 for generating image embeddings. I have also used glove 6B 200d word embeddings .
The trained model and tokenizers can be found inside the assets folder.
MIT
Free Software, Hell Yeah!