Image Captioning Using CNN and LSTM

Caption generation is a challenging artificial intelligence problem where a textual description must be generated for a given photograph.

It requires both methods from computer vision to understand the content of the image and a language model from the field of natural language processing to turn the understanding of the image into words in the right order. Recently, deep learning methods have achieved state-of-the-art results on examples of this problem.

Deep learning methods have demonstrated state-of-the-art results on caption generation problems. What is most impressive about these methods is a single end-to-end model can be defined to predict a caption, given a photo, instead of requiring sophisticated data preparation or a pipeline of specifically designed models.

Dataset: Flickr 8k : https://www.kaggle.com/adityajn105/flickr8k Description: https://github.com/jbrownlee/Datasets/releases/download/Flickr8k/Flickr8k_text.zip

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
model_weights		model_weights
saved		saved
Image Captioning .ipynb		Image Captioning .ipynb
README.md		README.md
descriptions_1.txt		descriptions_1.txt
encoded_test_features.pkl		encoded_test_features.pkl

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Image Captioning Using CNN and LSTM

Model

Final Results:

Some Fails:

About

Releases

Packages

Languages

Aryavir07/Image-Captioning-Using-CNN-and-LSTM

Folders and files

Latest commit

History

Repository files navigation

Image Captioning Using CNN and LSTM

Model

Final Results:

Some Fails:

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages