
Image captioning model that captions the given image using CNN and LSTM


Image_Captioning_Keras

When you see an image, your brain can easily tell what it is about, but can a computer tell what an image represents? Computer vision researchers worked on this problem for a long time, and until recently it was considered close to impossible. With advances in deep learning techniques, the availability of huge datasets, and greater computing power, we can now build models that generate captions for an image.

In this project I used Convolutional Neural Networks (CNNs) and a type of Recurrent Neural Network, the LSTM. Image caption generation is a task that combines computer vision and natural language processing: the model must recognize the context of an image and describe it in a natural language such as English.


I implemented the caption generator using a CNN (Convolutional Neural Network) and an LSTM (Long Short-Term Memory) network. Image features are extracted with Xception, a CNN pre-trained on the ImageNet dataset, and the features are then fed into the LSTM model, which is responsible for generating the caption.
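The feature-extraction step can be sketched as follows. This is a minimal sketch assuming TensorFlow/Keras; the `extract_features` helper and the file path are illustrative, not code taken from this repository. The 299x299 input size and the 2048-dim pooled output are the standard Xception setup.

```python
# Sketch: extract image features with a pre-trained Xception
# (assumes TensorFlow/Keras; helper name and path are illustrative).
import numpy as np
from tensorflow.keras.applications.xception import Xception, preprocess_input
from tensorflow.keras.preprocessing.image import load_img, img_to_array

def extract_features(image_path):
    # include_top=False with global average pooling yields a 2048-dim vector
    model = Xception(include_top=False, pooling="avg")
    image = load_img(image_path, target_size=(299, 299))  # Xception input size
    array = img_to_array(image)
    array = np.expand_dims(array, axis=0)  # add a batch dimension
    array = preprocess_input(array)        # scale pixels to [-1, 1]
    return model.predict(array)            # shape: (1, 2048)
```

These 2048-dim vectors are what the LSTM consumes alongside the partial caption at each decoding step.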

Model Architecture


Running the Model

(i) Clone the repository.
(ii) Download the Flickr8k dataset from here. It contains the images to be captioned.
(iii) Download Flickr_8k_text. It contains the captions for those images, five per image.
(iv) Run the Testing.ipynb file.
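At inference time, a captioner like this typically decodes greedily: start from a start token, repeatedly predict the next word from the image features plus the partial caption, and stop at an end token or a length limit. The sketch below shows only that loop; `predict_next` stands in for the trained CNN+LSTM model, and the "startseq"/"endseq" token names follow common Flickr8k tutorials rather than this repo's exact code.

```python
# Greedy decoding loop for an image captioner (illustrative sketch).
def generate_caption(predict_next, photo_features, max_length=20):
    words = ["startseq"]  # conventional start-of-caption token
    for _ in range(max_length):
        word = predict_next(photo_features, words)  # most probable next word
        if word is None or word == "endseq":        # end-of-caption token
            break
        words.append(word)
    return " ".join(words[1:])  # drop the start token

# Toy stand-in for the trained model: emits a fixed caption, then stops.
def toy_predict(_features, words):
    canned = ["a", "dog", "runs", "endseq"]
    i = len(words) - 1
    return canned[i] if i < len(canned) else "endseq"

print(generate_caption(toy_predict, None))  # -> a dog runs
```

In the real model, `predict_next` would tokenize the partial caption, pad it to a fixed length, and take the argmax over the vocabulary from the model's softmax output.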
