Automatic Image Captioning System using Deep Learning

System Overview

Automatic image captioning generates a natural language description of the content of an image. It is an important part of scene understanding and combines computer vision with natural language processing. Image captioning has many applications, for example in human-computer interaction. This project implements an automatic caption generation system with a user-friendly UI: the user uploads an image with an allowed file extension and receives a caption for it.

Deep Learning Model

Image Feature Extractor:

A pretrained VGG16 model is used to extract features from the images of the Flickr8k dataset. The pretrained weights can be downloaded from here.
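
As a rough illustration of this step, the sketch below shows one common way to turn VGG16 into a feature extractor with Keras. It assumes TensorFlow is installed, that the ImageNet weights are used, and that the 4096-dimensional output of the second-to-last layer serves as the image feature; the function names and these choices are assumptions, not details taken from the notebooks.

```python
IMAGE_SIZE = (224, 224)   # input size expected by VGG16
FEATURE_DIM = 4096        # size of the layer just before the classifier


def build_feature_extractor(weights="imagenet"):
    """Return VGG16 with its final classification layer removed (assumed setup)."""
    # Imported lazily so the module can be loaded without TensorFlow present.
    from tensorflow.keras.applications.vgg16 import VGG16
    from tensorflow.keras.models import Model

    base = VGG16(weights=weights)
    # Keep everything up to the second-to-last layer (4096-d vector).
    return Model(inputs=base.inputs, outputs=base.layers[-2].output)


def extract_features(model, image_path):
    """Load one image from disk and return its feature vector."""
    import numpy as np
    from tensorflow.keras.applications.vgg16 import preprocess_input
    from tensorflow.keras.preprocessing.image import img_to_array, load_img

    image = load_img(image_path, target_size=IMAGE_SIZE)
    batch = np.expand_dims(img_to_array(image), axis=0)  # shape (1, 224, 224, 3)
    return model.predict(preprocess_input(batch), verbose=0)[0]
```

With such a helper, the features for every Flickr8k image could be computed once and cached, so that training the caption model does not need to rerun VGG16.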

Sequence Processor:

A Long Short-Term Memory (LSTM) based model is trained on the text captions of the Flickr8k dataset.
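
To make the text side more concrete, here is a minimal sketch of how captions might be prepared for a sequence model of this kind: each caption is split into (prefix, next word) pairs, and prefixes are left-padded to a fixed length. The `startseq`/`endseq` markers, the helper names, and the padding scheme are assumptions for illustration and may differ from what the notebook does.

```python
def build_vocab(captions):
    """Map each word to a positive integer id; 0 is reserved for padding."""
    words = sorted({w for c in captions for w in c.lower().split()})
    return {w: i + 1 for i, w in enumerate(words)}


def make_training_pairs(caption, vocab, max_len):
    """Turn one caption into (padded prefix, next word id) training pairs."""
    ids = [vocab[w] for w in caption.lower().split()]
    pairs = []
    for i in range(1, len(ids)):
        prefix = ids[:i][-max_len:]                      # keep at most max_len ids
        padded = [0] * (max_len - len(prefix)) + prefix  # left-pad with zeros
        pairs.append((padded, ids[i]))
    return pairs


# Hypothetical captions wrapped in assumed start/end markers.
captions = ["startseq a dog runs endseq", "startseq a cat sleeps endseq"]
vocab = build_vocab(captions)
pairs = make_training_pairs(captions[0], vocab, max_len=6)
```

Each pair gives the sequence model a partial caption as input and the following word as the target.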

Decoder:

Both the feature extractor and sequence processor output a fixed-length vector. These are merged together and processed by a Dense layer to make a final prediction.
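
The merge step can be illustrated with a toy example in plain Python. It assumes the two vectors are merged by element-wise addition and that the Dense layer ends in a softmax over the vocabulary; the tiny vectors, weights, and vocabulary are made up, and the real model would use Keras layers with learned weights.

```python
import math


def dense(x, weights, bias):
    """Fully connected layer: one output value per row of `weights`."""
    return [sum(w * xi for w, xi in zip(row, x)) + b
            for row, b in zip(weights, bias)]


def softmax(z):
    """Convert raw scores to probabilities that sum to 1."""
    m = max(z)
    exps = [math.exp(v - m) for v in z]
    total = sum(exps)
    return [e / total for e in exps]


def predict_next_word(image_vec, text_vec, weights, bias, vocab):
    """Merge the two fixed-length vectors and pick the most likely word."""
    merged = [a + b for a, b in zip(image_vec, text_vec)]  # assumed: addition
    probs = softmax(dense(merged, weights, bias))
    best = max(range(len(probs)), key=probs.__getitem__)
    return vocab[best], probs


# Toy inputs: 3-d vectors and an identity weight matrix (illustrative only).
image_vec = [0.2, 0.7, 0.1]
text_vec = [0.5, 0.1, 0.4]
weights = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
bias = [0.0, 0.0, 0.0]
vocab = ["dog", "runs", "endseq"]

word, probs = predict_next_word(image_vec, text_vec, weights, bias, vocab)
```

Repeating this prediction, appending each predicted word to the text input, would build a caption one word at a time.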

User Interface:

Flask is used to develop the UI through which users can get captions for new images. It has the following features:

Upload image option
Checking allowed extensions of image
Generate caption option that, when clicked, obtains the caption for the image from the saved model and displays it to the user
Displaying uploaded image
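
The extension check listed above can be sketched as a small helper of the kind a Flask upload handler typically calls before saving a file. The set of allowed extensions and the name `allowed_file` are assumptions; the actual list used by the app is not stated here.

```python
# Assumed set of accepted image extensions (not taken from the project).
ALLOWED_EXTENSIONS = {"png", "jpg", "jpeg"}


def allowed_file(filename):
    """Return True if the filename ends in one of the allowed extensions."""
    if "." not in filename:
        return False
    extension = filename.rsplit(".", 1)[1].lower()  # text after the last dot
    return extension in ALLOWED_EXTENSIONS
```

An upload route could call `allowed_file(uploaded.filename)` and show the error message when it returns `False`.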

Before Uploading Image:

After Uploading Image:

Checking Allowed Extensions of Image (Error Message):

After clicking Generate Caption option:

After clicking Generate Caption option Without Uploading Image:
