The main goal of this project's model is to assign each pixel of an image a category label. The network provides a complete understanding of the scene: it predicts the label, the location, and the shape of each element in the image.
The image-understanding part is handled by CNNs and the caption-generation part by RNNs, so the model uses both Computer Vision and Natural Language Processing to generate the captions.
If we are asked to describe this image, we might say:
“মাঠের মধ্যে একটি ছেলে বল ধরে আছে ।” (“A boy is holding a ball in the field.”) or “খালি গায়ে শিশুটি খুশিতে বল নিয়ে দাড়িয়ে আছে।” (“The bare-bodied child is standing happily with a ball.”)
While forming the description, we look at the image and, at the same time, try to create a meaningful sequence of words.
- Developing Deep Learning Model -> Google Colab
- Generate New Captions
- Preparing Photo Data
- Preparing Text Data
- Each photo is described by two captions
- Evaluate Model
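The "Preparing Text Data" step above can be sketched in plain Python. This is a minimal illustration, assuming captions are kept in a dict mapping photo IDs to their two Bangla captions; the function name and data layout are assumptions for illustration, not the project's actual code.

```python
# Hypothetical sketch of text preparation: wrap each caption with start/end
# tokens (so the decoder knows where a caption begins and ends) and build
# the vocabulary from all caption words.
def prepare_captions(captions):
    """Return (prepared captions dict, vocabulary set)."""
    prepared = {}
    vocab = set()
    for photo_id, caps in captions.items():
        prepared[photo_id] = []
        for cap in caps:
            tokens = cap.strip().split()
            vocab.update(tokens)
            prepared[photo_id].append("startseq " + " ".join(tokens) + " endseq")
    return prepared, vocab

# Example: one photo with its two described captions.
captions = {
    "img_001": ["মাঠের মধ্যে একটি ছেলে বল ধরে আছে ।",
                "খালি গায়ে শিশুটি খুশিতে বল নিয়ে দাড়িয়ে আছে।"],
}
prepared, vocab = prepare_captions(captions)
print(prepared["img_001"][0])
```

The `startseq`/`endseq` markers let the LSTM decoder generate a caption word by word until it emits the end token.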
- RNN + CNN
- Encoder-decoder model
- Extract feature vector from input image
- Based on pretrained ResNet50
- Only requires minor modifications
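Extracting the feature vector with a pretrained ResNet50 encoder can be sketched with `tf.keras` (assumed here as the framework; in practice `weights="imagenet"` would load the pretrained weights, while `weights=None` is used below only so the sketch runs offline without downloading them):

```python
import numpy as np
from tensorflow.keras.applications import ResNet50

# include_top=False drops the ImageNet classification head, and
# pooling='avg' collapses the final feature map into a single
# 2048-dimensional feature vector per image.
encoder = ResNet50(weights=None, include_top=False,
                   input_shape=(224, 224, 3), pooling="avg")

image = np.zeros((1, 224, 224, 3), dtype="float32")  # placeholder image batch
features = encoder.predict(image, verbose=0)
print(features.shape)  # (1, 2048)
```

The 2048-dimensional vector is what the decoder RNN conditions on when generating each caption word.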
- LSTM: Long Short Term Memory networks
- Multiple Copies of the same network
- Contains three gates that control the cell state
- Capable of learning long-term dependencies.
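The three gates above can be made concrete with a minimal NumPy sketch of a single LSTM time step; the dimensions, weight layout, and initialisation are illustrative assumptions, not the project's trained model:

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def lstm_step(x, h_prev, c_prev, W, b):
    """One LSTM step: W maps [h_prev, x] to the four stacked gate pre-activations."""
    n = h_prev.shape[0]
    z = W @ np.concatenate([h_prev, x]) + b
    f = sigmoid(z[0:n])        # forget gate: what to erase from the cell state
    i = sigmoid(z[n:2*n])      # input gate: what new information to write
    o = sigmoid(z[2*n:3*n])    # output gate: what to expose as hidden state
    g = np.tanh(z[3*n:4*n])    # candidate cell values
    c = f * c_prev + i * g     # cell state carries the long-term memory
    h = o * np.tanh(c)         # hidden state passed to the next time step
    return h, c

rng = np.random.default_rng(0)
n, d = 4, 3                    # hidden size, input size (arbitrary for the demo)
W = rng.normal(size=(4 * n, n + d))
b = np.zeros(4 * n)
h, c = lstm_step(rng.normal(size=d), np.zeros(n), np.zeros(n), W, b)
print(h.shape, c.shape)
```

Because the forget gate can keep `c_prev` nearly unchanged across many steps, the cell state is what lets the network learn long-term dependencies.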
Tools and libraries used: Notepad++, Avro (for typing Bangla), and the Keras layers Flatten, Convolution2D, Dropout, LSTM, TimeDistributed, Embedding, Bidirectional, Activation, RepeatVector, and Concatenate.