GitHub - Anurich/Image-captioning: Image Captioning is the process of generating textual description of an image. It uses both Natural Language Processing and Computer Vision to generate the captions. So in this work I try to implement neural network that are capable of generating text given an image.

Image captioning

Image Captioning is the process of generating textual description of an image. It uses both Natural Language Processing and Computer Vision to generate the captions. So in this work I try to implement neural network that is capable of generating text given an image.

Network Architecture

In this work I use two different network one for extracting features from image and second is LSTM network for generating the text using that image.

Dataset.

The dataset used in this project is COCO dataset for image captioning 2015, I cannot upload dataset here but you can find it by clicking the link below. Link: https://cocodataset.org/#download

File structure

dataset.py :- This file contain the custom dataloader that return train data and test data.
vocabulary.py :- This file is used for creating the vocabulary.
model.py :- This file is basically where I define model and sample function for testing
train.py :- This file is basically used for training the network
test.py :- As name suggest this file is for testing

Installation.

Python==3.6.6
Pytorch==1.6.0

Predictions.

Important Note.

I am just the beginner and learning about this fascinating field, Please feel free to point out my mistake as well as feel free to contribute. Hope to upload more interesting project in future.

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
README.md		README.md
dataset.py		dataset.py
image1.png		image1.png
image2.png		image2.png
img.png		img.png
index_to_word.pickle		index_to_word.pickle
main.py		main.py
model.py		model.py
test.py		test.py
train.py		train.py
vocabulary.py		vocabulary.py
word_to_index.pickle		word_to_index.pickle

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Image captioning

Network Architecture

Dataset.

File structure

Installation.

Predictions.

Important Note.

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Image captioning

Network Architecture

Dataset.

File structure

Installation.

Predictions.

Important Note.

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages