Skip to content

Leveraging convolutional neural networks (CNNs) for image feature extraction and recurrent neural networks (RNNs) for language generation, to generate appropriate captions for colored images.

Notifications You must be signed in to change notification settings

vansh-py04/Image-Captioning

Repository files navigation

Image-Captioning

Image Captioning is the task of translating an input image into a textual description. As such, it connects Vision and Language in a generative fashion, with applications that range from multi-modal search engines to help visually impaired people.

Dataset used : https://www.kaggle.com/datasets/adityajn105/flickr8k

Data split : Training data = 6000 Images, Testing data = 1000 Images, Validation data = 1000 Images

Model Architechture

image

Model Weight losses

Screenshot 2024-04-07 003549 Based on this, model_19 was used for making predictions

Predictions

image

image

About

Leveraging convolutional neural networks (CNNs) for image feature extraction and recurrent neural networks (RNNs) for language generation, to generate appropriate captions for colored images.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published