Skip to content

Fused RNN and CNN to predict image activity by amalgamating Xception model output and text input for enhanced accuracy.

Notifications You must be signed in to change notification settings

aryanjain28/Image-Caption-Generator

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

9 Commits
 
 
 
 

Repository files navigation

Image Caption Generator

Trained a model by merging RNN and CNN to generate the activity occuring on the image. The model takes pre-trained output from another model, Xception, as image's input and takes another input from text and merges them and generates predictions.

BLEU scores

Below is a chart that I prepared for the BLEU score of my VGG16 and Xception model. 
The higher the BLEU score, the higher the accuracy of Model.
You can see the Xception Model-4 has the highest score, and that is the model used in this project.

Screenshots

Below are some screenshots, half of them are from training set while others are randomly picked from internet.
As observed below some results are exceptionally good, while some are too bad.

References

https://machinelearningmastery.com/
https://data-flair.training/
https://www.youtube.com/

About

Fused RNN and CNN to predict image activity by amalgamating Xception model output and text input for enhanced accuracy.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published