Personal Project in Computer Vision: Attention Network Implementation
In this project, I implemented an attention mechanism on top of a CRNN network, in which a CNN serves as the image encoder and an RNN (LSTM) as the sequence decoder for the text. The attention mechanism distributes weights over the encoder's feature vectors at the LSTM layer. With attention added, training for 25 epochs (roughly 2 hours on a Tesla P4 GPU with 2 data-loader workers) reduced the loss from 3.18632 at epoch 1 to 2.20923 at epoch 25, a margin of 0.977.
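The attention step described above — weighting the CNN's feature vectors at each decoding step of the LSTM — can be sketched as follows. This is a minimal, framework-free illustration of additive (Bahdanau-style) attention, not the project's actual code; all dimensions and weight matrices here are hypothetical and randomly initialised.

```python
import numpy as np

# Hypothetical sizes for illustration (not taken from the project).
T, enc_dim, dec_dim, attn_dim = 26, 512, 256, 128  # T = CNN feature steps across the image width

rng = np.random.default_rng(0)
features = rng.normal(size=(T, enc_dim))   # CNN encoder output: one feature vector per step
hidden = rng.normal(size=(dec_dim,))       # current LSTM decoder hidden state

# Learned projections in a real model; random placeholders in this sketch.
W_enc = rng.normal(size=(enc_dim, attn_dim)) * 0.01
W_dec = rng.normal(size=(dec_dim, attn_dim)) * 0.01
v = rng.normal(size=(attn_dim,)) * 0.01

# Additive attention score for each encoder step.
scores = np.tanh(features @ W_enc + hidden @ W_dec) @ v  # shape (T,)

# Softmax over the steps: these weights "spread" attention across feature vectors.
weights = np.exp(scores - scores.max())
weights /= weights.sum()

# Context vector: attention-weighted sum of encoder features,
# which is fed to the LSTM at the next decoding step.
context = weights @ features  # shape (enc_dim,)
```

At every decoding step the LSTM receives a different context vector, so the model can focus on the image region relevant to the character it is currently emitting.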
| Sample 1 | Sample 2 | Sample 3 |
|---|---|---|