Official implementation and dataset for the NAACL 2024 paper "ComCLIP: Training-Free Compositional Image and Text Matching"
Updated Apr 10, 2024 (Python)
Image captioning model with a ResNet50 encoder and an LSTM decoder
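The encoder-decoder pattern behind this repo can be sketched in a few lines: a CNN (here, ResNet50's pooled feature vector) initializes an LSTM that greedily emits caption tokens. The sketch below is a minimal NumPy illustration, not the repository's code; all weights are random stand-ins for a trained model, and the sizes (2048-dim features, a 10-word vocabulary) are illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical sizes: 2048 matches ResNet50's pooled output; the tiny
# hidden size and vocabulary keep the example readable.
FEAT, HID, VOCAB = 2048, 64, 10

# Randomly initialized weights stand in for a trained model.
W_init = rng.standard_normal((FEAT, HID)) * 0.01    # feature -> initial hidden
W_x = rng.standard_normal((VOCAB, 4 * HID)) * 0.01  # token one-hot -> gates
W_h = rng.standard_normal((HID, 4 * HID)) * 0.01    # hidden -> gates
W_out = rng.standard_normal((HID, VOCAB)) * 0.01    # hidden -> vocab logits

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def lstm_step(x_onehot, h, c):
    """One LSTM cell step: input/forget/output gates plus candidate cell."""
    z = x_onehot @ W_x + h @ W_h
    i, f, o, g = np.split(z, 4)
    c = sigmoid(f) * c + sigmoid(i) * np.tanh(g)
    h = sigmoid(o) * np.tanh(c)
    return h, c

def caption(image_feat, max_len=5, start_token=0):
    """Greedy decoding: the image feature seeds the LSTM's hidden state."""
    h = np.tanh(image_feat @ W_init)
    c = np.zeros(HID)
    token, out = start_token, []
    for _ in range(max_len):
        x = np.eye(VOCAB)[token]       # one-hot embedding of previous token
        h, c = lstm_step(x, h, c)
        token = int(np.argmax(h @ W_out))  # pick the most likely next word
        out.append(token)
    return out

tokens = caption(rng.standard_normal(FEAT))
print(tokens)  # a list of max_len token ids
```

A real implementation would replace the random weights with trained parameters, use learned word embeddings instead of one-hots, and stop at an end-of-sentence token.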
Yet another im2txt (Show and Tell: A Neural Image Caption Generator)
Generate captions from images
Comparative analysis of image captioning models using RNN, BiLSTM, and Transformer architectures on the Flickr8K dataset, with InceptionV3 for image feature extraction.
Text-Image-Text is a bidirectional system that enables seamless retrieval of images based on text descriptions, and vice versa. It leverages state-of-the-art language and vision models to bridge the gap between textual and visual representations.
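Bidirectional text-image retrieval of this kind usually works by embedding both modalities into one shared space and ranking by cosine similarity in either direction. The sketch below assumes that shared space already exists; the random vectors are hypothetical stand-ins for the outputs of trained text and vision encoders.

```python
import numpy as np

rng = np.random.default_rng(1)

# Stand-ins for learned encoders: in a real system a language model and a
# vision model would project captions and images into this shared space.
DIM = 8
text_emb = rng.standard_normal((3, DIM))   # 3 caption embeddings
image_emb = rng.standard_normal((3, DIM))  # 3 image embeddings

def normalize(m):
    # L2-normalize rows so dot products become cosine similarities.
    return m / np.linalg.norm(m, axis=1, keepdims=True)

# Cosine-similarity matrix: rows = captions, columns = images.
sim = normalize(text_emb) @ normalize(image_emb).T

text_to_image = sim.argmax(axis=1)  # best-matching image for each caption
image_to_text = sim.argmax(axis=0)  # best-matching caption for each image
print(text_to_image, image_to_text)
```

The same similarity matrix serves both retrieval directions, which is what makes the system bidirectional: one matrix, read row-wise for text-to-image and column-wise for image-to-text.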