A deep learning model that generates descriptions of images.
Processing data produced by flickr30k_entities to use as regional descriptions for the DenseCap model
"Flickr30k_image_captioning" is a project or repository focused on image captioning using the Flickr30k dataset. The project aims to develop and showcase algorithms and models that generate descriptive captions for images.
Implementation of CLIP from OpenAI using pretrained image and text encoders.
Download the Flickr8k and Flickr30k image captioning datasets
Visual Elocution Synthesis
PyTorch implementation of CLIP (Radford et al., 2021) from scratch, trained on Flickr8k + Flickr30k
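For reference, a minimal sketch of the symmetric contrastive (InfoNCE) objective that such a from-scratch CLIP training loop optimizes; the function name and temperature value are illustrative, and the image/text encoders producing the features are assumed to live elsewhere:

```python
import torch
import torch.nn.functional as F

def clip_contrastive_loss(image_features, text_features, temperature=0.07):
    """Symmetric InfoNCE loss in the style of CLIP (Radford et al., 2021).

    image_features, text_features: (batch, dim) encoder outputs for
    matching image-caption pairs at the same batch index.
    """
    # L2-normalize so the dot product is a cosine similarity.
    image_features = F.normalize(image_features, dim=-1)
    text_features = F.normalize(text_features, dim=-1)

    # (batch, batch) similarity matrix; diagonal entries are the true pairs.
    logits = image_features @ text_features.t() / temperature

    # Each image should classify to its own caption, and vice versa.
    targets = torch.arange(logits.size(0), device=logits.device)
    loss_i2t = F.cross_entropy(logits, targets)
    loss_t2i = F.cross_entropy(logits.t(), targets)
    return (loss_i2t + loss_t2i) / 2
```

Because the loss is symmetric over the image-to-text and text-to-image directions, the same similarity matrix serves both classification problems.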
Karpathy split JSON files for image captioning
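These splits are commonly distributed as one JSON file per dataset (e.g. dataset_flickr30k.json). A minimal sketch for grouping captions by split, assuming the customary layout of a top-level "images" list whose entries carry a "split" label and a "sentences" list:

```python
import json
from collections import defaultdict

def load_karpathy_splits(path):
    """Group (filename, caption) pairs by their Karpathy split label."""
    with open(path) as f:
        data = json.load(f)

    splits = defaultdict(list)
    for img in data["images"]:
        for sent in img["sentences"]:
            splits[img["split"]].append((img["filename"], sent["raw"]))
    return splits

# Example usage (path is illustrative):
# splits = load_karpathy_splits("dataset_flickr30k.json")
# print({name: len(pairs) for name, pairs in splits.items()})
```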
Official implementation and dataset for the NAACL 2024 paper "ComCLIP: Training-Free Compositional Image and Text Matching"
Image captioning model with a ResNet50 encoder and an LSTM decoder
ImgCap is an image captioning model designed to automatically generate descriptive captions for images. It comes in two versions: a CNN + LSTM model and a CNN + LSTM + attention model.
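The two preceding captioners follow the same encoder-decoder recipe. A minimal PyTorch sketch of the non-attention variant, assuming torchvision is available; the class name and dimensions are illustrative, not taken from either repository:

```python
import torch
import torch.nn as nn
from torchvision import models

class CaptionModel(nn.Module):
    """Minimal show-and-tell style captioner: ResNet50 encoder, LSTM decoder."""

    def __init__(self, vocab_size, embed_dim=256, hidden_dim=512):
        super().__init__()
        # Frozen ImageNet-pretrained backbone; its classifier head is
        # replaced by a trainable projection into the word-embedding space.
        resnet = models.resnet50(weights="IMAGENET1K_V2")
        for p in resnet.parameters():
            p.requires_grad = False
        resnet.fc = nn.Linear(resnet.fc.in_features, embed_dim)
        self.encoder = resnet

        self.embed = nn.Embedding(vocab_size, embed_dim)
        self.lstm = nn.LSTM(embed_dim, hidden_dim, batch_first=True)
        self.head = nn.Linear(hidden_dim, vocab_size)

    def forward(self, images, captions):
        # The pooled image feature acts as the first "word" fed to the LSTM.
        feats = self.encoder(images).unsqueeze(1)             # (B, 1, E)
        inputs = torch.cat([feats, self.embed(captions)], 1)  # (B, 1+T, E)
        hidden, _ = self.lstm(inputs)
        return self.head(hidden)                              # (B, 1+T, V)
```

The attention variant replaces the single pooled image vector with the encoder's spatial feature map, letting the decoder attend to different image regions at each time step.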
Image caption generation using a Swin Transformer encoder and a GRU decoder with an attention mechanism