#

flickr30k

Here are 13 public repositories matching this topic...

adas0910 / densecap-flickr30K-entities

Processing data produced by flickr30k_entities to use as regional description for densecap model

python json image-captioning h5 densecap flickr30k regional-description

Updated Nov 11, 2022
Python

spoortimorabad / ImageCaptioningGeneration-Using-Swin-Transformer-and-GRU-attention-Mechansim

Image captioning generation using Swin transformer and GRU attention mechanism

tensorflow captions gru mit-license imagecaptioning swin-transformer flickr30k

Updated Oct 8, 2024
Jupyter Notebook

spoluan / flickr30k_image_captioning

"Flickr30k_image_captioning" is a project or repository focused on image captioning using the Flickr30k dataset. The project aims to develop and showcase algorithms and models that generate descriptive captions for images.

nlp computer-vision deep-learning language-modeling cnn neural-networks image-recognition image-captioning sequence transfer-learning datasets image-analysis attention-mechanism encoder-decoder caption-generation flickr30k image-to-text-generation

Updated May 2, 2023
Jupyter Notebook

bkhanal-11 / clip-openai

Implementation of CLIP from OpenAI using pretrained Image and Text Encoders.

vit clip flickr30k all-mpnet-base-v2

Updated Dec 12, 2023
Jupyter Notebook

HanCai98 / Flickr30k-Dataset

Preprocess the Flickr30k dataset

data-preprocessing flickr30k

Updated Dec 7, 2021
Python

Sh-31 / ImgCap

ImgCap is an image captioning model designed to automatically generate descriptive captions for images. It has two versions CNN + LSTM model and CNN + LSTM + Attention mechanism model.

torch lstm resnet deeplearning imagecaptioning torchtext torchvision flickr30k

Updated Sep 10, 2024
Python

Delphboy / karpathy-splits

Karpathy Splits json files for image captioning

image-caption mscoco-dataset flickr8k-dataset flickr30k karpathy-split

Updated Apr 4, 2024

KimRass / CLIP

PyTorch implementation of 'CLIP' (Radford et al., 2021) from scratch and training it on Flickr8k + Flickr30k

multi-modal clip linear-classification flickr8k zero-shot-classification flickr30k text-image-retrieval

Updated Mar 14, 2024
Python

thisisankit27 / SnapSpeak

Visual Elocution Synthesis

docker tesseract-ocr image-captioning flickr30k

Updated Mar 29, 2024
Python

awsaf49 / flickr-dataset

Download flickr8k, flickr30k image caption datasets

image flickr dataset clip captioning-images image-text flickr8k flickr30k siglip

Updated Feb 6, 2024

nirajankarki5 / Flickr30k-Image-Caption-Generator-Using-Deep-Learning

A deep learning model that generates descriptions of an image.

machine-learning deep-learning caption-generation flickr30k

Updated Mar 11, 2021
Jupyter Notebook

nssharmaofficial / image-caption-generator

Image captioning model with Resnet50 encoder and LSTM decoder

encoder decoder pytorch embeddings lstm image-captioning vocabulary-builder resnet50 image-caption-generator flickr30k

Updated Sep 6, 2024
Python

eric-ai-lab / ComCLIP

Official implementation and dataset for the NAACL 2024 paper "ComCLIP: Training-Free Compositional Image and Text Matching"

causality clip svo slip vision-and-language compositionality flickr8k-dataset image-text-matching flickr30k image-text-retrieval winoground blip2

Updated Aug 18, 2024
Python

Improve this page

Add a description, image, and links to the flickr30k topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the flickr30k topic, visit your repo's landing page and select "manage topics."