flickr30k
Here are 13 public repositories matching this topic...
Processes data produced by flickr30k_entities for use as region descriptions for the DenseCap model
Updated Nov 11, 2022 - Python
Image caption generation using a Swin Transformer and a GRU attention mechanism
Updated Oct 8, 2024 - Jupyter Notebook
"Flickr30k_image_captioning" is an image captioning project built on the Flickr30k dataset. It develops and showcases models that generate descriptive captions for images.
Updated May 2, 2023 - Jupyter Notebook
Karpathy split JSON files for image captioning
Updated Apr 4, 2024
Implementation of CLIP from OpenAI using pretrained Image and Text Encoders.
Updated Dec 12, 2023 - Jupyter Notebook
Download the Flickr8k and Flickr30k image caption datasets
Updated Feb 6, 2024
ImgCap is an image captioning model designed to automatically generate descriptive captions for images. It comes in two versions: a CNN + LSTM model and a CNN + LSTM model with an attention mechanism.
Updated Sep 10, 2024 - Python
PyTorch implementation of 'CLIP' (Radford et al., 2021) from scratch and training it on Flickr8k + Flickr30k
Updated Mar 14, 2024 - Python
Visual Elocution Synthesis
Updated Mar 29, 2024 - Python
Official implementation and dataset for the NAACL 2024 paper "ComCLIP: Training-Free Compositional Image and Text Matching"
Updated Aug 18, 2024 - Python
Image captioning model with a ResNet50 encoder and an LSTM decoder
Updated Sep 6, 2024 - Python
A deep learning model that generates descriptions of an image.
Updated Mar 11, 2021 - Jupyter Notebook