The LSTM model generates captions for the input images after extracting features from pre-trained VGG-16 model. (Computer Vision, NLP, Deep Learning, Python)
-
Updated
Aug 28, 2019 - Jupyter Notebook
The LSTM model generates captions for the input images after extracting features from pre-trained VGG-16 model. (Computer Vision, NLP, Deep Learning, Python)
Image Captioning with Keras
Generate text captions for audio files & youtube video using OpenAI Whisper on Google Colab. Multiple languages support.
This is a repo providing same stable diffusion experiments, regarding textual inversion task and captioning task
caption generator using lavis and argostranslate
Aid for blinds. This AI will describe the surrounding, it will tell who is in front of him (if that person is a known person to AI using Facial Recognition) and it will also help him to know what is written (Optical Character Recognition)
Automatically generate Alt Text for images and other objects in Powerpoint presentations using MLLM/VLM
Image Caption Generator using CNN and LSTM
IN5400 Mandatory exercise 2
A real-time captioning system with support for large and small screen display.
Generate captions for any video you want. Super easy !
Image caption generator using tensorflow and coco dataset
Add a description, image, and links to the caption-generator topic page so that developers can more easily learn about it.
To associate your repository with the caption-generator topic, visit your repo's landing page and select "manage topics."