# captions

Here are 197 public repositories matching this topic...

This project develops a web app that combines image captioning with text-to-speech. Users upload images through a Streamlit interface; CNNs extract image features and GRUs generate the captions, which gTTS then converts to speech. The result is an image-to-text tool aimed at visually impaired users.

  • Updated Nov 5, 2024
  • Jupyter Notebook

Automatically generate multi-language subtitles using AWS AI/ML services. Machine-generated subtitles can be edited to improve accuracy, and downstream tracks are automatically regenerated from the edits. Built on Media Insights Engine (https://github.com/awslabs/aws-media-insights-engine).

  • Updated Oct 31, 2024
  • Vue
