#vision-api

Here are 242 public repositories matching this topic...

A text-to-speech device that captures an image containing text, extracts the text from the image using OCR, translates it into a desired language using G-Translator, and generates audio for the translated text using Google Cloud TTS.

  • Updated Jun 14, 2024
  • Python

An AI conversation API that uses Google's Gemini for multimodal understanding. It combines FastAPI, LangChain, and Redis for scalable, privacy-conscious text- and image-based interactions.

  • Updated May 20, 2024
  • Python
