Skip to content
#

multimodal

Here are 13 public repositories matching this topic...

big-AGI

Generative AI suite powered by state-of-the-art models and providing advanced AI/AGI functions. It features AI personas, AGI functions, multi-model chats, text-to-image, voice, response streaming, code highlighting and execution, PDF import, presets for developers, much more. Deploy on-prem or in the cloud.

  • Updated May 11, 2024
  • TypeScript

Gemini is an open-source application powered by the Google Gemini Vision API. It enables users to identify and learn about objects captured by their camera through a simple and interactive experience. Just say 'Hey Gemini' and show an object to the camera and say!

  • Updated Jan 3, 2024
  • TypeScript

Improve this page

Add a description, image, and links to the multimodal topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the multimodal topic, visit your repo's landing page and select "manage topics."

Learn more