Google's multimodal AI model APIs for text, image, audio, and video understanding
URL: Visit APIs.json URL
- Artificial Intelligence, Machine Learning, Generative AI, Multimodal, LLM
- Created: 2024
- Modified: 2024
Generate content using Google's Gemini models with text, image, audio, and video inputs
Human URL: https://ai.google.dev/
- Text Generation, Image Understanding, Video Understanding, Audio Understanding, Chat
Advanced reasoning and complex task handling
- Text Generation, Reasoning
Multimodal understanding of text and images
- Multimodal, Vision, Image Understanding
Most capable model for highly complex tasks
- Advanced AI, Complex Tasks
FN: Google AI