An official implementation of ShareGPT4V: Improving Large Multi-modal Models with Better Captions
-
Updated
Jun 6, 2024 - Python
An official implementation of ShareGPT4V: Improving Large Multi-modal Models with Better Captions
Digital Artificial Intelligence Agent
Language instructions to mycobot using GPT-4V
Analyze a Video and generate commentary about it with OpenAI's GPT-4V, Text-to-speech, LangChain, Streamlit, Replit, Twilio SendGrid, and OpenCV!
Explore the rich flavors of Indian desserts with TunedLlavaDelights. Utilizing the in Llava fine-tuning, our project unveils detailed nutritional profiles, taste notes, and optimal consumption times for beloved sweets. Dive into a fusion of AI innovation and culinary tradition
中文医学多模态大模型 Large Chinese Language-and-Vision Assistant for BioMedicine
Chain of Images for Intuitively Reasoning
Multi-Modal Multi-Embodied Hivemind-like Iteration of RTX-2
Implementation of MambaByte in "MambaByte: Token-free Selective State Space Model" in Pytorch and Zeta
Control Any Computer Using LLMs
Mobile-Agent: The Powerful Mobile Device Operation Assistant Family
Add a description, image, and links to the gpt4v topic page so that developers can more easily learn about it.
To associate your repository with the gpt4v topic, visit your repo's landing page and select "manage topics."