gpt4v
Here are 16 public repositories matching this topic...
Mobile-Agent: The Powerful Mobile Device Operation Assistant Family
-
Updated
Sep 26, 2024 - Python
Control Any Computer Using LLMs
-
Updated
Nov 10, 2024 - Python
[ECCV 2024] ShareGPT4V: Improving Large Multi-modal Models with Better Captions
-
Updated
Jul 1, 2024 - Python
GPT-4V in Wonderland: LMMs as Smartphone Agents
-
Updated
Jul 17, 2024 - Python
Implementation of MambaByte in "MambaByte: Token-free Selective State Space Model" in Pytorch and Zeta
-
Updated
Nov 11, 2024 - Python
中文医学多模态大模型 Large Chinese Language-and-Vision Assistant for BioMedicine
-
Updated
May 22, 2024 - Python
Language instructions to mycobot using GPT-4V
-
Updated
Dec 11, 2023 - Python
Multi-Modal Multi-Embodied Hivemind-like Iteration of RTX-2
-
Updated
Nov 11, 2024 - Python
This repository offers a Python framework for a retrieval-augmented generation (RAG) pipeline using text and images from MHTML documents, leveraging Azure AI and OpenAI services. It includes ingestion and enrichment flows, a RAG with Vision pipeline, and evaluation tools.
-
Updated
Nov 12, 2024 - Python
Chain of Images for Intuitively Reasoning
-
Updated
Nov 29, 2023 - Python
Analyze a Video and generate commentary about it with OpenAI's GPT-4V, Text-to-speech, LangChain, Streamlit, Replit, Twilio SendGrid, and OpenCV!
-
Updated
Dec 14, 2023 - Python
Digital Artificial Intelligence Agent
-
Updated
Dec 28, 2023 - Python
Explore the rich flavors of Indian desserts with TunedLlavaDelights. Utilizing the in Llava fine-tuning, our project unveils detailed nutritional profiles, taste notes, and optimal consumption times for beloved sweets. Dive into a fusion of AI innovation and culinary tradition
-
Updated
Mar 17, 2024 - Python
Improve this page
Add a description, image, and links to the gpt4v topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the gpt4v topic, visit your repo's landing page and select "manage topics."