aider is AI pair programming in your terminal
-
Updated
Nov 16, 2024 - Python
aider is AI pair programming in your terminal
Composio equip's your AI agents & LLMs with 100+ high-quality integrations via function calling
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型
Start building LLM-empowered multi-agent applications in an easier way.
Multilingual Voice Understanding Model
Devon: An open-source pair programmer
⚡️ Build Your Own chatgpt Bot|🧀 Discord/Slack/Kook/Telegram |⛓ ToolCall|🔖 Plugin Support | 🌻 out-of-box | gpt-4o
这是一个全自动(音频)视频翻译项目。利用Whisper识别声音,AI大模型翻译字幕,最后合并字幕视频,生成翻译后的视频。
TEN Agent is the world’s first real-time multimodal agent integrated with the OpenAI Realtime API, RTC, and features weather checks, web search, vision, and RAG capabilities.
A Multimodal Native Agent Framework for Smart Hardware and More
Extract clean data from anywhere, powered by vision-language models ⚡
End-to-end platform for building voice first multimodal agents
RAG-GPT, leveraging LLM and RAG technology, learns from user-customized knowledge bases to provide contextually relevant answers for a wide range of queries, ensuring rapid and accurate information retrieval.
Engy is an AI-powered development tool that generates fully functional web applications from natural language, streamlining the process from idea to working prototype.
High-quality and streaming Speech-to-Speech interactive agent in a single file. 只用一个文件实现的流式全双工语音交互原型智能体!
Add a description, image, and links to the gpt-4o topic page so that developers can more easily learn about it.
To associate your repository with the gpt-4o topic, visit your repo's landing page and select "manage topics."