A curated collection of practical AI agents and generative AI applications built with diverse tech stacks, demonstrating real-world implementations using OpenAI, Gemini, local models, and various AI frameworks.
This repository contains complete AI applications organized into 5 main categories:
- Starter Agents: Simple, single-purpose AI agents
- Advanced Agents: Complex AI agents with sophisticated workflows
- Multi-Agent Teams: Collaborative AI systems with specialized agents
- RAG Applications: Retrieval-Augmented Generation with knowledge bases
- Multimodal Apps: Applications combining text, images, audio, and video
Starter Agents
Simple, single-purpose AI agents perfect for learning and quick implementations:
AI21 Studio Chat - Jurassic model integration
Voice-Enabled Chatbot - Voice-enabled chatbot using ElevenLabs
Claude 4 Conversation Agent - Conversation agent using Claude models
Google PaLM Chat - Conversation agent using Google PaLM models
Local Llama Chat - Conversation agent using locally hosted Llama models
OpenAI Chat Assistant - Chat assistant using OpenAI API
Claude Code Reviewer - Code reviewer using Claude Sonnet 4
Advanced Agents
Sophisticated AI agents with complex reasoning and multi-step workflows:
Brand Video Monitor - Brand video monitoring and analysis
Multi-Agent Teams
Coordinated AI teams working together on complex tasks:
Content Creation Team - Content creation team with specialized agents
RAG Applications
RAG applications with knowledge bases:
Contextual Video RAG - Advanced video RAG with contextual compression and semantic retrieval
Corrective Video RAG - Video analysis with three-tier evaluation and corrective strategies
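As a rough illustration of what "three-tier evaluation and corrective strategies" can look like, the sketch below grades retrieved chunks as correct, ambiguous, or incorrect and picks a strategy for each tier. It is not taken from the Corrective Video RAG app: the `Chunk` type, the `grade` and `corrective_step` functions, the 0.7 and 0.3 thresholds, and the `fallback_search` callback are all hypothetical names and values chosen for the example.

```python
from dataclasses import dataclass
from typing import Callable, List, Tuple


@dataclass
class Chunk:
    """A retrieved passage (e.g. a transcript segment) with a relevance score in [0, 1]."""
    text: str
    score: float


def grade(chunks: List[Chunk], upper: float = 0.7, lower: float = 0.3) -> str:
    """Classify a retrieval result into one of three tiers using its best score.

    The thresholds are illustrative assumptions, not values from the repository.
    """
    if not chunks:
        return "incorrect"
    best = max(c.score for c in chunks)
    if best >= upper:
        return "correct"
    if best <= lower:
        return "incorrect"
    return "ambiguous"


def corrective_step(
    query: str,
    chunks: List[Chunk],
    fallback_search: Callable[[str], List[Chunk]],
    upper: float = 0.7,
    lower: float = 0.3,
) -> Tuple[str, List[Chunk]]:
    """Return the tier and the context to pass on to the generator.

    correct   -> keep only the high-scoring chunks
    incorrect -> discard retrieval and rely on the fallback source
    ambiguous -> keep plausible chunks and supplement them with the fallback
    """
    tier = grade(chunks, upper, lower)
    if tier == "correct":
        return tier, [c for c in chunks if c.score >= upper]
    if tier == "incorrect":
        return tier, fallback_search(query)
    kept = [c for c in chunks if c.score > lower]
    return tier, kept + fallback_search(query)


if __name__ == "__main__":
    # Hypothetical fallback: stands in for a web search or a second index.
    def web(query: str) -> List[Chunk]:
        return [Chunk(f"fallback result for {query!r}", 0.5)]

    tier, context = corrective_step("what is shown at 02:15?", [Chunk("intro", 0.5)], web)
    print(tier, [c.text for c in context])
```

In a real application the score would typically come from an LLM or a cross-encoder grader rather than a raw retrieval score, and the fallback might be a web search or a query rewrite followed by a second retrieval pass.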
Multimodal Apps
Applications combining text, images, audio, and video:
Gemini Video Analyzer - Video content analysis and insights
Gemini Sketch-to-Video - Turn sketches into animated videos with Gemini and Veo
Development Roadmap
See our complete development roadmap and release schedule in Roadmap.md, which outlines:
- Daily releases, which have already started
- A target of 100+ complete applications by the end of 2025
- Organized development across 5 strategic categories
- Detailed week-by-week implementation plans
- Future expansion into enterprise solutions and a community ecosystem (Q4 2025)
Track our progress and upcoming releases to see what we're building next!
This collection was inspired by and references patterns from:
This project is licensed under the MIT License - see the LICENSE file for details.
OpenAI for GPT models and APIs
Anthropic for Claude models and APIs
Google for Gemini AI capabilities
Meta for Llama model family
Motia for AI Agents and Backend Automation
LangChain for agent frameworks
CrewAI for multi-agent orchestration
Agno for multi-agent systems
The entire open source AI community
Star this repository if you find it helpful!
Watch for updates as we add more AI agents and applications!
Join the discussion in Issues to suggest new apps or improvements!