🧠 Advanced AI Agent System

Multi-Strategy AI Reasoning System implementing cutting-edge techniques from recent AI research papers. Built with Groq LLM, Tavily Search, and ChromaDB for production-ready AI agent capabilities.

🎯 Key Features

Feature	Description
🔗 Chain-of-Thought	Step-by-step reasoning with self-consistency voting
🌳 Tree-of-Thoughts	Multi-path exploration with beam search
⚡ ReAct Agent	Reasoning + Acting with real web search
👥 Multi-Agent	Planner → Worker → Critic collaboration
🧠 LLM Auto-Classifier	Intelligent strategy routing based on task type
🌐 Real Web Search	Tavily API integration for live information
💾 Vector Memory	ChromaDB for persistent knowledge storage
🛡️ Rate Limiting	API protection (10/min, 100/day per user)
🌊 Streaming	Real-time response streaming

🚀 Live Demo

Try it now: https://huggingface.co/spaces/SaiTejaSrivilli/ai-agent-system

📊 System Architecture

┌─────────────────────────────────────────────────────────────┐
│                      User Input                              │
└─────────────────────────┬───────────────────────────────────┘
                          │
                          ▼
┌─────────────────────────────────────────────────────────────┐
│              🧠 LLM-Based Auto-Classifier                    │
│         (Intelligent routing based on task analysis)         │
└─────────────────────────┬───────────────────────────────────┘
                          │
          ┌───────────────┼───────────────┬───────────────┐
          ▼               ▼               ▼               ▼
   ┌─────────────┐ ┌─────────────┐ ┌─────────────┐ ┌─────────────┐
   │   Chain-of  │ │   Tree-of   │ │   ReAct     │ │   Multi-    │
   │   Thought   │ │   Thoughts  │ │   Agent     │ │   Agent     │
   │             │ │             │ │             │ │             │
   │ • 3 paths   │ │ • Beam=3    │ │ • Search    │ │ • Planner   │
   │ • Voting    │ │ • Depth=3   │ │ • Memory    │ │ • Worker    │
   │ • Consensus │ │ • Scoring   │ │ • Tools     │ │ • Critic    │
   └─────────────┘ └─────────────┘ └─────────────┘ └─────────────┘
          │               │               │               │
          └───────────────┴───────────────┴───────────────┘
                                  │
                                  ▼
┌─────────────────────────────────────────────────────────────┐
│                    📤 Final Response                         │
│              (Answer + Metadata + Confidence)                │
└─────────────────────────────────────────────────────────────┘

🔄 Auto-Classification System

The system uses an LLM-based classifier (not keyword matching) to intelligently route queries:

def _classify(self, task: str) -> str:
    """LLM-based intelligent task classification."""
    classify_prompt = f"""Classify this task into ONE category:
    - cot: Math problems, calculations, logic puzzles
    - tot: Creative tasks, design, brainstorming
    - react: Research questions, factual queries, current events
    - multi: Complex writing, essays, detailed analysis
    
    Task: "{task}"
    Category:"""
    
    response = self.llm.generate(classify_prompt, temperature=0.1)
    # Returns: cot, tot, react, or multi

Classification Examples

Query	Strategy	Reason
"Calculate 15% tip on $85"	Chain-of-Thought	Math calculation
"Design a logo for a coffee shop"	Tree-of-Thoughts	Creative task
"What are the latest AI developments?"	ReAct Agent	Research/current events
"Write an analysis of remote work"	Multi-Agent	Complex writing task
"Explain quantum computing"	ReAct Agent	Factual explanation
"Create a marketing strategy"	Multi-Agent	Complex planning

🛠️ Technical Implementation

1. Chain-of-Thought (CoT) with Self-Consistency

# Generates 3 independent reasoning paths
# Uses majority voting for final answer
# Smart answer extraction with money detection ($5 not just 5)

Example Output:
- Path 1: "$5" (via step-by-step calculation)
- Path 2: "$5" (via different approach)  
- Path 3: "$5" (via verification)
- Final: "$5" with 100% confidence

2. Tree-of-Thoughts (ToT) with Beam Search

# Explores multiple solution branches
# Beam width: 3, Depth: 3
# Scores and prunes paths for best solutions

Example: "Design an AI fitness feature"
- Branch 1: Personalized workout AI (Score: 8.5)
- Branch 2: Real-time form correction (Score: 9.0) ← Selected
- Branch 3: Social fitness challenges (Score: 7.5)

3. ReAct Agent with Tools

# Available Tools:
# - web_search: Tavily API for real-time info
# - memory_search: ChromaDB vector search
# - memory_store: Save important information
# - calculate: Math operations

# Reasoning loop:
Thought → Action → Observation → Thought → ... → Answer

4. Multi-Agent Collaboration

# Three specialized agents:
# 1. Planner: Creates execution plan
# 2. Worker: Executes tasks with full content
# 3. Critic: Reviews and improves quality

# Produces direct content, not descriptions

🛡️ Rate Limiting

Protects API usage with per-user limits:

Limit	Value	Purpose
Per Minute	10 requests	Prevents spam
Per Day	100 requests	Protects daily quota

class RateLimiter:
    def __init__(self, max_per_minute=10, max_per_day=100):
        # Tracks requests by user IP
        # Shows friendly messages when limited
        # Displays current usage stats

📦 Installation

Local Development

# Clone repository
git clone https://github.com/SaiTejaSrivilli/ai-agent-system.git
cd ai-agent-system

# Install dependencies
pip install -r requirements.txt

# Set environment variables
export GROQ_API_KEY="your-groq-key"
export TAVILY_API_KEY="your-tavily-key"  # Optional

# Run
python app.py

Deploy to HuggingFace Spaces

Create a new Space on HuggingFace
Upload app.py and requirements.txt
Add secrets in Settings:
- GROQ_API_KEY (Required)
- TAVILY_API_KEY (Optional - for real web search)
Space will auto-deploy

🔑 API Keys

Key	Required	Free Tier	Get It
`GROQ_API_KEY`	✅ Yes	✅ Generous	console.groq.com
`TAVILY_API_KEY`	❌ Optional	✅ 1000/month	tavily.com

📁 Project Structure

ai-agent-system/
├── app.py              # Main application (~1200 lines)
├── requirements.txt    # Dependencies
├── README.md          # Documentation
└── LICENSE            # MIT License

Code Organization (app.py)

Lines 1-50:      Imports & Configuration
Lines 51-120:    LLM Client with Streaming
Lines 121-250:   Web Search Tool (Tavily + Fallback)
Lines 251-350:   Vector Memory (ChromaDB)
Lines 351-500:   Chain-of-Thought Reasoner
Lines 501-650:   Tree-of-Thoughts Reasoner
Lines 651-800:   ReAct Agent
Lines 801-950:   Multi-Agent System
Lines 951-1050:  Creative Agent (Orchestrator)
Lines 1051-1200: Gradio UI & Event Handlers

📊 Example Results

Chain-of-Thought (Math)

Query: "A bakery sells cupcakes for $3. Tom buys 5 and pays with $20. How much change?"

Reasoning:
Step 1: Cost = 5 × $3 = $15
Step 2: Change = $20 - $15 = $5

Answer: $5
Confidence: high (3/3 paths agreed)

ReAct Agent (Research)

Query: "What is Chain of Thought prompting?"

Thought: I need to search for information about CoT prompting
Action: web_search("Chain of Thought prompting AI")
Observation: [Search results about CoT...]
Thought: I found relevant information, let me synthesize
Answer: Chain of Thought prompting is a technique where...

Tools Used: web_search, memory_store

Multi-Agent (Complex Writing)

Query: "Write an analysis of remote work benefits"

Planner: Created 3-section plan
Worker: Generated full analysis content
Critic: Improved clarity and added examples

Output: [Complete 500+ word analysis]
Quality Score: 8.5/10

🔬 Research Papers Implemented

Paper	Authors	Year	Technique
Chain-of-Thought Prompting	Wei et al.	2022	Step-by-step reasoning
Self-Consistency	Wang et al.	2022	Multiple paths + voting
Tree of Thoughts	Yao et al.	2023	Tree search reasoning
ReAct	Yao et al.	2022	Reasoning + Acting

🎯 Skills Demonstrated

This project showcases:

AI/ML Engineering: LLM integration, prompt engineering, agent architectures
Software Architecture: Clean code, modular design, error handling
API Integration: Groq, Tavily, HuggingFace APIs
Full-Stack Development: Gradio UI, async processing, streaming
Production Practices: Rate limiting, graceful degradation, logging
Research Implementation: Converting academic papers to working code

🤝 Contributing

Contributions welcome! Please:

Fork the repository
Create a feature branch
Submit a pull request

📄 License

MIT License - see LICENSE for details.

👤 Author

Sai Teja Srivilli

🔗 LinkedIn
📂 GitHub
🤗 HuggingFace

⭐ Star this repo if you find it useful!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🧠 Advanced AI Agent System

🎯 Key Features

🚀 Live Demo

📊 System Architecture

🔄 Auto-Classification System

Classification Examples

🛠️ Technical Implementation

1. Chain-of-Thought (CoT) with Self-Consistency

2. Tree-of-Thoughts (ToT) with Beam Search

3. ReAct Agent with Tools

4. Multi-Agent Collaboration

🛡️ Rate Limiting

📦 Installation

Local Development

Deploy to HuggingFace Spaces

🔑 API Keys

📁 Project Structure

Code Organization (app.py)

📊 Example Results

Chain-of-Thought (Math)

ReAct Agent (Research)

Multi-Agent (Complex Writing)

🔬 Research Papers Implemented

🎯 Skills Demonstrated

🤝 Contributing

📄 License

👤 Author

About

Uh oh!

Releases

Packages

Uh oh!

Contributors 1

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
GITHUB_README.md		GITHUB_README.md
LICENSE		LICENSE
README.md		README.md
app.py		app.py
requirements.txt		requirements.txt

Folders and files

Latest commit

History

Repository files navigation

🧠 Advanced AI Agent System

🎯 Key Features

🚀 Live Demo

📊 System Architecture

🔄 Auto-Classification System

Classification Examples

🛠️ Technical Implementation

1. Chain-of-Thought (CoT) with Self-Consistency

2. Tree-of-Thoughts (ToT) with Beam Search

3. ReAct Agent with Tools

4. Multi-Agent Collaboration

🛡️ Rate Limiting

📦 Installation

Local Development

Deploy to HuggingFace Spaces

🔑 API Keys

📁 Project Structure

Code Organization (app.py)

📊 Example Results

Chain-of-Thought (Math)

ReAct Agent (Research)

Multi-Agent (Complex Writing)

🔬 Research Papers Implemented

🎯 Skills Demonstrated

🤝 Contributing

📄 License

👤 Author

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors 1

Languages

Packages