🤖 Autonomous Dev Agent

An AI-powered autonomous development system that uses multiple intelligent agents to build complete software projects from natural language descriptions. Built with LangGraph, Google Gemini, and Python.

🚀 Features

📋 PM Agent - Generates Product Requirements Documents from user prompts
🏗️ Architect Agent - Designs system architecture and technology stack
💻 Coder Agent - Generates production-ready code
👀 Reviewer Agent - Performs comprehensive code reviews
🧪 Tester Agent - Generates tests and verifies functionality
✅ Critic Agent - Final meta-review and quality assurance
🌐 REST API - FastAPI backend for integration
🎨 Web UI - Streamlit dashboard for visualization
🔄 Conditional Routing - Intelligent agent orchestration with LangGraph
💾 State Persistence - Checkpoint-based state management
🔄 Retry Logic - Automatic error recovery
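
The README does not describe how the retry logic works. As a rough illustration only, automatic error recovery around an agent call might look like the sketch below. The helper name `with_retries`, its parameters, and the exponential backoff policy are all assumptions, not the project's actual code.

```python
import time
from typing import Callable, TypeVar

T = TypeVar("T")


def with_retries(
    call: Callable[[], T],
    max_attempts: int = 3,
    base_delay: float = 0.5,
    sleep: Callable[[float], None] = time.sleep,
) -> T:
    """Run `call`, retrying on any exception with exponential backoff.

    Hypothetical helper: the name, defaults, and policy are illustrative.
    """
    if max_attempts < 1:
        raise ValueError("max_attempts must be at least 1")
    for attempt in range(1, max_attempts + 1):
        try:
            return call()
        except Exception:
            if attempt == max_attempts:
                raise  # Out of attempts: surface the last error to the caller.
            # Wait base_delay, then 2x, 4x, ... before the next attempt.
            sleep(base_delay * 2 ** (attempt - 1))
    raise AssertionError("unreachable")
```

Passing `sleep` as a parameter keeps the helper easy to test without real delays.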

📋 Architecture

User Request
    ↓
┌─────────────────────────────────────┐
│  PM Agent (Requirements)            │
└──────────────┬──────────────────────┘
               ↓
┌─────────────────────────────────────┐
│  Architect Agent (Design)           │
│  ⏸️ [Human Review Checkpoint]        │
└──────────────┬──────────────────────┘
               ↓
┌─────────────────────────────────────┐
│  Coder Agent (Code Generation)      │
└──────────────┬──────────────────────┘
               ↓
┌─────────────────────────────────────┐
│  Reviewer Agent (Code Review)       │
│  ↻ [Conditional: Pass?]             │
└──────────────┬──────────────────────┘
               ↓ (if pass)
┌─────────────────────────────────────┐
│  Tester Agent (Test Generation)     │
│  ↻ [Conditional: Tests Pass?]       │
└──────────────┬──────────────────────┘
               ↓ (if pass)
┌─────────────────────────────────────┐
│  Critic Agent (Meta-Review)         │
│  ↻ [Conditional: Rollback needed?]  │
└──────────────┬──────────────────────┘
               ↓ (if approved)
┌─────────────────────────────────────┐
│  ✅ Final Review & Delivery         │
└─────────────────────────────────────┘
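
The conditional steps in the diagram above can be sketched as plain routing functions. This is a minimal illustration, not the contents of `graph/conditional_routing.py`: the state keys (`review_passed`, `tests_passed`, `critic_approved`, `rollback_to`) and the choice to loop back to the Coder Agent on failure are assumptions.

```python
from typing import Callable, Dict

# Stage names mirror the diagram; the state keys below are hypothetical.
State = Dict[str, object]


def route_after_reviewer(state: State) -> str:
    # Assumption: a failed review sends the work back to the Coder Agent.
    return "tester" if state.get("review_passed") else "coder"


def route_after_tester(state: State) -> str:
    # Assumption: failing tests also loop back to the Coder Agent.
    return "critic" if state.get("tests_passed") else "coder"


def route_after_critic(state: State) -> str:
    # Assumption: the critic names the stage to roll back to, defaulting to "coder".
    if state.get("critic_approved"):
        return "delivery"
    return str(state.get("rollback_to", "coder"))


# Unconditional edges go straight to the next stage.
LINEAR_EDGES: Dict[str, str] = {
    "pm": "architect",
    "architect": "coder",
    "coder": "reviewer",
}
CONDITIONAL_EDGES: Dict[str, Callable[[State], str]] = {
    "reviewer": route_after_reviewer,
    "tester": route_after_tester,
    "critic": route_after_critic,
}


def next_stage(current: str, state: State) -> str:
    """Return the stage that follows `current` for the given state."""
    if current in LINEAR_EDGES:
        return LINEAR_EDGES[current]
    return CONDITIONAL_EDGES[current](state)
```

In a LangGraph workflow, functions of this shape are usually registered on the graph with `add_conditional_edges`. The lookup tables here only stand in for that wiring.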

🛠️ Tech Stack

LLM: Google Gemini 2.5 Flash
Orchestration: LangGraph
API Framework: FastAPI
Frontend: Streamlit
Database: SQLite (state persistence)
Code Execution: E2B Sandbox
State Management: Pydantic
Testing: pytest

📁 Project Structure

langgrpah/
├── agents/              # Agent implementations
│   ├── pm_agent.py
│   ├── architect_agent.py
│   ├── coder_agent.py
│   ├── reviewer_agent.py
│   ├── tester_agent.py
│   └── critic_agent.py
├── graph/              # LangGraph workflow
│   ├── state.py        # Pydantic state model
│   ├── builder.py      # Graph construction
│   └── conditional_routing.py
├── tools/              # Utility tools
│   ├── code_executor.py
│   ├── file_generator.py
│   └── validators.py
├── api/                # FastAPI endpoints
│   └── routes.py
├── ui/                 # Streamlit frontend
│   └── app.py
├── tests/              # Test suite
│   └── test_agents.py
├── config.py           # Configuration
├── main.py             # CLI entry point
├── requirements.txt    # Dependencies
└── .env.example        # Environment template

🚀 Quick Start

1. Clone & Setup

# Clone the repository
git clone <repo_url>
cd langgrpah

# Create virtual environment
python -m venv venv
source venv/bin/activate  # On Windows: venv\Scripts\activate

# Install dependencies
pip install -r requirements.txt

2. Configure Environment

# Copy the example env file
cp .env.example .env

# Edit .env with your API keys
# ANTHROPIC_API_KEY=your_key_here
# E2B_API_KEY=your_key_here

3. Run the System

Choose how to run the system:

Option A: Streamlit UI (Recommended for first use)

python main.py ui
# Open browser to http://localhost:8501

Option B: FastAPI Server

python main.py api
# Open browser to http://localhost:8000/docs

Option C: Command Line

python main.py pipeline "Build a simple REST API for task management"

📖 Usage Examples

Via CLI

python main.py pipeline "Build a user authentication system with FastAPI"

Via Python

from graph.state import AgentState
from graph.builder import get_graph

# Create state
state = AgentState(
    user_prompt="Build a todo API with authentication"
)

# Get graph and run
graph = get_graph()
result = graph.invoke(state.dict())

print(result['generated_files_list'])

Via API

# Start pipeline
curl -X POST http://localhost:8000/pipeline/start \
  -H "Content-Type: application/json" \
  -d '{"prompt": "Build a REST API"}'

# Check status (replace {session_id} with your pipeline's session ID)
curl http://localhost:8000/pipeline/{session_id}/status

# Get generated code
curl http://localhost:8000/pipeline/{session_id}/output
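
The same calls can be made from Python with only the standard library. The sketch below builds the requests for the endpoints shown in the curl examples. The helper names are hypothetical, and because the response format is not documented here, `send_json` only assumes that the server replies with a JSON object.

```python
import json
import urllib.request

BASE_URL = "http://localhost:8000"  # Default address from "Option B" above.


def build_start_request(prompt: str, base_url: str = BASE_URL) -> urllib.request.Request:
    """Build the POST request that starts a pipeline run."""
    body = json.dumps({"prompt": prompt}).encode("utf-8")
    return urllib.request.Request(
        f"{base_url}/pipeline/start",
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def build_status_request(session_id: str, base_url: str = BASE_URL) -> urllib.request.Request:
    """Build the GET request that checks a run's status."""
    return urllib.request.Request(f"{base_url}/pipeline/{session_id}/status", method="GET")


def send_json(request: urllib.request.Request, timeout: float = 30.0) -> dict:
    """Send a request and decode the JSON body (needs the API server running).

    Assumption: the endpoint responds with a JSON object.
    """
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

With the server running, `send_json(build_start_request("Build a REST API"))` would return whatever the start endpoint responds with.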

🤝 Agent Workflow Details

PM Agent

Input: User natural language request
Output: Product Requirements Document

The PM Agent:

Analyzes the user's request
Extracts requirements and features
Identifies constraints and scope
Creates a comprehensive PRD

Architect Agent

Input: PRD from PM Agent
Output: Technical Design Document

The Architect Agent:

Proposes technology stack
Designs API endpoints
Plans database schema
Creates system architecture
Awaits human approval before proceeding

Coder Agent

Input: Technical design and requirements
Output: Production-ready code files

The Coder Agent:

Implements all features
Writes clean, documented code
Follows software engineering best practices
Generates multiple files (main.py, config.py, database.py, api.py, utils.py)
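
Writing several generated files to disk can be sketched as follows. This is an illustrative helper, not the contents of `tools/file_generator.py`: the function name, the `{relative_path: content}` input shape, and the path check are assumptions.

```python
from pathlib import Path
from typing import Dict, List


def write_generated_files(files: Dict[str, str], output_dir: str) -> List[str]:
    """Write {relative_path: content} under output_dir and return the written paths.

    Hypothetical helper; the project's real file generator may differ.
    """
    root = Path(output_dir).resolve()
    written: List[str] = []
    for relative_path, content in files.items():
        target = (root / relative_path).resolve()
        # Refuse paths that escape the output directory (e.g. "../secrets.py").
        if root not in target.parents:
            raise ValueError(f"unsafe path: {relative_path}")
        target.parent.mkdir(parents=True, exist_ok=True)
        target.write_text(content, encoding="utf-8")
        written.append(target.relative_to(root).as_posix())
    return sorted(written)
```

The path check matters because file names come from model output and should not be trusted to stay inside the project folder.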

Reviewer Agent

Input: Generated code
Output: Code review feedback

The Reviewer Agent:

Checks code quality
Validates best practices
Identifies issues and improvements
Either approves or sends back to Coder

Tester Agent

Input: Generated code
Output: Test files and execution results

The Tester Agent:

Generates comprehensive test suites
Creates unit, integration, and API tests
Executes tests in sandbox
Reports test coverage and results

Critic Agent

Input: All previous outputs
Output: Final approval or rollback decision

The Critic Agent:

Performs meta-review of entire pipeline
Assesses requirement coverage
Evaluates architecture quality
Makes final go/no-go decision
Can trigger rollback to earlier stages

🧪 Testing

Run the test suite:

# Install test dependencies (pytest-cov provides the --cov option used below)
pip install pytest pytest-cov

# Run all tests
pytest tests/

# Run specific test
pytest tests/test_agents.py::TestPMAgent

# Run with coverage
pytest --cov=agents --cov=graph --cov=tools tests/

🔄 State Management

The system uses a centralized AgentState Pydantic model that:

Flows through all agents
Maintains consistency
Enables checkpointing
Can be persisted to database
Validates data types

All agents read from and write to this shared state.
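
To make checkpointing concrete, the sketch below saves a state object to SQLite and loads it back. It uses a standard-library dataclass as a stand-in for the Pydantic `AgentState`, with only the field names that appear in this README. The field types, the `checkpoints` table, and the function names are assumptions.

```python
import json
import sqlite3
from dataclasses import asdict, dataclass, field
from typing import List, Optional


@dataclass
class AgentStateSketch:
    """Stand-in for the Pydantic AgentState; field types are assumed."""

    user_prompt: str
    generated_files_list: List[str] = field(default_factory=list)
    execution_history: List[dict] = field(default_factory=list)
    total_tokens_used: int = 0
    last_error: Optional[str] = None


def _ensure_table(conn: sqlite3.Connection) -> None:
    # Hypothetical schema: one JSON blob per session.
    conn.execute(
        "CREATE TABLE IF NOT EXISTS checkpoints "
        "(session_id TEXT PRIMARY KEY, state TEXT NOT NULL)"
    )


def save_checkpoint(conn: sqlite3.Connection, session_id: str, state: AgentStateSketch) -> None:
    """Store (or overwrite) the latest state for a session."""
    _ensure_table(conn)
    conn.execute(
        "INSERT OR REPLACE INTO checkpoints (session_id, state) VALUES (?, ?)",
        (session_id, json.dumps(asdict(state))),
    )
    conn.commit()


def load_checkpoint(conn: sqlite3.Connection, session_id: str) -> Optional[AgentStateSketch]:
    """Return the saved state for a session, or None if none exists."""
    _ensure_table(conn)
    row = conn.execute(
        "SELECT state FROM checkpoints WHERE session_id = ?", (session_id,)
    ).fetchone()
    return AgentStateSketch(**json.loads(row[0])) if row else None
```

A run could call `save_checkpoint` after each agent finishes, so an interrupted pipeline can resume from the last saved state.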

📊 Monitoring & Logging

The system logs all executions:

state.execution_history  # List of all agent executions
state.total_tokens_used  # Total tokens consumed
state.last_error         # Last error message (if any)
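
These fields can feed simple reports. As one example, the sketch below sums token usage per agent from `execution_history`. The entry shape (dictionaries with `"agent"` and `"tokens"` keys) is an assumption, since the README does not specify it.

```python
from collections import defaultdict
from typing import Dict, Iterable, Mapping


def tokens_by_agent(execution_history: Iterable[Mapping[str, object]]) -> Dict[str, int]:
    """Sum token usage per agent.

    Assumes each entry has an "agent" key and an optional "tokens" key
    (hypothetical shape, not confirmed by the project).
    """
    totals: Dict[str, int] = defaultdict(int)
    for entry in execution_history:
        totals[str(entry["agent"])] += int(entry.get("tokens", 0))
    return dict(totals)
```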

🔐 Security

Input validation on all APIs
Environment variable management for secrets
Sandboxed code execution (E2B)
Rate limiting (can be added)
CORS configuration (can be added)
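
As an illustration of input validation, a prompt check might look like the sketch below. The function name and the length limit are assumptions. In the FastAPI layer, this kind of rule would more typically be declared on a Pydantic request model.

```python
MAX_PROMPT_CHARS = 4000  # Illustrative limit, not taken from the project.


def validate_prompt(prompt: object) -> str:
    """Return the trimmed prompt, or raise ValueError if it is unusable."""
    if not isinstance(prompt, str):
        raise ValueError("prompt must be a string")
    cleaned = prompt.strip()
    if not cleaned:
        raise ValueError("prompt must not be empty")
    if len(cleaned) > MAX_PROMPT_CHARS:
        raise ValueError(f"prompt exceeds {MAX_PROMPT_CHARS} characters")
    return cleaned
```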

🐳 Docker Support

# Build
docker build -t langgrpah .

# Run API
docker run -e ANTHROPIC_API_KEY=xxx -p 8000:8000 langgrpah python main.py api

# Run UI
docker run -e ANTHROPIC_API_KEY=xxx -p 8501:8501 langgrpah python main.py ui

📈 Performance Optimization

Parallel Agent Execution: Some agents could run in parallel
Caching: Cache agent outputs for similar requests
Token Optimization: Monitor token usage per agent
Async Operations: Already using async in FastAPI
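
The caching idea can be sketched with an in-memory cache keyed by agent name and a normalised prompt. This is a hypothetical design, not an existing project feature. It only matches prompts that are identical after lowercasing and whitespace cleanup, which is a much narrower notion of "similar" than semantic matching.

```python
import hashlib
from typing import Callable, Dict


def prompt_cache_key(agent_name: str, prompt: str) -> str:
    """Build a stable key; case and whitespace are normalised first."""
    normalised = " ".join(prompt.lower().split())
    return hashlib.sha256(f"{agent_name}\n{normalised}".encode("utf-8")).hexdigest()


class AgentOutputCache:
    """In-memory cache of agent outputs (hypothetical, for illustration)."""

    def __init__(self) -> None:
        self._store: Dict[str, str] = {}
        self.hits = 0
        self.misses = 0

    def get_or_compute(
        self, agent_name: str, prompt: str, compute: Callable[[str], str]
    ) -> str:
        """Return the cached output, calling `compute` only on a miss."""
        key = prompt_cache_key(agent_name, prompt)
        if key in self._store:
            self.hits += 1
            return self._store[key]
        self.misses += 1
        self._store[key] = compute(prompt)
        return self._store[key]
```

Including the agent name in the key keeps the PM Agent's output for a prompt separate from the Architect Agent's output for the same prompt.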

🎯 Future Enhancements

Multi-modal input (voice, images)
Real-time websocket updates for UI
Database persistence with SQLAlchemy
More sophisticated rollback logic
Cost tracking and optimization
Plugin system for custom agents
Version control integration
Deployment automation (Docker, K8s)

🤝 Contributing

Contributions welcome! Areas for improvement:

Additional agents (DevOps, Security, Performance)
Enhanced test coverage
Better error handling
Performance improvements
Documentation

📝 License

MIT License - See LICENSE file for details

💡 How It Works

1. User submits request - "Build a task management API"
2. PM Agent analyzes - Creates detailed Requirements Document with features, scope, constraints
3. Architect designs - Creates Technical Design with tech stack, API endpoints, database schema
   - ⏸️ Human Review - User reviews and approves/rejects architecture
4. Coder generates - Writes clean, production-ready code based on design
5. Reviewer checks - Reviews code for quality, best practices, and issues
6. Tester validates - Generates tests and verifies functionality
7. Critic reviews - Final meta-review with approval or rollback decision
8. Delivery - Complete codebase ready for use

🙋 Support

For issues, questions, or suggestions:

Open an issue on GitHub
Check existing documentation
Review example prompts

🎓 Learning Resources

Made with ❤️ using AI and Python
