AgentGraph: AI Architecture Consultant

A multi-agent system for real-time system design, security evaluation, and architecture optimization using Meta Llama 3.3 70B and LangGraph. Generates Mermaid diagrams, identifies vulnerabilities, and iteratively improves designs for cost-effectiveness and security.

Note

Learning Project: This repository is an active learning playground for mastering advanced Agentic AI patterns. It serves as a hands-on exploration of:

LangGraph — Multi-agent orchestration and state management
LangSmith — LLM observability, tracing, and prompt management

The project is continuously evolving as I deepen my understanding of production-grade agent architectures. Contributions and feedback are welcome!

Features

Multi-Agent System

4 Specialized Agents working in a coordinated loop:
- Architect — Generates system architecture with cost estimates
- Security — Identifies vulnerabilities and mitigation strategies
- Evaluator — Scores designs on security (40%), feasibility (30%), cost-effectiveness (30%)
- Mermaid Validator — Ensures diagram syntax correctness

Professional UI/UX

Minimal Aesthetic Streamlit UI — Clean, modern interface with real-time agent outputs
Vertical Mermaid Diagrams — graph TD layout for better readability (prevents horizontal compression)
Executive Session Summary — AI-generated chronological narrative of the entire design session

Engineering Best Practices

Prompt Registry Pattern — Clean separation of prompt logic from agent implementation (src/prompts/)
Dynamic LangSmith Tracing — Runtime-configurable with dynamic UUIDs for each session
Loop Prevention — Convergence detection via similarity tracking, feedback deduplication, and score stagnation detection
Robust Error Handling — Specific exception handling for JSON parsing and validation errors
Full English Localization — Professional English throughout (no mixed languages)

Cost-Conscious Design

Budget-aware architecture recommendations in Thai Baht (฿)
Pragmatic MVP-first approach (80% users, 20% features)
Small team feasibility assessment (3-5 engineers)

Installation

Prerequisites

Python 3.8 or higher
pip package manager
Novita API key (for Meta Llama 3.3 70B access)

Setup

Clone the repository:

git clone https://github.com/sitta07/AgentGraph.git
cd AgentGraph

Create a Python virtual environment:

python3 -m venv venv
source venv/bin/activate  # On Windows: venv\Scripts\activate

Install dependencies:

pip install -r requirements.txt

Configure environment variables:

# Create .env file in project root
echo "NOVITA_API_KEY=your_api_key_here" > .env

Quick Start

Web Interface (Streamlit)

streamlit run app.py

Then open your browser to http://localhost:8501.

Define your system requirements in the sidebar and set a monthly operating budget. The system will automatically:

Generate an initial architecture design
Run security validation
Evaluate against multiple criteria
Refine iteratively until passing (score ≥ 9.5/10)
Display Mermaid diagrams for each iteration
Generate an Executive Session Summary

CLI Execution

python main.py

The CLI outputs detailed logs from each agent and saves diagrams to the /diagrams directory.

Configuration

Environment Variables

Variable	Required	Description
`NOVITA_API_KEY`	✅	Your API key for Llama 3.3 70B access via Novita
`LLM_BASE_URL`	❌	API base URL (default: `https://api.novita.ai/v3/openai`)
`MODEL_NAME`	❌	Model identifier (default: `meta-llama/llama-3.3-70b-instruct`)
`MODEL_TEMPERATURE`	❌	LLM temperature (default: `0.2`)
`DEFAULT_BUDGET_THB`	❌	Default monthly budget in THB (default: `5000.0`)
`LANGSMITH_PROMPT_NAME`	❌	LangSmith prompt name (default: `agentgraph-architect`)

Agent Tuning

Key parameters in src/graph/nodes.py:

Parameter	Value	Description
Model	`meta-llama/llama-3.3-70b-instruct`	Via Novita API
Temperature	`0.2`	Low randomness for consistency
Passing Threshold	`≥ 9.5/10.0`	Near-perfect score required
Max Revisions	`3`	Iterations before stopping
Output Similarity	`≥ 90%`	Loop prevention threshold
Feedback Repetition	`≥ 85%`	Loop prevention threshold

Project Structure

AgentGraph/
├── app.py                          # Streamlit web UI with final summary generation
├── main.py                         # Graph builder & CLI execution
├── requirements.txt                # Python dependencies
├── .env                           # Environment variables (not in repo)
├── src/
│   ├── graph/
│   │   ├── state.py               # GraphState TypedDict & EvaluationRubric
│   │   ├── nodes.py               # 4 agent nodes + routing logic
│   │   └── __init__.py
│   ├── prompts/                   # 🆕 Prompt Registry
│   │   ├── __init__.py
│   │   ├── architect_prompts.py   # Architect agent prompts
│   │   ├── security_prompts.py   # Security agent prompts
│   │   └── evaluator_prompts.py  # Evaluator agent prompts + session summary
│   └── utils/
│       ├── convergence_utils.py    # Loop prevention functions
│       ├── cost_utils.py           # Cost calculation utilities
│       ├── diagram_generator.py    # Mermaid extraction & rendering
│       └── __init__.py
├── diagrams/                       # Generated architecture diagrams (PNG)
├── evals/                          # Evaluation rubrics & reports
└── README.md

How It Works

┌─────────────┐     ┌─────────────────┐     ┌─────────────┐
│  Architect  │────▶│ Mermaid Validator│────▶│   Security  │
└─────────────┘     └─────────────────┘     └─────────────┘
      ▲                                             │
      │                                             ▼
      │                                        ┌─────────────┐
      │                                        │  Evaluator  │
      │                                        └─────────────┘
      │                                               │
      │◀────────────────── feedback (if failed) ◀────┘

Architect → Generates initial graph TD Mermaid diagram with cost breakdown
Mermaid Validator → Checks syntax; returns to Architect if invalid
Security → Identifies vulnerabilities and pragmatic mitigations
Evaluator → Scores 0-10.0 based on security, feasibility, cost; requires ≥ 9.5 to pass
Loop Prevention → Stops iteration if converged, repeated feedback, or stagnant scores

Dependencies

Package	Purpose
`langgraph`	Multi-agent orchestration framework
`langchain-core`	Message types and output parsing
`langchain-openai`	OpenAI-compatible LLM interface
`langsmith`	LLM observability and tracing
`streamlit`	Web UI framework
`python-dotenv`	Environment variable management
`requests`	HTTP requests for Mermaid API
`pydantic`	Data validation

See requirements.txt for pinned versions.

Contributing

We welcome contributions! Please:

Fork the repository
Create a feature branch (git checkout -b feature/your-feature)
Make focused changes with clear commit messages
Ensure Python compiles without errors (python3 -m py_compile src/**/*.py)
Submit a pull request with a description of your changes

Areas for Contribution

Additional agent types (e.g., Cost Optimizer, Performance Analyst)
Enhanced Mermaid syntax support
Alternative LLM providers (OpenAI, Anthropic, local models)
Additional evaluation criteria
Improved loop prevention heuristics
Multi-language localization

License

MIT License (2026) — See LICENSE file for details

Built with: LangGraph + Llama 3.3 70B + LangSmith + Streamlit

Questions? Review agent prompts in src/prompts/ or check the conversation history in the UI.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

AgentGraph: AI Architecture Consultant

Features

Multi-Agent System

Professional UI/UX

Engineering Best Practices

Cost-Conscious Design

Installation

Prerequisites

Setup

Quick Start

Web Interface (Streamlit)

CLI Execution

Configuration

Environment Variables

Agent Tuning

Project Structure

How It Works

Dependencies

Contributing

Areas for Contribution

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
evals/evaluators		evals/evaluators
src		src
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
app.py		app.py
main.py		main.py
requirements.txt		requirements.txt

Folders and files

Latest commit

History

Repository files navigation

AgentGraph: AI Architecture Consultant

Features

Multi-Agent System

Professional UI/UX

Engineering Best Practices

Cost-Conscious Design

Installation

Prerequisites

Setup

Quick Start

Web Interface (Streamlit)

CLI Execution

Configuration

Environment Variables

Agent Tuning

Project Structure

How It Works

Dependencies

Contributing

Areas for Contribution

License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages