Your Personal LLM Laboratory on a Laptop
One-command setup to run your own ChatGPT-style interface with complete control. 100% open source, runs locally, experiment fast with multiple models.
meGPT is a local LLM lab that lets you:
- 🎨 Experiment Fast - ChatGPT-style UI for testing multiple models
- 🔒 Full Control - Everything runs on your machine, no data leaves
- 🔄 Hot-swap Models - Switch models at runtime, no restarts needed
- 🧪 Compare & Test - Built-in tools for model comparison, evals, RAG
- 🛡️ Red Team - Test safety and security of your deployments
- 📦 One Command - `./bootstrap.sh` and you're running
```
┌─────────────┐
│ Open WebUI  │  (ChatGPT-style interface)
│    :3000    │
└──────┬──────┘
       │
       ▼
┌─────────────┐
│   Gateway   │  (FastAPI - multi-model routing)
│    :8001    │
└──────┬──────┘
       │
       ▼
┌─────────────┐
│   Ollama    │  (LLM runtime)
│   :11434    │
└──────┬──────┘
       │
       ▼
   [Models]      (Llama2, Mistral, CodeLlama, etc.)
```
Key Design Principles:
- UI → Gateway → Ollama → Models - Clean separation of concerns
- Models as Data - Swappable at runtime, not baked into containers
- Python Everywhere - Easy to hack, extend, and understand
- Weird Ports - 3000, 8001, 11434 (easy to remember, avoid conflicts)
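The separation of concerns above boils down to the gateway translating OpenAI-style requests into Ollama calls and back. A minimal sketch of that translation step (the function and field handling here are illustrative, not the actual `gateway/main.py` code):

```python
# Illustrative sketch of the gateway's translation job: map an
# OpenAI-style chat completion body onto Ollama's /api/chat schema.
# Not the real gateway/main.py implementation.

def to_ollama_chat(openai_request: dict) -> dict:
    """Convert an OpenAI-format chat request into an Ollama /api/chat payload."""
    return {
        "model": openai_request["model"],
        # Both APIs use the same role/content message shape, so this passes through.
        "messages": openai_request["messages"],
        "stream": openai_request.get("stream", False),
    }

request = {
    "model": "llama2",
    "messages": [{"role": "user", "content": "Hello!"}],
}
payload = to_ollama_chat(request)
print(payload["model"])  # llama2
```

Because the message shapes line up, the gateway stays thin: mostly routing, logging, and small schema adjustments rather than heavy translation.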
Prerequisites:
- Docker & Docker Compose
- 8GB+ RAM recommended
- 10GB+ free disk space for models
```bash
git clone https://github.com/William0Friend/megpt.git
cd megpt
./bootstrap.sh
```

That's it! The script will:
- ✅ Check dependencies
- 🐳 Start all services
- 📦 Pull default models
- 🎉 Open your browser to http://localhost:3000
```bash
# Start services
docker compose up -d

# Pull a model
docker exec megpt-ollama ollama pull llama2

# Access the UI
open http://localhost:3000
```

| Service | URL | Purpose |
|---|---|---|
| Open WebUI | http://localhost:3000 | ChatGPT-style interface |
| Gateway API | http://localhost:8001 | FastAPI routing layer |
| Ollama API | http://localhost:11434 | LLM runtime |
- ChatGPT-style UI: Clean, familiar interface via Open WebUI
- Multi-model Support: Run Llama2, Mistral, CodeLlama, and 50+ models
- Model Swapping: Change models without restarting anything
- OpenAI-Compatible API: Use existing OpenAI SDK code
- FastAPI Gateway: Extensible routing and middleware layer
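Because the gateway speaks the OpenAI wire format, existing SDK code only needs a different `base_url`. A sketch (assumes `pip install openai`; the placeholder API key reflects an assumption that the local gateway does not validate keys):

```python
def local_client():
    # Requires: pip install openai
    from openai import OpenAI

    # Point the SDK at the local gateway instead of api.openai.com.
    # "unused" is a placeholder key; assumption: no key is checked locally.
    return OpenAI(base_url="http://localhost:8001/v1", api_key="unused")

def ask(prompt: str, model: str = "llama2") -> str:
    """Send one chat turn through the gateway and return the reply text."""
    client = local_client()
    resp = client.chat.completions.create(
        model=model,
        messages=[{"role": "user", "content": prompt}],
    )
    return resp.choices[0].message.content
```

Swapping `model="llama2"` for any pulled model is all it takes to redirect the same code at a different backend.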
- Promptfoo Evals (`./evals/`): Automated prompt testing and scoring
- RAG Support (`./rag/`): Retrieval-augmented generation examples
- Model Comparison (`./model-comparison/`): Side-by-side testing tools
- Red Team Testing (`./red-team/`): Safety and security testing
- Prompt Library (`./prompts/`): Reusable prompt templates
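The core idea behind the RAG support can be sketched in a few lines: score documents against the query, then prepend the best match as context. This toy word-overlap scorer is illustrative only, not the repo's `simple_rag.py`:

```python
# Toy retrieval step behind RAG. Illustrative only; see rag/simple_rag.py
# in the repo for the real example.

def score(query: str, doc: str) -> int:
    """Naive relevance: count lowercase words shared by query and document."""
    return len(set(query.lower().split()) & set(doc.lower().split()))

def retrieve(query: str, docs: list[str]) -> str:
    """Return the highest-scoring document for the query."""
    return max(docs, key=lambda d: score(query, d))

def augment(query: str, docs: list[str]) -> str:
    """Build the augmented prompt the LLM actually sees."""
    context = retrieve(query, docs)
    return f"Context: {context}\n\nQuestion: {query}"

docs = ["Ollama runs models locally", "FastAPI routes HTTP requests"]
print(augment("how does ollama run models?", docs))
```

A real pipeline replaces the word-overlap score with embedding similarity, but the retrieve-then-augment shape stays the same.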
Chat via the gateway API:

```bash
curl -X POST http://localhost:8001/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{
    "model": "llama2",
    "messages": [
      {"role": "user", "content": "Explain quantum computing in simple terms"}
    ]
  }'
```

Compare models side by side:

```bash
cd model-comparison
python compare_models.py \
  --models llama2,mistral,codellama \
  --question "Write a Python function to reverse a string"
```

Run Promptfoo evals:

```bash
cd evals
npm install -g promptfoo
promptfoo eval
promptfoo view
```

Try the RAG example:

```bash
cd rag
python simple_rag.py
```

Run red-team tests:

```bash
cd red-team
python red_team_tests.py --category safety --model llama2
```

Manage models:

```bash
# List installed models
docker exec megpt-ollama ollama list

# Pull more models
docker exec megpt-ollama ollama pull mistral
docker exec megpt-ollama ollama pull codellama
docker exec megpt-ollama ollama pull neural-chat

# Remove a model
docker exec megpt-ollama ollama rm llama2
```

Browse all available models: https://ollama.ai/library
Popular choices:
- llama2 - General purpose, balanced
- mistral - Strong reasoning, faster
- codellama - Code generation and analysis
- neural-chat - Conversational AI
- vicuna - Creative and detailed
- orca-mini - Lightweight, fast
Copy `.env.example` to `.env` and customize:
```bash
# Ollama
OLLAMA_HOST=0.0.0.0
OLLAMA_BASE_URL=http://localhost:11434

# Gateway
GATEWAY_PORT=8001
LOG_LEVEL=info

# WebUI
WEBUI_SECRET_KEY=change-me-in-production
ENABLE_SIGNUP=true

# Default models to pull on bootstrap
DEFAULT_MODELS=llama2,mistral,codellama
```

Edit `gateway/main.py` to:
- Add custom routing logic
- Implement model selection strategies
- Add logging and monitoring
- Integrate with external services
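A model selection strategy can be as small as a pure routing function. A sketch (the heuristic, hint list, and function name are made up for illustration; model names are from this README):

```python
# Illustrative model-selection strategy for the gateway: route
# code-looking prompts to codellama, everything else to a default.
# Not actual gateway/main.py code.

CODE_HINTS = ("def ", "class ", "function", "```", "bug", "refactor")

def pick_model(prompt: str, default: str = "llama2") -> str:
    """Choose a backend model based on simple keyword hints in the prompt."""
    lowered = prompt.lower()
    if any(hint in lowered for hint in CODE_HINTS):
        return "codellama"
    return default

print(pick_model("Refactor this function for me"))  # codellama
print(pick_model("Tell me a story"))                # llama2
```

Keeping the strategy a pure function makes it trivial to unit-test before wiring it into the request path.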
```
megpt/
├── docker-compose.yml     # Service orchestration
├── bootstrap.sh           # One-command setup
├── .env.example           # Environment template
├── gateway/               # FastAPI gateway service
│   ├── main.py            # Gateway implementation
│   ├── Dockerfile         # Gateway container
│   └── requirements.txt   # Python dependencies
├── prompts/               # Prompt templates
│   ├── system/            # System prompts
│   ├── user/              # User prompt templates
│   └── examples/          # Complete examples
├── evals/                 # Promptfoo evaluations
│   ├── promptfooconfig.yaml
│   └── README.md
├── rag/                   # RAG examples
│   ├── simple_rag.py
│   └── README.md
├── model-comparison/      # Model testing tools
│   ├── compare_models.py
│   └── README.md
└── red-team/              # Security testing
    ├── red_team_tests.py
    └── README.md
```
Add a new route to the gateway:
```python
# gateway/main.py
@app.post("/custom/endpoint")
async def custom_handler(request: Request):
    # Your logic here
    pass
```

Add custom middleware:

```python
from fastapi import Request

@app.middleware("http")
async def log_requests(request: Request, call_next):
    # Log or modify requests
    response = await call_next(request)
    return response
```

Common operations:

```bash
# View logs
docker compose logs -f

# Restart services
docker compose restart

# Stop everything
docker compose down

# Stop and remove volumes (clears data)
docker compose down -v

# Rebuild gateway after code changes
docker compose up -d --build gateway

# Check service health
curl http://localhost:8001/health

# List running containers
docker compose ps
```

Port conflicts:

```bash
# Check if ports are in use
lsof -i :3000
lsof -i :8001
lsof -i :11434

# View service logs
docker compose logs ollama
docker compose logs gateway
docker compose logs webui
```

Models won't pull:

```bash
# Check Ollama connectivity
curl http://localhost:11434/api/tags

# Try pulling manually
docker exec -it megpt-ollama bash
ollama pull llama2
```

Out of memory:

```bash
# Use smaller models
docker exec megpt-ollama ollama pull orca-mini
docker exec megpt-ollama ollama pull tinyllama

# Or increase Docker memory limit in Docker Desktop settings
```

Gateway errors:

```bash
# Verify gateway is running
curl http://localhost:8001/health

# Check gateway logs
docker compose logs gateway

# Restart gateway
docker compose restart gateway
```

vs. ChatGPT:
- ✅ Runs locally, no API costs
- ✅ Complete privacy, no data sent out
- ✅ Customizable, hackable, extendable
- ❌ Requires your own hardware
- ❌ Local models are smaller and less capable than GPT-4
vs. Raw Ollama:
- ✅ Better UI (Open WebUI vs CLI)
- ✅ Gateway layer for routing/logging
- ✅ Built-in eval & testing tools
- ✅ Example prompts and workflows
- ✅ Reproducible setup
vs. Other Local Solutions:
- ✅ Simpler architecture
- ✅ Faster iteration
- ✅ Better documentation
- ✅ Python-focused (easy to hack)
- ✅ Production-ready patterns
- Multi-model routing (A/B testing)
- Built-in vector database for RAG
- Observability dashboard
- Fine-tuning integration
- Cloud deployment templates
- API key management
- Rate limiting
- Cost tracking (even for local!)
- Prompt versioning
- Model performance analytics
Contributions welcome! Whether it's:
- 🐛 Bug fixes
- ✨ New features
- 📚 Documentation improvements
- 🧪 More eval examples
- 🎨 UI enhancements
See issues for ideas, or open a PR!
MIT License - do whatever you want with it!
- Ollama - LLM runtime
- Open WebUI - ChatGPT-style interface
- FastAPI - Python API framework
- Promptfoo - LLM testing and evals
- OWASP LLM Top 10 - LLM security
Built with ❤️ using:
- Ollama for LLM runtime
- Open WebUI for the interface
- FastAPI for the gateway
- Docker for packaging
Happy experimenting! 🚀
Questions? Issues? Ideas? Open an issue!