Enterprise-level AI Development Framework - Built with FastAPI, LangGraph, and LiteLLM, supporting multi-tenant model management, intelligent memory systems, and streaming chat.
- 7 Built-in Providers: DeepSeek, Groq, Zhipu AI, Qwen, OpenAI, Anthropic, Google Gemini
- User-level Isolation: Independent API Keys and configurations for each user
- Quick Environment Setup: One-click start driven by `.env`
- Dynamic Switching: Seamlessly switch model providers at runtime
- Three-tier Memory Architecture: Short-term Dialogue → Mid-term Summary → Long-term Profile
- Automatic Extraction: Chain-of-Thought memory extraction
- AI Arbitration: Intelligent deduplication and merging
- pgvector Retrieval: Semantic similarity search
- Clear APIs: RESTful design, complete Swagger documentation
- Type Safety: Pydantic data validation
- Streaming Responses: SSE real-time chat
- User Authentication Interface: Clear, pre-reserved integration points
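The three memory tiers above (dialogue → summary → profile) can be pictured with hypothetical data classes; the real ORM models live in `app/models/` and will differ in detail:

```python
from dataclasses import dataclass

@dataclass
class ShortTermMessage:
    # Raw dialogue turn kept verbatim for the current session
    role: str      # "user" or "assistant"
    content: str

@dataclass
class SessionSummary:
    # Mid-term: a compressed summary of an older slice of a session
    session_id: str
    summary: str

@dataclass
class ProfileMemory:
    # Long-term: a durable fact extracted about the user
    user_id: str
    category: str   # e.g. "professional_background"
    content: str

turn = ShortTermMessage(role="user", content="I am a Python developer")
fact = ProfileMemory(user_id="user_001",
                     category="professional_background",
                     content="User is a Python developer")
```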
uniai-kernel/
├── app/
│ ├── api/endpoints/ # API Endpoints
│ │ ├── chat.py # Intelligent Chat
│ │ ├── providers.py # Provider Management
│ │ ├── memories.py # Memory Management
│ │ └── sessions.py # Session Management
│ ├── config/
│ │ └── provider_templates.py # Provider Template Configuration
│ ├── core/
│ │ ├── llm.py # Multi-tenant LLM Invocation
│ │ ├── auth.py # User Authentication Interface
│ │ ├── config.py # Configuration Management
│ │ └── startup.py # Startup Auto-configuration
│ ├── models/ # Data Models
│ ├── services/ # Business Services
│ └── main.py # Application Entry
├── scripts/ # Utility Scripts
│ ├── init_providers.py # Initialize Providers
│ └── reset_user.py # Reset User Configuration
└── tests/ # Test Scripts
# Using uv (Recommended)
curl -LsSf https://astral.sh/uv/install.sh | sh
uv sync
# Or using pip
pip install -r requirements.txt

Edit the `.env` file:
# Database
POSTGRES_PASSWORD=your_database_password
ENCRYPTION_KEY=your_encryption_key
# Model Configuration (Select a free provider)
DEFAULT_LLM_PROVIDER=Qwen
DEFAULT_LLM_MODEL=qwen-flash
DEFAULT_LLM_API_KEY=sk-xxx  # Obtain from dashscope.aliyuncs.com

# Start Database
docker-compose up -d postgres
# Run Database Migrations
uv run alembic upgrade head
# Initialize Provider Templates
uv run python scripts/init_providers.py
# Start Service
uv run uvicorn app.main:app --reload

Visit http://localhost:8000/docs to view the API documentation ✨
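To confirm the service actually came up, you can fetch the OpenAPI schema (FastAPI serves it at `/openapi.json` by default); a minimal stdlib-only check:

```python
import json
from urllib.request import urlopen

def api_is_up(base_url: str = "http://localhost:8000") -> bool:
    # FastAPI exposes the OpenAPI schema at /openapi.json by default
    try:
        with urlopen(f"{base_url}/openapi.json", timeout=5) as resp:
            schema = json.load(resp)
        return "paths" in schema
    except (OSError, ValueError):
        # Connection refused, timeout, or non-JSON response
        return False
```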
# Only start infrastructure
docker-compose up -d postgres redis
# Run API locally (Recommended, supports hot reload)
uv run uvicorn app.main:app --reload

# One-click start all services
docker-compose up -d
# Check service status
docker-compose ps
# View logs
docker-compose logs -f uai-api

# Start/Stop
docker-compose start
docker-compose stop
# Restart service
docker-compose restart uai-api
# Enter container
docker exec -it uai-pg psql -U root -d agent_db
docker exec -it uai-redis redis-cli
# View logs
docker logs -f uai-api # API logs
docker logs -f uai-pg # Database logs
# Clean up
docker-compose down # Stop and remove containers
docker-compose down -v       # Remove associated volumes

| Service | Container Name | Port |
|---|---|---|
| API | uai-api | 8000 |
| PostgreSQL | uai-pg | 5432 |
| Redis | uai-redis | 6379 |
from app.core.llm import completion
# Basic Call (Automatically uses user's default model)
response = await completion(
    messages=[
        {"role": "system", "content": "You are a helpful assistant"},
        {"role": "user", "content": "Hello"},
    ],
    user_id="user_001",
)

# Specify Model (Still uses the user's API Key)
response = await completion(
    messages=[...],
    model="gpt-4",
    user_id="user_001",
)

# Streaming Response
async for chunk in await completion(
    messages=[...],
    user_id="user_001",
    stream=True,
):
    print(chunk.choices[0].delta.content or "", end="", flush=True)

from app.core.llm import embedding
# Single Text
result = await embedding(
    input="Hello World",
    user_id="user_001",
)
vector = result['data'][0]['embedding']

# Batch Text
result = await embedding(
    input=["Text 1", "Text 2", "Text 3"],
    user_id="user_001",
)

from app.services.memory_service import memory_service
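The vectors returned by `embedding` can be compared with cosine similarity, the same measure pgvector uses for semantic retrieval; a plain-Python sketch on toy vectors standing in for `result['data'][i]['embedding']`:

```python
import math

def cosine_similarity(a: list[float], b: list[float]) -> float:
    # cos(theta) = (a . b) / (|a| * |b|)
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(x * x for x in b))
    return dot / (norm_a * norm_b)

# Toy vectors; real embeddings have hundreds of dimensions
v1 = [1.0, 0.0, 1.0]
v2 = [1.0, 0.0, 1.0]
v3 = [0.0, 1.0, 0.0]

print(cosine_similarity(v1, v2))  # identical direction -> 1.0
print(cosine_similarity(v1, v3))  # orthogonal -> 0.0
```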
from app.core.db import get_db
async with get_db() as session:
    # Search for related memories
    memories = await memory_service.search_memories(
        user_id="user_001",
        query="User's profession",
        top_k=5,
    )

    # Add memory
    await memory_service.add_memory(
        session,
        user_id="user_001",
        content="User is a Python developer",
        category="professional_background",
    )

from app.services.context_service import context_service
# Build complete context (Memory + Session Summary + History)
messages = await context_service.build_context_messages(
    session_id="session_001",
    user_id="user_001",
    current_query="What's the weather like today?",
    db_session=session,
    enable_memory=True,
    enable_session_summary=True,
)

curl http://localhost:8000/api/v1/providers/templates

Example Response:
[
{
"name": "Qwen",
"provider_type": "openai",
"is_free": true,
"supported_models": ["qwen-turbo", "qwen-plus", "qwen-max", "qwen-flash"]
}
]

# Method 1: Using API
curl -X POST http://localhost:8000/api/v1/providers/my/providers \
-H "Content-Type: application/json" \
-d '{
"template_name": "OpenAI",
"api_key": "sk-proj-xxx",
"custom_config": {}
}'
# Method 2: Using Environment Variables (Recommended)
# Edit .env
DEFAULT_LLM_PROVIDER=OpenAI
DEFAULT_LLM_MODEL=gpt-4
DEFAULT_LLM_API_KEY=sk-proj-xxx

curl -X PUT http://localhost:8000/api/v1/providers/my/default-models \
-H "Content-Type: application/json" \
-d '{
"model_type": "llm",
"model_name": "gpt-4-turbo",
"provider_id": 1
}'

curl -X POST http://localhost:8000/api/v1/chat-sessions/ \
-H "Content-Type: application/json" \
-d '{"title": "Technical Consultation", "user_id": "user_001"}'

curl -X POST http://localhost:8000/api/v1/chat \
-H "Content-Type: application/json" \
-d '{
"session_id": "a1b2c3",
"user_id": "user_001",
"message": "I am a Python developer, recommend a learning path",
"enable_memory": true,
"enable_session_context": true
}'

Streaming Response (SSE Format):
data: {"type": "status", "content": "Retrieving memories..."}
data: {"type": "thought", "content": "Loaded user preferences and history"}
data: {"type": "status", "content": "Generating response..."}
data: {"type": "token", "content": "As"}
data: {"type": "token", "content": " a"}
data: {"type": "token", "content": " Python"}
data: {"type": "token", "content": " developer..."}
data: [DONE]
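A client only needs to split the `data:` lines and stop at `[DONE]`; a minimal parser for the event format shown above (the event names are specific to this framework's stream, not generic SSE):

```python
import json

def parse_sse_events(lines):
    """Yield (type, content) tuples from 'data: ...' lines until [DONE]."""
    for line in lines:
        line = line.strip()
        if not line.startswith("data:"):
            continue  # skip comments and keep-alive blanks
        payload = line[len("data:"):].strip()
        if payload == "[DONE]":
            return  # end-of-stream sentinel
        event = json.loads(payload)
        yield event["type"], event["content"]

stream = [
    'data: {"type": "status", "content": "Generating response..."}',
    'data: {"type": "token", "content": "As"}',
    'data: [DONE]',
    'data: {"type": "token", "content": "never reached"}',
]
tokens = [c for t, c in parse_sse_events(stream) if t == "token"]
print(tokens)  # ['As']
```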
The framework reserves a clear interface for user authentication, located at `app/core/auth.py`.
from typing import Optional

from fastapi import Header

async def get_current_user_id(
    x_user_id: Optional[str] = Header(None),
) -> str:
    return x_user_id or "default_user"

from fastapi import Depends
from fastapi.security import HTTPAuthorizationCredentials, HTTPBearer
from jose import jwt

security = HTTPBearer()

async def get_current_user_id(
    credentials: HTTPAuthorizationCredentials = Depends(security),
) -> str:
    token = credentials.credentials
    # SECRET_KEY comes from your settings/environment
    payload = jwt.decode(token, SECRET_KEY, algorithms=["HS256"])
    return payload["user_id"]

from app.core.auth import get_current_user_id

@router.post("/chat")
async def chat(
    request: ChatRequest,
    user_id: str = Depends(get_current_user_id),  # Automatically injected
    db: AsyncSession = Depends(get_db),
):
    # user_id is automatically obtained from the auth system
    ...

Edit `app/config/provider_templates.py`:
PROVIDER_TEMPLATES.append({
    "name": "Mistral",
    "provider_type": "mistral",
    "api_base": "https://api.mistral.ai/v1",
    "is_free": False,
    "requires_api_key": True,
    "supported_models": ["mistral-large", "mistral-medium"],
    "description": "Mistral AI Models",
    "config_schema": {
        "api_key": {"required": True, "description": "Mistral API Key"}
    },
})

Then run the initialization script:
uv run python scripts/init_providers.py

# Check database status
uv run python scripts/check_db.py
# Reset user config (Troubleshooting)
uv run python scripts/reset_user.py
# Test chat and memory features
uv run python tests/test_chat_memory.py

| Component | Technology | Description |
|---|---|---|
| Web Framework | FastAPI | High-performance async framework |
| LLM Integration | LiteLLM | Unified interface for 100+ models |
| Database | PostgreSQL + pgvector | Vector storage |
| ORM | SQLAlchemy 2.0 | Async database operations |
| Migrations | Alembic | Database version management |
| Orchestration | LangGraph | State machine workflows |
| Package Manager | uv | Extremely fast Python package installer |
| Provider | Model | Website |
|---|---|---|
| DeepSeek | deepseek-chat | platform.deepseek.com |
| Groq | llama-3.1-70b | console.groq.com |
| Zhipu AI | glm-4-flash | open.bigmodel.cn |
| Qwen | qwen-flash | dashscope.aliyuncs.com |
| Provider | Model | Website |
|---|---|---|
| OpenAI | gpt-4-turbo | platform.openai.com |
| Anthropic | claude-3 | console.anthropic.com |
| Google Gemini | gemini-pro | ai.google.dev |
Full documentation: http://localhost:8000/docs
| Endpoint | Method | Description |
|---|---|---|
| /api/v1/chat | POST | Intelligent Chat (SSE Streaming) |
| /api/v1/chat-sessions/ | POST | Create Session |
| /api/v1/memories/search | GET | Search Memories |
| /api/v1/providers/templates | GET | List Provider Templates |
| /api/v1/providers/my/providers | POST | Configure My Provider |
| /api/v1/providers/my/default-models | PUT | Set Default Model |
| /api/v1/users/init | POST | Initialize New User |
- Encryption Key: Use a strong random `ENCRYPTION_KEY`, e.g. generated with
  `python -c "from cryptography.fernet import Fernet; print(Fernet.generate_key().decode())"`
- Database Password: Use a complex password and restrict access
- User Authentication: Integrate JWT or OAuth2
- HTTPS: An absolute requirement for production environments
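Since `ENCRYPTION_KEY` is a Fernet key, stored credentials (such as per-user API keys) can be encrypted and decrypted symmetrically; a sketch of the round trip, assuming the framework uses Fernet this way:

```python
from cryptography.fernet import Fernet

# Generate once and store in .env as ENCRYPTION_KEY
key = Fernet.generate_key()
fernet = Fernet(key)

# Encrypt a user's provider API key before persisting it
token = fernet.encrypt(b"sk-proj-xxx")

# Decrypt only when the key is needed for an LLM call
plaintext = fernet.decrypt(token)
```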
# Using Gunicorn + Uvicorn Workers
gunicorn app.main:app \
--workers 4 \
--worker-class uvicorn.workers.UvicornWorker \
--bind 0.0.0.0:8000

PRs and Issues are welcome!
Apache License 2.0
Happy Coding! 🚀