Enterprise Knowledge Compilation & Intelligent Retrieval Platform
Transform raw documents into structured, searchable knowledge through AI-powered distillation, graph mapping, and dual-engine retrieval.
Features · Quick Start · Architecture · Configuration · Structure · Development
|
The Karpathy-inspired compilation pipeline transforms raw knowledge into distilled, structured Wiki articles.
|
The RAG-powered retrieval pipeline provides structured knowledge access through taxonomy and graph navigation.
|
| Layer | Technology | Purpose |
|---|---|---|
| Frontend | React 19, Vite 6, Tailwind CSS, TypeScript | Dark-mode SPA with Linear-inspired design system |
| Backend | FastAPI, Python 3.11+, Pydantic v2 | Async API with structured logging |
| AI/Agent | LangGraph, LangChain, OpenAI / Anthropic | Multi-agent knowledge processing pipeline |
| Vector DB | Milvus 2.4 | High-performance similarity search |
| Graph DB | Neo4j 5 (Community) | Entity-relationship knowledge graph |
| Relational DB | PostgreSQL 16 | Metadata, projects, audit logs |
| Cache | Redis 7 | Session cache, rate limiting |
| Object Storage | MinIO | Document archival (PDF, DOCX, images) |
- Python 3.11+
- Node.js 20+
- Docker & Docker Compose
git clone https://github.com/Cliff-AI-Lab/wikimap.git
cd wikimap
# Copy environment config
cp .env.example .env
# Edit .env with your API keys and database credentials# Start Neo4j, Milvus, Redis (local development)
docker compose up -d
# For full containerized deployment (includes PostgreSQL + API)
docker compose --profile full up -d# Create virtual environment
python -m venv .venv
source .venv/bin/activate # Windows: .venv\Scripts\activate
# Install dependencies
pip install -e ".[dev]"
# Run API server
uvicorn api.main:app --reload --port 8000cd frontend
npm install
npm run devOpen http://localhost:5173 to access the platform.
Set LLM_PROVIDER=mock and EMBEDDING_PROVIDER=mock in .env to run the full platform offline — perfect for development and evaluation.
┌──────────────────────────────────────────────────────┐
│ Frontend (React 19 + Vite) │
│ ProjectList → Dashboard → 8 Workflow Pages │
├──────────────────────────────────────────────────────┤
│ API Layer (FastAPI v14.0) │
│ projects · knowledge · qa · wiki · analysis · ... │
├──────────────────────────────────────────────────────┤
│ Business Logic (Retrieval + Distillation) │
│ IntentRouter → SkillsRouter → Branch Activation │
├──────────────────────────────────────────────────────┤
│ Agent Layer (LLM Agents) │
│ Librarian · Judge · Refiner · SkillsRouter │
├──────────────────────────────────────────────────────┤
│ Storage Layer │
│ PostgreSQL · Milvus · Neo4j · Redis · MinIO │
└──────────────────────────────────────────────────────┘
| Step | Stage | Description |
|---|---|---|
| 1 | Import | Upload documents — TXT, MD, DOCX, PDF, images, videos |
| 2 | Denoise | AI quality scoring with human review for edge cases |
| 3 | Analyze | Statistical insights and knowledge distillation metrics |
| 4 | Compile | Domain-level Wiki synthesis with cross-reference linking |
| 5 | Index | Schema generation for LLM-readable knowledge access |
| 6 | Search & QA | Dual-engine retrieval combining both branches |
Key environment variables (see .env.example for full list):
| Variable | Description | Default |
|---|---|---|
LLM_PROVIDER |
LLM backend (openai / anthropic / mock) |
openai |
LLM_MODEL |
Model name | gpt-4o-mini |
EMBEDDING_PROVIDER |
Embedding backend (openai / mock) |
openai |
AUTH_REQUIRED |
Enable authentication | false |
FEISHU_MOCK_MODE |
Mock Feishu/Lark connector | true |
DINGTALK_MOCK_MODE |
Mock DingTalk connector | true |
WECOM_MOCK_MODE |
Mock WeCom connector | true |
Tip: The platform supports any OpenAI-compatible API endpoint. Set
OPENAI_BASE_URLto use providers like Azure OpenAI, DeepSeek, Zhipu GLM, or local models via Ollama/vLLM.
wikimap/
├── api/ # FastAPI application
│ ├── main.py # App entry, lifespan, middleware
│ ├── deps.py # Dependency injection & store init
│ ├── middleware/ # Authentication middleware
│ ├── routers/ # API endpoints (10 modules)
│ └── schemas/ # Pydantic request/response models
├── packages/ # Core business logic
│ ├── agents/ # LLM agents — Librarian, Judge, Refiner
│ ├── common/ # Config, logging, shared utilities
│ ├── retrieval/ # Search pipeline — IntentRouter, scoring
│ ├── storage/ # DB adapters — PG, Milvus, Neo4j, Redis
│ └── templates/ # Industry templates (5 verticals)
├── frontend/ # React 19 SPA
│ ├── src/components/ # UI components — layout, icons, settings
│ ├── src/pages/ # 14 route pages
│ └── DESIGN.md # Design system spec
├── tests/ # Unit & integration tests
├── docker-compose.yml # Infrastructure (Neo4j, Milvus, Redis, PG)
├── Dockerfile # Multi-stage production build
└── pyproject.toml # Python project configuration
# Run tests
pytest
# Lint & format
ruff check . --fix
ruff format .
# Type check
mypy api/ packages/
# Build frontend for production
cd frontend && npm run build
知 识 图 鉴 · Wiki-Map
Enterprise Knowledge Compilation & Intelligent Retrieval Platform