DIPC is an enterprise-grade document processing system that leverages multi-modal LLMs to extract structured information from various document formats. Built with a microservices architecture, it provides high-performance, scalable document intelligence capabilities.
- 📄 Multi-Format Support - PDFs, images (JPG/PNG/GIF/WEBP), text files, and ZIP archives
- 🚀 High Performance - Asynchronous processing with distributed workers
- 💰 Cost Management - Built-in estimation and usage limiting
- 🔍 Vector Search - Optional semantic search capabilities
- 🌐 Multi-Architecture - Native support for x86_64 and ARM64 (including Apple Silicon)
- 🔒 Enterprise Ready - Production-grade security and monitoring
- Docker 20.10+ (Install Guide)
- Docker Compose 2.0+
# 1. Create project directory
mkdir dipc && cd dipc
# 2. Download configuration
curl -O https://raw.githubusercontent.com/nociex/DIPC/main/docker-compose.yml
curl -O https://raw.githubusercontent.com/nociex/DIPC/main/.env.example
# 3. Configure API key
cp .env.example .env
# Edit .env and add your OpenAI/OpenRouter API key
# 4. Start services
docker compose up -d
# 5. Access the application
# Frontend: http://localhost:38110
# API Docs: http://localhost:38100/docsgraph LR
A[Next.js Frontend] --> B[FastAPI Gateway]
B --> C[Redis Queue]
C --> D[Celery Workers]
D --> E[LLM Providers]
D --> F[Vector DB]
B --> G[(PostgreSQL)]
D --> G
- Processing Speed: 10-50 pages/minute (depending on complexity)
- Concurrent Jobs: Up to 100 simultaneous documents
- API Latency: <100ms for most endpoints
- Storage: S3-compatible object storage support
# Required: Choose one LLM provider
OPENAI_API_KEY=sk-your-key
# OR
OPENROUTER_API_KEY=sk-or-your-key
# Optional: Advanced settings
MAX_WORKERS=4
PROCESSING_TIMEOUT=300
VECTOR_DB_ENABLED=trueFor production deployments, Kubernetes configurations, and advanced settings, see the Deployment Guide.
- User Guide - Getting started and features
- API Reference - Complete API documentation
- Architecture - System design and components
- Troubleshooting - Common issues and solutions
We welcome contributions! Please see our Contributing Guide for details.
This project is licensed under the MIT License - see the LICENSE file for details.
Built with excellent open-source technologies including FastAPI, Next.js, Celery, and more.