LEO CDP is an Open Source AI-first Customer Data Platform (CDP) framework that empowers organizations to build and operate their own fully customizable CDP infrastructure — with machine learning and big data at its core.
Designed for developers, data scientists, marketers, and enterprises, LEO CDP enables unified data collection, real-time customer analytics, audience segmentation, and personalized marketing — all while remaining self-hosted and privacy-friendly.
- The philosophy of Dataism → USPA → LEO CDP
- Democratize AI-powered data platforms for digital transformation
- Promote data sovereignty, on-premise intelligence, and open collaboration
-
Omnichannel Data Collection & Unification
Collect data from web, mobile, CRM, IoT, POS, social media, and APIs. Unify into rich customer profiles. -
Real-Time Customer 360
Build a complete view of every customer using behavioral, transactional, and third-party data. -
AI-based Segmentation & Scoring
Use clustering, RFM, CLV prediction, churn scoring, and dynamic audiences using ML models. -
Behavioral Tracking & Journey Mapping
Track individual actions and interactions in real time. Map customer journeys across channels. -
Predictive Analytics & Insights
Leverage machine learning pipelines with Jupyter/Colab for real-time insights. -
Personalization & Activation
Using Agentic AI to deliver personalized experiences via email, push, SMS, and content based on customer intent. -
Event-Driven Architecture with ETL/ELT
Built-in Apache Airflow integration to manage data ingestion, transformation, and orchestration. -
Plug-in Ecosystem & API-First Design
Easy to extend, integrate, and automate via REST APIs and modular services. -
Data Governance & Privacy
Built-in consent tracking, GDPR compliance, and on-prem hosting for full control over customer data. -
DevOps Ready
Docker-based deployment, Prometheus + Grafana monitoring, scalable microservice architecture.
- Break away from SaaS lock-in. Full customization and ownership of your CDP.
- Ideal for agencies, startups, enterprises, and researchers building AI-powered marketing stacks.
- Open source encourages transparency, innovation, and community-driven evolution.
Feature | Status |
---|---|
✅ Core CDP Platform (Profiles, Events, Segmentation) | Complete |
✅ CDP SDKs (JavaScript, Python) | Complete |
🔄 Identity Resolution with Graph + Vector Matching | In Progress |
🔄 AI Assistant (Chatbot for Audience Insights & Suggestions) | In Progress |
🔄 Agentic AI: Personalizing the Customer Experience | In Progress |
🔄 Embedding Model for Customer Vector Search (via Qdrant) | In Progress |
🆕 CDP Mobile SDKs (Android, iOS, React Native) | Planned |
🆕 Open Source Campaign Management UI | Planned |
🆕 Integration Marketplace for Martech Tools | Planned |
🆕 Webhook + Event Bus Support (Kafka / RabbitMQ / SQS) | Planned |
🆕 Federated Identity Graph using OpenID & OAuth | Planned |
Want to contribute? Join the community!
- URL: https://dcdp.bigdatavietnam.org
- Username:
demo
- Password:
123456
- 🇻🇳 Document bằng tiếng Việt
- 🧠 CDP Handbook 2023
- 📊 Data Model & Journey Map
- ⚙️ Analytics Core Functions
- 💡 Data Strategy with LEO CDP
- Backend: Java 11 (Amazon Corretto), Python 3.10
- Database: ArangoDB 3.11 (Multi-model: Document + Graph + Search)
- Monitoring: Prometheus 2 + Grafana 8
- Data Pipeline: Apache Airflow
- Analytics & ML: Jupyter Notebook / Google Colab
- Messaging: Redis 6, OneSignal, Firebase
- Deployment: Ubuntu 22 LTS, Docker, On-Prem / Cloud
- Google Cloud, AWS, VNG Cloud, or your own private infrastructure
See: Installation Guide
Created by: Trieu Nguyen (Thomas)
License: Open Source - MIT-style.
Use freely. Customize. Brand your own white-label CDP. Just respect the original creator 🙏.
- Bugs or ideas? Email: trieu@leocdp.com
- Join our learning group: BigDataVietnam.org
- YouTube: @bigdatavn
- Blog: knowledge.leocdp.net