Skip to content

trieu/leo-cdp-free-edition

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

LEO CDP framework

LEO CDP – The Open Source AI-first Customer Data Platform

LEO CDP is an Open Source AI-first Customer Data Platform (CDP) framework that empowers organizations to build and operate their own fully customizable CDP infrastructure — with machine learning and big data at its core.

Designed for developers, data scientists, marketers, and enterprises, LEO CDP enables unified data collection, real-time customer analytics, audience segmentation, and personalized marketing — all while remaining self-hosted and privacy-friendly.


🚀 Vision & Philosophy

  • The philosophy of Dataism → USPA → LEO CDP
  • Democratize AI-powered data platforms for digital transformation
  • Promote data sovereignty, on-premise intelligence, and open collaboration

🔥 Key Features

  1. Omnichannel Data Collection & Unification
    Collect data from web, mobile, CRM, IoT, POS, social media, and APIs. Unify into rich customer profiles.

  2. Real-Time Customer 360
    Build a complete view of every customer using behavioral, transactional, and third-party data.

  3. AI-based Segmentation & Scoring
    Use clustering, RFM, CLV prediction, churn scoring, and dynamic audiences using ML models.

  4. Behavioral Tracking & Journey Mapping
    Track individual actions and interactions in real time. Map customer journeys across channels.

  5. Predictive Analytics & Insights
    Leverage machine learning pipelines with Jupyter/Colab for real-time insights.

  6. Personalization & Activation
    Using Agentic AI to deliver personalized experiences via email, push, SMS, and content based on customer intent.

  7. Event-Driven Architecture with ETL/ELT
    Built-in Apache Airflow integration to manage data ingestion, transformation, and orchestration.

  8. Plug-in Ecosystem & API-First Design
    Easy to extend, integrate, and automate via REST APIs and modular services.

  9. Data Governance & Privacy
    Built-in consent tracking, GDPR compliance, and on-prem hosting for full control over customer data.

  10. DevOps Ready
    Docker-based deployment, Prometheus + Grafana monitoring, scalable microservice architecture.


🌍 Why Open Source?

  • Break away from SaaS lock-in. Full customization and ownership of your CDP.
  • Ideal for agencies, startups, enterprises, and researchers building AI-powered marketing stacks.
  • Open source encourages transparency, innovation, and community-driven evolution.

📈 Roadmap 2025+

Feature Status
✅ Core CDP Platform (Profiles, Events, Segmentation) Complete
✅ CDP SDKs (JavaScript, Python) Complete
🔄 Identity Resolution with Graph + Vector Matching In Progress
🔄 AI Assistant (Chatbot for Audience Insights & Suggestions) In Progress
🔄 Agentic AI: Personalizing the Customer Experience In Progress
🔄 Embedding Model for Customer Vector Search (via Qdrant) In Progress
🆕 CDP Mobile SDKs (Android, iOS, React Native) Planned
🆕 Open Source Campaign Management UI Planned
🆕 Integration Marketplace for Martech Tools Planned
🆕 Webhook + Event Bus Support (Kafka / RabbitMQ / SQS) Planned
🆕 Federated Identity Graph using OpenID & OAuth Planned

Want to contribute? Join the community!


🧪 System Demo


📚 Documents


🛠️ Tech Stack

  • Backend: Java 11 (Amazon Corretto), Python 3.10
  • Database: ArangoDB 3.11 (Multi-model: Document + Graph + Search)
  • Monitoring: Prometheus 2 + Grafana 8
  • Data Pipeline: Apache Airflow
  • Analytics & ML: Jupyter Notebook / Google Colab
  • Messaging: Redis 6, OneSignal, Firebase
  • Deployment: Ubuntu 22 LTS, Docker, On-Prem / Cloud

☁️ Cloud Options

  • Google Cloud, AWS, VNG Cloud, or your own private infrastructure

🔧 Installation

See: Installation Guide


🧑‍💻 Author & License

Created by: Trieu Nguyen (Thomas)
License: Open Source - MIT-style.
Use freely. Customize. Brand your own white-label CDP. Just respect the original creator 🙏.


💬 Community & Support


📜 Historical Proof of Innovation