Understand the structure of your data — without training machine learning models.
hypertopos transforms relational data into a geometric space where every entity gets a coordinate derived from its relationships. Distance from the population center reveals anomalies. Proximity between entities reveals similarity. Movement over time reveals drift. No training, no labels — geometry is computed from the data.
pip install hypertopos- Detect anomalies without training ML models or labeling data
- Discover clusters and structural archetypes
- Track behavioral drift over time
- Compare populations to find what differentiates them
- Explore datasets with AI agents that navigate rather than query
You describe your data in YAML — entity types, sources, relationships. hypertopos computes population statistics and produces a sphere: pre-computed geometry stored in Apache Arrow format.
Agents (or Python code) open the sphere and navigate it using twelve primitives that cover movement, clustering, anomaly detection, population comparison, and temporal analysis. Each step is stateful — where you are determines what you see next.
For the full picture: Introduction · Core Concepts · Quick Start
Validated on three domains with the same engine, zero domain rules, zero labels:
| Domain | Dataset | Key result |
|---|---|---|
| Banking | Berka (Czech, real data) | 85.5% recall on loan defaults |
| AML | IBM AML (synthetic) | 80.4% recall, zero labels |
| Transport | NYC Yellow Taxi (7.6M trips) | 8/8 anomaly categories detected |
Benchmark scripts and data preparation are included. Results are reproducible. Numbers are from the pre-0.1.0 validation run and have not been re-evaluated against recent releases.
Full results: Benchmarks
| Introduction | The idea and where it stands |
| Quick Start | Install, build, navigate |
| Core Concepts | Mathematical foundation |
| Configuration | Sphere builder YAML reference |
| API Reference | Python API |
| Data Format | On-disk storage format |
| Architecture | Package layers and design |
Research-stage project. Working code, reproducible benchmarks, active development. API may change.
Business Source License 1.1. Free for internal use, development, testing, and research. See LICENSE.md for details.