Skip to content

hypertopos/hypertopos-py

Repository files navigation

hypertopos

Understand the structure of your data — without training machine learning models.

Python 3.12+ License: BSL 1.1 DOI PyArrow Lance MCP Version

hypertopos transforms relational data into a geometric space where every entity gets a coordinate derived from its relationships. Distance from the population center reveals anomalies. Proximity between entities reveals similarity. Movement over time reveals drift. No training, no labels — geometry is computed from the data.

pip install hypertopos

hypertopos overview

What you can do with it

  • Detect anomalies without training ML models or labeling data
  • Discover clusters and structural archetypes
  • Track behavioral drift over time
  • Compare populations to find what differentiates them
  • Explore datasets with AI agents that navigate rather than query

How it works

You describe your data in YAML — entity types, sources, relationships. hypertopos computes population statistics and produces a sphere: pre-computed geometry stored in Apache Arrow format.

Agents (or Python code) open the sphere and navigate it using twelve primitives that cover movement, clustering, anomaly detection, population comparison, and temporal analysis. Each step is stateful — where you are determines what you see next.

For the full picture: Introduction · Core Concepts · Quick Start

Benchmarks

Validated on three domains with the same engine, zero domain rules, zero labels:

Domain Dataset Key result
Banking Berka (Czech, real data) 85.5% recall on loan defaults
AML IBM AML (synthetic) 80.4% recall, zero labels
Transport NYC Yellow Taxi (7.6M trips) 8/8 anomaly categories detected

Benchmark scripts and data preparation are included. Results are reproducible. Numbers are from the pre-0.1.0 validation run and have not been re-evaluated against recent releases.

Full results: Benchmarks

Documentation

Introduction The idea and where it stands
Quick Start Install, build, navigate
Core Concepts Mathematical foundation
Configuration Sphere builder YAML reference
API Reference Python API
Data Format On-disk storage format
Architecture Package layers and design

Status

Research-stage project. Working code, reproducible benchmarks, active development. API may change.

License

Business Source License 1.1. Free for internal use, development, testing, and research. See LICENSE.md for details.

About

Understand the structure of your data — without training machine learning models

Topics

Resources

License

Contributing

Security policy

Stars

Watchers

Forks

Packages

 
 
 

Contributors

Languages