model-ledger

Open-source model governance framework for fintechs. Auto-discover models, trace dependencies, validate compliance.

The Problem

Every financial institution under model risk regulation needs a model inventory. Today, 90% use spreadsheets. model-ledger replaces that with code — auto-discovered models, dependency graphs, and compliance validation from source systems.

Quick Start

pip install model-ledger

from model_ledger import Ledger, DataNode

ledger = Ledger()

# Define your models with inputs and outputs
segmentation = DataNode("segmentation", outputs=["segments"])
scorer = DataNode("fraud_scorer", inputs=["segments", "velocity"], outputs=["scores"])
alerting = DataNode("fraud_alerts", inputs=["scores"])

# Add and connect — edges build automatically
ledger.add([segmentation, scorer, alerting])
ledger.connect()

# Query the graph
ledger.trace("fraud_alerts")    # → ['segmentation', 'fraud_scorer', 'fraud_alerts']
ledger.upstream("fraud_alerts") # → ['segmentation', 'fraud_scorer']

Architecture

graph TB
    subgraph "Discovery (v0.4.0)"
        DN["DataNode<br/><i>name, inputs, outputs</i>"]
        DP["DataPort<br/><i>smart string with optional schema</i>"]
        SC["SourceConnector<br/><i>protocol: discover() → DataNode[]</i>"]
    end

    subgraph "Ledger"
        L["Ledger<br/><i>add, connect, trace<br/>upstream, downstream</i>"]
        MR["ModelRef<br/><i>regulatory identity</i>"]
        SN["Snapshot<br/><i>immutable event log</i>"]
    end

    subgraph "Adapters"
        SQL["SQL Parser<br/><i>extract tables, writes, filters</i>"]
        TBL["Table Discovery<br/><i>find pipelines from output tables</i>"]
    end

    subgraph "Compliance"
        VP["Validation Profiles"]
        S1["SR 11-7"]
        EA["EU AI Act"]
        NI["NIST AI RMF"]
    end

    subgraph "Storage"
        BP["LedgerBackend Protocol"]
        IM["InMemory"]
        SQ["SQLite"]
        CU["Your Backend"]
    end

    SC --> DN
    DN --> DP
    DN --> L
    L --> MR
    L --> SN
    L --> BP
    BP --> IM
    BP --> SQ
    BP --> CU
    SQL --> SC
    TBL --> SC
    VP --> S1
    VP --> EA
    VP --> NI

How It Works

Everything is a DataNode with inputs and outputs. The dependency graph builds itself from port matching.

sequenceDiagram
    participant C as SourceConnector
    participant L as Ledger
    participant B as Backend

    C->>L: ledger.add(connector.discover())
    L->>B: save_model(ModelRef) for each node
    L->>B: append_snapshot("discovered", {inputs, outputs})

    Note over L: ledger.connect()
    L->>L: Match output ports → input ports
    L->>B: append_snapshot("depends_on") for each edge

    Note over L: ledger.trace("model")
    L->>B: Read snapshots, walk graph
    L-->>C: ['upstream_a', 'upstream_b', 'model']

Key Features

Auto-discovery from source systems

Write a SourceConnector for your platform. It returns DataNode objects. The Ledger does the rest.

class MyConnector:
    name = "my_platform"

    def discover(self) -> list[DataNode]:
        # Query your ML platform, rules engine, ETL scheduler, etc.
        return [DataNode("model_a", inputs=["table_x"], outputs=["scores"])]

ledger.add(MyConnector().discover())
ledger.connect()

Shared tables with discriminators

When multiple models write to the same table, use DataPort for precision:

from model_ledger import DataPort

# Two models write to the same alert table
DataNode("check_rules", outputs=[DataPort("alerts", model_name="checks")])
DataNode("card_rules", outputs=[DataPort("alerts", model_name="cards")])

# Reader specifies which model's alerts it consumes
DataNode("check_queue", inputs=[DataPort("alerts", model_name="checks")])
# Only connects to check_rules, not card_rules

Table-based pipeline discovery

Discover active pipelines from their output tables — works with any orchestrator:

from model_ledger.adapters.tables import discover_pipelines_from_table

nodes = discover_pipelines_from_table(
    connection=my_db,
    table="scores_archive",
    name_column="model_name",
    timestamp_column="run_date",
)
# Returns one DataNode per active model_name in the table

SQL parsing utilities

Extract table references, write targets, and filters from SQL — useful for any SQL-based connector:

from model_ledger.adapters.sql import extract_tables_from_sql, extract_write_tables

extract_tables_from_sql("SELECT * FROM a.b JOIN c.d ON 1=1")
# → ['a.b', 'c.d']

extract_write_tables("INSERT INTO schema.output SELECT * FROM source")
# → ['schema.output']

Dependency tracking

ledger.trace("fraud_alerts")                              # Full pipeline path
ledger.upstream("fraud_alerts")                           # What feeds this model
ledger.downstream("segmentation")                         # What depends on this
ledger.dependencies("fraud_alerts", direction="upstream")  # Detailed dep info

Point-in-time inventory

# What models existed on a specific date?
inventory = ledger.inventory_at(audit_date)

Compliance validation

Profile	Regulation	What It Checks
`sr_11_7`	US Federal Reserve SR 11-7	Validator independence, governance docs, validation schedule
`eu_ai_act`	EU AI Act (2024/1689)	Risk assessment, data governance, human oversight
`nist_ai_rmf`	NIST AI RMF 1.0	GOVERN, MAP, MEASURE, MANAGE functions

Model introspection

Extract algorithm, features, and hyperparameters from fitted models:

from model_ledger import introspect

result = introspect(fitted_model)
# result.algorithm → "XGBClassifier"
# result.features → [FeatureInfo(name="velocity_30d", ...), ...]
# result.hyperparameters → {"n_estimators": 50, "max_depth": 4}

Pluggable storage

ledger = Ledger()                              # InMemory (default)
ledger = Ledger(backend=MySnowflakeBackend())  # Your warehouse

Install

pip install model-ledger                       # Core
pip install model-ledger[cli]                  # + CLI
pip install model-ledger[introspect-sklearn]   # + sklearn introspector

Design Principles

Everything is a DataNode — models, rules, pipelines, signals. Same abstraction.
The graph builds itself — connect outputs to inputs. No manual linking.
Event log, not a table — every change is an immutable Snapshot.
Protocol-first — all extension points use @runtime_checkable Protocol. No base classes.
Data as discovery — scan database output tables, not orchestration configs. Works with any stack.

For Organizations

The OSS core handles data models, graph building, and compliance. Your internal package adds:

SourceConnectors for your platforms (Snowflake, SageMaker, Airflow, etc.)
LedgerBackend for your data warehouse
Validation profiles for your regulations (OSFI E-23, PRA SS1/23, MAS AIRG)

Contributing

See CONTRIBUTING.md. All commits require DCO sign-off.

License

Apache-2.0. See LICENSE.

Name		Name	Last commit message	Last commit date
Latest commit History 56 Commits
.github		.github
docs		docs
src/model_ledger		src/model_ledger
tests		tests
.gitignore		.gitignore
CLAUDE.md		CLAUDE.md
CODEOWNERS		CODEOWNERS
CONTRIBUTING.md		CONTRIBUTING.md
GOVERNANCE.md		GOVERNANCE.md
LICENSE		LICENSE
README.md		README.md
pyproject.toml		pyproject.toml
renovate.json		renovate.json
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

model-ledger

The Problem

Quick Start

Architecture

How It Works

Key Features

Auto-discovery from source systems

Shared tables with discriminators

Table-based pipeline discovery

SQL parsing utilities

Dependency tracking

Point-in-time inventory

Compliance validation

Model introspection

Pluggable storage

Install

Design Principles

For Organizations

Contributing

License

About

Uh oh!

Releases 1

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

model-ledger

The Problem

Quick Start

Architecture

How It Works

Key Features

Auto-discovery from source systems

Shared tables with discriminators

Table-based pipeline discovery

SQL parsing utilities

Dependency tracking

Point-in-time inventory

Compliance validation

Model introspection

Pluggable storage

Install

Design Principles

For Organizations

Contributing

License

About

Resources

License

Code of conduct

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases 1

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages