DevTrust

The trust stack for AI-era engineering — from PR to production.

DevTrust is a connected platform of small, opinionated, production-grade tools that give engineering teams a coherent answer to a simple question: as AI starts to write more of the code, how do we keep trust in what ships?

Each tool stands alone, ships to PyPI independently, and works with nothing but pip install. They also compose: the architecture model from one product feeds the test selector in the next, which feeds the PR reviewer after that, which feeds incident response when something breaks. Three full waves are shipped; a fourth is in progress.

Quick start

All eight packages are live on PyPI.

pip install devtrust-repox       # codebase architecture model
pip install devtrust-sts         # smart test selector
pip install devtrust-apr         # AI-era PR reviewer
pip install devtrust-agtrace     # agent-aware tracing SDK
pip install devtrust-whychanged  # production diff-detective
pip install devtrust-tokencost   # LLM cost attribution

GitHub Apps for the CI surfaces:

pip install devtrust-sts-app     # GitHub App — sts on every PR
pip install devtrust-apr-app     # GitHub App — apr on every PR

Try one:

repox build .
sts info --repo .
apr review --repo . --title "Refactor models" --description "Tighten validation."

60-second integrated demo

Wire three of the products together in one Python session — model the repo, trace an agent, attribute LLM cost, and gate a tool call with a policy:

from agtrace import default_tracer
from agentguard import Policy, Rule, ToolCall, enforce, with_agent
from agentguard.baseline import baseline_starter_policy
from tokencost.middleware import chain, default_sink, to_active_agtrace_span
from tokencost.models import TokenUsage
from datetime import UTC, datetime

tracer = default_tracer()
sink   = chain(default_sink(), to_active_agtrace_span())
policy = Policy(name="demo", rules=[*baseline_starter_policy().rules])

with tracer.span("agent.run", kind="agent"), with_agent("demo-bot"):
    with tracer.span("llm.call", kind="prompt"):
        sink(TokenUsage(timestamp=datetime.now(UTC),
                        provider="anthropic", model="claude-sonnet-4-6",
                        input_tokens=1234, output_tokens=567,
                        cost_micros=12000, feature="demo"))

    decision = enforce(policy, ToolCall(tool="stripe.charge",
                                        arguments={"amount": 100}),
                       audit="audit.jsonl")
    print(decision.status, decision.reason)  # → 'deny' + reason

Then dump the trace tree to confirm everything stitched together:

agtrace dump --from-file .agtrace/traces.jsonl
cat audit.jsonl

Five products. One coherent run. No agents required.

Naming: distribution names on PyPI are namespaced under devtrust- to avoid collisions with common short names. Python imports and CLI commands stay short — once installed, you import repox and run repox build . exactly as you'd expect.

What's in here

Eight installable packages across three shipped waves, each with its own pyproject.toml, README.md, CHANGELOG.md, tests, and PyPI release.

Naming convention: distribution names on PyPI are namespaced under devtrust- (e.g. pip install devtrust-repox) to avoid collisions with common short names. The Python module names and CLI commands stay short — once installed, you import repox and run repox build . exactly as you'd expect.

Wave 1 — codebase understanding

PyPI package	CLI / module	What it does	Latest
`devtrust-repox`	`repox`	Build a portable architecture model of any codebase: files, imports, symbols, and function-call edges across Python + JavaScript + TypeScript via tree-sitter. Emits `.repox/architecture.{json,md}` that downstream tools consume.	0.4.0
`devtrust-sts`	`sts`	Smart Test Selector: given a code change, decide which tests must run. Transitive-import-aware, framework-detection (pytest, unittest, jest, vitest), reads `repox` artifacts when available.	0.0.3
`devtrust-apr`	`apr`	Agent-PR Reviewer: deterministic + AI-pattern review for pull requests. Python + JS/TS rule packs, plus opt-in LLM-backed `ai-review:diff-comprehension` (Anthropic) and a deterministic `ai-review:hallucinated-symbol` rule that walks `repox` call edges to flag invented function calls.	0.2.0

Wave 2 — ship surfaces

PyPI package	CLI / module	What it does	Latest
`devtrust-sts-app`	`sts-app`	GitHub App for `sts`. JWT-signed installation tokens, HMAC webhook verification, tarball clone (no `git` binary needed), runs `repox` + `sts` end-to-end and posts a sticky PR comment with the verdict.	0.0.3
`devtrust-apr-app`	`apr-app`	GitHub App for `apr`. Same shape as `sts-app` — webhook receiver, GitHub App auth, sticky PR comment with findings.	0.0.1

Wave 3 — open & run

PyPI package	CLI / module	What it does	Latest
`devtrust-agtrace`	`agtrace`	Agent-aware tracing for LLM-driven workflows: spans, events, tool calls, JSONL append-only event store, ContextVar-based attribution that's safe across threads + async.	0.0.2
`devtrust-whychanged`	`whychanged`	Production diff-detective for incident response: when something breaks, rank the changes most likely to be the culprit. Pluggable `ChangeProvider` interface (git history + GitHub Deployments shipped).	0.1.0
`devtrust-tokencost`	`tokencost`	Financial-grade attribution for LLM spend: capture every Anthropic / OpenAI call with team / user / feature attribution, money in integer micro-USD (no float drift), JSONL store + cost report. Composes with `agtrace` so cost attaches to the active agent span.	0.0.3

→ See docs/wave-3-overview.md for the trio explainer.

Wave 4 — queued

agentguard — runtime governance + policy-as-code for AI agent tools. Spec written; build pending.

Why one repo

Independent versions, one source of truth. Cross-package changes (apr depends on repox artifacts; tokencost writes spans into the active agtrace context) land in one PR, get reviewed together, and ship as a coherent platform release. Each package still publishes to PyPI independently — release.yml fires on <package>-v<version> tags and publishes that package only.

Getting started

Prerequisites

Python 3.11+ (workspace pinned to 3.14.2 in .python-version)
uv (recommended) or pip

One-time setup

git clone https://github.com/AbdullahBakir97/DevTrust.git
cd DevTrust

uv venv
.venv\Scripts\activate
uv sync --all-packages --all-groups

Smoke check

# Lint, format, type-check, all tests, version + changelog gates
uv run python scripts/release.py --check

This is the same gate release.yml runs in CI before publishing to PyPI. A clean run looks like:

== Per-package metadata check ==
  ok    repox      @ 0.4.0
  ok    sts        @ 0.0.3
  ok    sts-app    @ 0.0.3
  ok    apr        @ 0.2.0
  ok    apr-app    @ 0.0.1
  ok    agtrace    @ 0.0.2
  ok    whychanged @ 0.1.0
  ok    tokencost  @ 0.0.3
== Workspace gates ==
  ok    ruff check
  ok    ruff format --check
  ok    mypy --strict
== Per-package tests ==
  ok    repox       33 passed
  ok    sts         30 passed
  ok    sts-app     29 passed
  ok    apr         53 passed, 4 skipped
  ok    apr-app     22 passed
  ok    agtrace     18 passed
  ok    whychanged  31 passed
  ok    tokencost   36 passed
READY TO RELEASE

Try a single tool

# Build an architecture model of any repo
repox build C:\path\to\some\repo

# Pick which tests should run for a change set
sts select --repo . --changed src\products\01-repo-xray\code\src\repox\analyzer.py

# Review a PR locally before pushing
apr review --repo . --title "Fix nullable fields" --description "..."

# Find culprits for an incident
whychanged explain --repo . --since 30m --service api

# Show LLM cost attribution
tokencost report --since 1d

Releases & PyPI

Each package versions independently. Tag format: <package>-v<version> (e.g. apr-v0.2.0, repox-v0.4.0). Tags trigger release.yml which:

Re-runs the full preflight (scripts/release.py --check).
Builds wheel + sdist with uv build --package <name>.
Publishes via PyPI Trusted Publishing (OIDC, no long-lived API tokens).

See RELEASE.md for the full process, including one-time PyPI Trusted Publisher setup per package.

Security & responsible disclosure

See SECURITY.md. In short: do not file public issues for security bugs; email the address in SECURITY.md and we'll respond.

API keys, GitHub App private keys, webhook secrets, and PyPI tokens are never committed to this repo. Local development uses .env files (gitignored). CI uses GitHub Actions secrets. PyPI publishing uses Trusted Publishing — no PyPI API tokens exist anywhere in this codebase.

Documentation

Wave overviews: src/waves/ — narrative plans for each wave
Per-product specs: src/products/NN-name/PRODUCT.md
Per-package READMEs: src/products/NN-name/code/README.md (and app/README.md for the GitHub App variants)
Wave 3 explainer: docs/wave-3-overview.md
Landing page (DevTrust Cloud): docs/landing/index.html — single self-contained HTML file. Open it in a browser to preview, drop it on any host (GitHub Pages, Netlify, Vercel, plain S3) when the domain is live.
Master plan: src/docs/

DevTrust Cloud — coming soon

Everything in this repo is free, open-source, and self-hostable. You can pip install any of the 8 packages today and run them on your own infrastructure forever.

If you'd rather not run two GitHub Apps, manage webhook secrets, host a control plane, and stitch together cross-repo dashboards yourself — DevTrust Cloud is the hosted version of the same stack. One install, every product wired together, a fleet-wide view of code trust, AI-cost attribution, agent traces, and incident causality across your whole engineering org.

What Cloud adds on top of the OSS:

Hosted sts-app and apr-app — install one GitHub App per org, no infrastructure to run.
Cross-repo dashboard — every repo's repox architecture model, apr review trends, sts test-selection efficiency, in one place.
Cross-service agtrace aggregation and tokencost attribution — see which team / feature / customer is burning LLM spend in real time.
whychanged as a webhook receiver — auto-ranked culprits posted to Slack the moment your incident-management tool fires.
SSO, audit logs, role-based access, on-prem option, support SLA, security review.

Free tier: all OSS packages, all features, individuals + public repos. Paid tier: private repos, fleet view, premium rule packs, enterprise auth.

Join the waitlist → (early-access pricing being finalized; waitlist members get the first 3 months free)

Status

Wave	State	Packages
Wave 1 (codebase understanding)	Shipped · on PyPI	`devtrust-repox` 0.4.0, `devtrust-sts` 0.0.3, `devtrust-apr` 0.2.0
Wave 2 (ship surfaces)	Shipped · on PyPI	`devtrust-sts-app` 0.0.3, `devtrust-apr-app` 0.0.1
Wave 3 (open & run)	Shipped · on PyPI	`devtrust-agtrace` 0.0.2, `devtrust-whychanged` 0.1.0, `devtrust-tokencost` 0.0.3
Wave 4 (compliance & governance)	In progress	`devtrust-agentguard` (scaffold)

Spec'd but not yet implemented

These products have written specs in src/products/NN-name/PRODUCT.md but no code yet. PRs and discussion welcome:

04-ci-local — local CI runner (spec only)
05-dep-upgrade-pilot — dependency upgrade pilot (spec only)

00-shared-platform/ is intentionally not a buildable package — it's the design doc for the shared infrastructure layer (auth, billing, runtime, dashboard) that lives in DevTrust Cloud, the commercial counterpart to this OSS monorepo. See DevTrust Cloud — coming soon above.

License: Apache-2.0. Owner: Abdullah Bakir (github.com/AbdullahBakir97).

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
.github		.github
docs		docs
scripts		scripts
src		src
.editorconfig		.editorconfig
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
.python-version		.python-version
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
README.md		README.md
RELEASE.md		RELEASE.md
SECURITY.md		SECURITY.md
pyproject.toml		pyproject.toml
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

DevTrust

Quick start

60-second integrated demo

What's in here

Wave 1 — codebase understanding

Wave 2 — ship surfaces

Wave 3 — open & run

Wave 4 — queued

Why one repo

Getting started

Prerequisites

One-time setup

Smoke check

Try a single tool

Releases & PyPI

Security & responsible disclosure

Documentation

DevTrust Cloud — coming soon

Status

Spec'd but not yet implemented

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

DevTrust

Quick start

60-second integrated demo

What's in here

Wave 1 — codebase understanding

Wave 2 — ship surfaces

Wave 3 — open & run

Wave 4 — queued

Why one repo

Getting started

Prerequisites

One-time setup

Smoke check

Try a single tool

Releases & PyPI

Security & responsible disclosure

Documentation

DevTrust Cloud — coming soon

Status

Spec'd but not yet implemented

About

Topics

Resources

License

Code of conduct

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages