Verity

An open, domain-general engine for forensic surface comparison — a transparent, calibrated likelihood ratio, not a black-box "match."

Live app · How it works · Why · Docs · API reference

Verity compares 3-D surface-topography scans — bullet lands, cartridge-case breech-face impressions, striated and impressed toolmarks, and (in time) footwear and fractured surfaces — directly from X3P files (ISO 25178-72). It pairs a domain-general surface comparison with a transparent, calibrated likelihood-ratio decision layer and region-level attribution. The machine never reports a "match"; it reports an auditable weight of evidence, characterized on a named dataset.

Status: early. The X3P codec (verity-x3p) is landed and tested against real-world files; the engine, comparison API, and web app are live; the first-principles method is validated source-disjoint on Hamby-252 (see Validation). The full firearms-proof validation is in progress.

Why

Forensic firearm/toolmark comparison today is either subjective examiner judgment or proprietary black-box correlation (IBIS), while the open tooling is a pile of domain-specific R packages with no unified, deployable platform. Courts are increasingly skeptical of unqualified pattern-match testimony (Abruquah v. Maryland, 2023; the 2023 amendment to FRE 702), and no discipline yet has a well-characterized error rate (Cuellar et al., 2024). Verity's bet: one general, calibrated, explainable method — proven first where ground truth is strongest (firearms), then transferred across domains.

Design principles

Statistics decide, not a black box. A representation produces a score; a transparent, ELUB-bounded calibration turns that score into a reportable likelihood ratio. The report is interpretable regardless of how the score was computed — the firewall against the black box.
Reproducible by construction. Deterministic, version-pinned, content-hashed.
Open and language-independent. Built on the X3P standard; MIT/Apache-2.0.

What you get

Not a verdict — a calibrated ComparisonReport: a likelihood ratio with its verbal equivalent, a characterized cost (Cllr) on a named reference population, an ELUB bound on how strong a claim the data can support, and the region-level attribution that drove the score.

{
  "likelihood_ratio": 146.0,
  "verbal": "moderately strong support for same source",
  "lr_bound_log10": 2.16,
  "reference": { "name": "pooled bullet-land", "n_km": 146, "n_knm": 1755, "auc": 0.984, "cllr": 0.193 },
  "attribution": [ /* the matched regions — the explanation */ ],
  "scope_note": "Not a claim about the error rate of examination, which remains unknown."
}

The method — Congruent Matching Regions (CMR)

CMR generalizes Song's Congruent Matching Cells (the standard cartridge-case method) from 2-D cells and a fixed translation+rotation to regions of any dimension under any transformation group — so one algorithm scores striated, impressed, and (research) fractured marks. Partition a mark into regions, register each against the other mark, and count the regions that agree on one common geometry. The congruent regions are the attribution map.

Modality	Region	Transform group	Reduces to
Striated	1-D profile window	1-D translation	Chumbley / CMS
Impressed	2-D grid cell	2-D translation+rotation	= CMC
Fractured	3-D mesh patch	3-D rigid pose	(research)

Full write-up: docs/congruent-matching-regions.md.

Validation (honest)

Source-disjoint, first-principles (no learned representation). Under a barrel-disjoint protocol (no barrel in both train and test; reported per study, never pooled across makes), the production diag_contrast scorer yields on held-out barrels AUC ≈ 1.00 and test Cllr ≈ 0.11 on Hamby-252, and across the four NBTRD bullet studies (Hamby-252/173, PGPD Beretta, Phoenix Ruger) test Cllr ≈ 0.11–0.35 at AUC ≈ 0.97–1.00 — an informative, calibrated weight of evidence from metrology alone. (Cllr < 1 = informative; the Cllr − Cllr_min gap is the calibration loss the source-disjoint split exposes, answering the Cuellar et al. critique on its own terms.) The scorer was selected over the Phase-1 diag_mean and a multivariate fusion by an explicit barrel-disjoint ablation (verity-margin); verity-validation-report regenerates the full characterization — Tippett, DET, calibration, and the source-disjoint summary — as a court-ready PDF.
Learned representation (Phase-2b). Trained barrel-disjoint on 210 Hamby scans, it does not beat the cross-correlation baseline — it overfits (held-out AUC collapses to ≈ 0.67). Synthetic tests confirm the pipeline does learn given enough signal: a data limit, not a defect. Next: expand the dataset and retest.

Nothing here is a claim about the error rate of forensic examination, which remains unknown.

Repository map

A polyglot monorepo: one Rust codec core, thin language bindings, and the Python science + service stack on top.

Package	Lang	Role
`crates/verity-x3p`	Rust	Native X3P (ISO 25178-72) reader/writer — the format's single source of truth.
`bindings/python`	PyO3 + NumPy	Python binding to the core (bit-identical I/O).
`bindings/r/verityx3p`	extendr	R binding to the core (`x3ptools`-compatible layout).
`services/engine`	Python	Metrology preprocessing, registration, CMR, the calibrated-LR decision layer.
`services/api`	FastAPI	The comparison HTTP API serving the `ComparisonReport`.
`services/catalog`	Python	Normalized catalog + content-addressed store + ingestion (NBTRD/Figshare).
`services/web`	Next.js	verity.codes and the interactive comparison UI.

Quickstart

The X3P codec — Rust / Python / R

use verity_x3p::{read_x3p, write_x3p, WriteOptions};
let surface = read_x3p("scan.x3p")?;          // verifies the stored MD5
write_x3p(&surface, "copy.x3p", &WriteOptions::default())?;

import verity_x3p
s = verity_x3p.read_x3p("scan.x3p")            # s.data, s.mask are (ny, nx) NumPy arrays
verity_x3p.write_x3p(s, "copy.x3p", z_type="D")

library(verityx3p)
s <- read_x3p("scan.x3p")                      # s$surface is an nx-by-ny matrix
write_x3p(s, "copy.x3p")

A file written from any binding reads back bit-identically in every other.

Compare two marks over HTTP

curl -s -X POST https://api.verity.codes/compare \
  -F domain=striated \
  -F mark_a=@bulletA_land1.x3p -F mark_a=@bulletA_land2.x3p \
  -F mark_b=@bulletB_land1.x3p -F mark_b=@bulletB_land2.x3p

See the full docs and the interactive API reference.

Develop

cargo test -p verity-x3p                        # the Rust core

cd services/engine && uv venv --python 3.12 && uv pip install -e ".[dev]" && uv run pytest
cd services/api    && uv run --extra dev verity-api          # API on :8000
cd services/web    && pnpm install && pnpm dev               # web on :3000

Deployment (Vercel + a container host for the API) is documented in DEPLOY.md.

Status & roadmap

✅ verity-x3p native codec + Python/R bindings (bit-identical round-trip).
✅ Engine: ISO 16610 preprocessing, registration, the calibrated-LR decision layer, CMR; source-disjoint Hamby validation.
✅ Platform: comparison API + web app, live at verity.codes.
🔜 Expand the bullet/cartridge/toolmark datasets (NBTRD harvest) and retest the learned representation; CMR-2D → CMC parity on Fadul; TypeScript/Swift/Java codec bindings.

License

Dual-licensed under either of MIT or Apache-2.0, at your option. Bundled reference data carries its own upstream attribution — see services/api/verity_api/references/NOTICE.md.

Name		Name	Last commit message	Last commit date
Latest commit History 79 Commits
bindings		bindings
crates/verity-x3p		crates/verity-x3p
docs		docs
services		services
tests/fixtures		tests/fixtures
.dockerignore		.dockerignore
.gitignore		.gitignore
.vercelignore		.vercelignore
Cargo.lock		Cargo.lock
Cargo.toml		Cargo.toml
DEPLOY.md		DEPLOY.md
LICENSE-APACHE		LICENSE-APACHE
LICENSE-MIT		LICENSE-MIT
README.md		README.md
railway.json		railway.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Verity

Why

What you get

The method — Congruent Matching Regions (CMR)

Validation (honest)

Repository map

Quickstart

The X3P codec — Rust / Python / R

Compare two marks over HTTP

Develop

Status & roadmap

License

About

Licenses found

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Verity

Why

What you get

The method — Congruent Matching Regions (CMR)

Validation (honest)

Repository map

Quickstart

The X3P codec — Rust / Python / R

Compare two marks over HTTP

Develop

Status & roadmap

License

About

Topics

Resources

License

Licenses found

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages