Local-first regression intelligence for AI builders.
DriftCheck helps catch behavior drift when prompts, tools, RAG flows, SDKs, or models change. It runs locally first, writes JSON and markdown reports, and publishes a hosted proof card only when you choose.
```
npx @a2zai-ai/driftcheck init
npx @a2zai-ai/driftcheck check
```

During local package development:
```
npm install
npm run smoke
```

DriftCheck creates:

- `.driftcheck/checks/*.yml`: starter packs
- `.driftcheck/runs/latest.json`: the latest run results
- `driftcheck-report.md`: a markdown report
- Tool-Calling Reliability: schema-valid tool arguments, safe fallback behavior, and hallucinated tools.
- RAG Faithfulness: grounded answers, citations, missing-context refusal, and source scope.
- Model Migration: quality, cost, latency, and safety drift when moving between models.
To also write a live model comparison pack:
```
npx @a2zai-ai/driftcheck init --live
```

To run individual packs:

```
npx @a2zai-ai/driftcheck check --pack tool-calling
npx @a2zai-ai/driftcheck check --pack rag-faithfulness
npx @a2zai-ai/driftcheck check --pack model-migration
```

For packs with a live execution block, you can override the baseline and candidate models without editing YAML:
```
OPENAI_API_KEY="sk-..." npx @a2zai-ai/driftcheck check \
  --pack model-migration \
  --baseline-model gpt-4o-mini \
  --candidate-model gpt-4.1-mini
```

The same values can be set with environment variables:
```
DRIFTCHECK_BASELINE_MODEL=gpt-4o-mini \
DRIFTCHECK_CANDIDATE_MODEL=gpt-4.1-mini \
OPENAI_API_KEY="sk-..." \
npx @a2zai-ai/driftcheck check --pack model-migration
```

Static packs still run without API keys. Model overrides only affect packs that define `execution.provider`.
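As a rough mental model of the overrides above, resolution can be sketched as a simple fallback chain. The precedence order is an assumption here (CLI flag over environment variable over pack YAML) and is not DriftCheck's documented behavior:

```python
import os

def resolve_model(cli_value, env_var, yaml_value):
    """Pick a model name by assumed precedence: CLI flag, then env var, then pack YAML.

    Illustrative only; DriftCheck's actual resolution order is internal.
    """
    if cli_value:
        return cli_value
    return os.environ.get(env_var) or yaml_value

# Example: no CLI flag, env var set, hypothetical YAML default "gpt-4o".
os.environ["DRIFTCHECK_BASELINE_MODEL"] = "gpt-4o-mini"
print(resolve_model(None, "DRIFTCHECK_BASELINE_MODEL", "gpt-4o"))  # gpt-4o-mini
```

If none of the three sources is set, the function returns the YAML value unchanged (possibly `None`), which mirrors how static packs simply keep whatever their YAML declares.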
Use compare when you want to check a model migration without editing YAML first:
```
OPENAI_API_KEY="sk-..." npx @a2zai-ai/driftcheck compare \
  --baseline-model gpt-4o-mini \
  --candidate-model gpt-4.1-mini
```

This runs the built-in Live Model Compare pack and writes the same `.driftcheck/runs/latest.json` and `driftcheck-report.md` outputs.
```
npx @a2zai-ai/driftcheck summary --run .driftcheck/runs/latest.json
```

The GitHub Action writes this summary to the workflow run automatically, so PR authors can see the overall score, dimension scores, model pair, and cases needing review without opening artifacts.
Publishing is explicit. Reports stay local unless you run publish.
```
DRIFTCHECK_TOKEN="paste-token-here" npx @a2zai-ai/driftcheck publish --run .driftcheck/runs/latest.json --public
```

The hosted proof layer currently lives at A2ZAI:
```
DRIFTCHECK_API_URL="https://www.a2zai.ai" npx @a2zai-ai/driftcheck publish --run .driftcheck/runs/latest.json --public
```

Packs live in `.driftcheck/checks/*.yml`.
```yaml
id: tool-calling
name: Tool-Calling Reliability
category: tool-calling
description: Catch schema drift, hallucinated tool calls, and weak fallback behavior before agent changes ship.
cases:
  - name: Valid tool arguments
    dimension: quality
    weight: 3
    threshold: 80
    baselineOutput: "call_tool({ user: 'acct_123', action: 'refund_review' })"
    candidateOutput: "call_tool({ userId: 'acct_123', action: 'refund_review' })"
    expectedContains:
      - userId
      - action
    forbiddenContains:
      - malformed
      - undefined
```

Supported categories:
- `tool-calling`
- `rag-faithfulness`
- `model-migration`
Supported score dimensions:
- `quality`
- `safety`
- `latency`
- `cost`
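DriftCheck's exact scoring formula is internal. As an illustration only, the `weight` field in the pack YAML suggests a weight-averaged pack score over 0-100 case scores, which could look like:

```python
# Illustrative only: DriftCheck's real scoring formula is internal.
# Each case contributes a 0-100 score with a weight; a heavier case
# (e.g. weight: 3) pulls the average harder than a weight-1 case.

def weighted_score(cases):
    """cases: list of (score, weight) pairs; returns a 0-100 weighted average."""
    total_weight = sum(w for _, w in cases)
    if total_weight == 0:
        return 0.0
    return sum(s * w for s, w in cases) / total_weight

# Two cases: a weight-3 case scoring 90 and a weight-1 case scoring 50.
print(weighted_score([(90, 3), (50, 1)]))  # 80.0
```

Under this sketch, a per-case `threshold: 80` would flag the weight-1 case for review even though the aggregate clears 80.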
Static outputs work without API keys. To compare live OpenAI model responses, add an execution block and set OPENAI_API_KEY.
```yaml
execution:
  provider: openai
  baselineModel: gpt-4o-mini
  candidateModel: gpt-4.1-mini
  temperature: 0
  maxTokens: 140
```

After this repo is public, use:
```yaml
name: DriftCheck
on:
  pull_request:
jobs:
  driftcheck:
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v4
      - uses: a2zai-ai/driftcheck@v0
        with:
          fail-threshold: 70
          baseline-model: gpt-4o-mini
          candidate-model: gpt-4.1-mini
```

DriftCheck is local-first:
- Pack files stay in your repo.
- Reports are written locally.
- Publish is opt-in.
- Known secret patterns are redacted from generated reports before publish.
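The specific secret patterns DriftCheck redacts are not listed here. As a sketch of the idea, a redaction pass might scrub common key shapes from report text before anything leaves the machine; the pattern list below is hypothetical, not DriftCheck's actual rule set:

```python
import re

# Hypothetical patterns for illustration; DriftCheck's actual
# redaction rules are internal to the tool.
SECRET_PATTERNS = [
    re.compile(r"sk-[A-Za-z0-9_-]{8,}"),             # OpenAI-style API keys
    re.compile(r"(?i)bearer\s+[A-Za-z0-9._-]{8,}"),  # bearer tokens
]

def redact(text: str) -> str:
    """Replace anything matching a known secret pattern with [REDACTED]."""
    for pattern in SECRET_PATTERNS:
        text = pattern.sub("[REDACTED]", text)
    return text

print(redact("called with OPENAI_API_KEY=sk-abc123def456"))
# called with OPENAI_API_KEY=[REDACTED]
```

Running redaction on the report text (rather than trying to keep secrets out of model outputs) means even a prompt that echoes a key back cannot leak it through a published proof card.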
- npm package publication as `@a2zai-ai/driftcheck`
- standalone `a2zai-ai/driftcheck` public repo
- richer GitHub Action summaries
- more starter packs for agents, support bots, coding workflows, and RAG apps