@withboundary/contract

Your LLM already returns valid JSON. That doesn't mean it's correct.

@withboundary/contract enforces domain correctness — not just structure. It validates outputs against your rules, fixes failures automatically, and retries until the result is actually usable.

No more:

silent logic errors
invalid business decisions
brittle retry loops

Why this exists

LLM outputs fail in ways JSON validation can't catch:

{
  "tier": "hot",
  "score": 25
}

Valid JSON. Wrong for your system.

Your logic requires: hot leads must have score > 70.

Schema validation passes. Your system breaks.

Without a contract, this enters your system. With @withboundary/contract, it is rejected and repaired automatically.

Install

npm install @withboundary/contract zod

Quick example

import { enforce } from "@withboundary/contract";
import { z } from "zod";

const schema = z.object({
  tier: z.enum(["hot", "warm", "cold"]),
  score: z.number(),
});

const result = await enforce(schema, runLLM, {
  rules: [
    (d) => d.tier !== "hot" || d.score > 70
      || `hot leads require score > 70, got ${d.score}`,
  ],
});

if (result.ok) {
  result.data; // guaranteed correct
}

async function runLLM(attempt) {
  const res = await openai.chat.completions.create({
    model: "gpt-4o",
    messages: [
      { role: "system", content: attempt.instructions },
      { role: "user", content: "Score this lead..." },
      ...attempt.repairs,
    ],
  });

  return res.choices[0].message.content;
}

result.data is guaranteed to satisfy your schema and your rules.

Correctness is just a function

Schemas validate structure. Rules define what correct means for your domain.

rules: [
  // Cross-field correctness
  (d) => Math.abs(d.subtotal + d.tax - d.total) < 0.01
    || `subtotal + tax != total`,

  // Business logic
  (d) => d.tier !== "hot" || d.score > 70
    || `hot leads require score > 70`,

  // State constraints
  (d) => d.endDate > d.startDate
    || "end date must be after start date",
]

A rule returns true if it passes, or a string describing what's wrong. The string becomes part of the repair prompt — the model sees exactly what to fix.

What it does

The model proposes an output. The contract decides if your system accepts it.

If it fails:

The error is classified
A targeted repair is generated
The model retries with context

This repeats until the output is correct or retries are exhausted.

Failure categories

Category	Meaning
`EMPTY_RESPONSE`	Model returned nothing
`REFUSAL`	Safety refusal detected
`NO_JSON`	No JSON found in output
`TRUNCATED`	Incomplete/cut-off output
`PARSE_ERROR`	Invalid JSON
`VALIDATION_ERROR`	Schema mismatch
`RULE_ERROR`	Rule violation
`RUN_ERROR`	Execution error

Each category produces different repair instructions, so the model gets specific feedback — not a generic "try again."

Result type

type ContractResult<T> =
  | { ok: true; data: T; attempts: number; raw: string; durationMs: number }
  | { ok: false; error: ContractError }

No exceptions. Pattern match on ok.

Reusable contracts

Define once, reuse everywhere:

import { defineContract } from "@withboundary/contract";

const leadContract = defineContract({
  schema,
  rules: [
    (d) => d.tier !== "hot" || d.score > 70
      || `hot leads require score > 70`,
  ],
  retry: { maxAttempts: 4 },
});

const result = await leadContract.accept(runLLM);

Observability

Every attempt is structured.

const result = await enforce(schema, runLLM, {
  onAttempt: (event) => {
    console.log(event.attempt, event.category, event.durationMs);
  },
});

You know:

why it failed
which rule was violated
how many retries it took

For human-readable debugging:

import { createConsoleLogger } from "@withboundary/contract";

const result = await enforce(schema, runLLM, {
  logger: createConsoleLogger({ showCleanedOutput: true }),
});

Use cheaper models safely

Correctness is enforced by the contract — not the model.

Run smaller models, retry when needed, and only escalate if necessary. The contract is the safety net.

Engine primitives

For custom pipelines, the individual steps are exported:

Function	Purpose
`clean(raw)`	Normalize raw LLM output to JSON
`verify(data, schema, rules?)`	Validate against schema + rules
`classify(raw, cleaned)`	Categorize a failure
`repair(detail)`	Generate repair messages
`instructions(schema)`	Generate schema-driven prompt instructions

When not to use

Fully unstructured text (creative writing, essays)
Tasks without clear correctness criteria

This works best when "correct" can be defined.

Works with any LLM

Model-agnostic. Works with any provider that returns text — OpenAI, Anthropic, Google, Mistral, local models.

License

MIT

Links

Stop trusting LLM output. Start verifying it.

Name		Name	Last commit message	Last commit date
Latest commit History 56 Commits
.changeset		.changeset
.github/workflows		.github/workflows
examples		examples
src		src
tests		tests
.gitignore		.gitignore
API.md		API.md
CHANGELOG.md		CHANGELOG.md
EXAMPLES.md		EXAMPLES.md
LICENSE		LICENSE
README.md		README.md
package-lock.json		package-lock.json
package.json		package.json
tsconfig.build.json		tsconfig.build.json
tsconfig.json		tsconfig.json
vitest.config.ts		vitest.config.ts

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

@withboundary/contract

Why this exists

Install

Quick example

Correctness is just a function

What it does

Failure categories

Result type

Reusable contracts

Observability

Use cheaper models safely

Engine primitives

When not to use

Works with any LLM

License

Links

About

Uh oh!

Releases 5

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

@withboundary/contract

Why this exists

Install

Quick example

Correctness is just a function

What it does

Failure categories

Result type

Reusable contracts

Observability

Use cheaper models safely

Engine primitives

When not to use

Works with any LLM

License

Links

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 5

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages