-
Notifications
You must be signed in to change notification settings - Fork 1
Pay per crawl
Agent402 is two-sided. The main server lets agents buy tools. The
agent402-tollbooth
package is the inverse: it lets any site charge the AI bots that crawl it.
Put it in front of any site or API: human visitors browse free, while AI crawlers and agents pay per request — in USDC over x402, or for free by solving a proof-of-work. The open, self-hostable answer to Cloudflare's closed pay-per-crawl: no CDN lock-in, no Stripe, no Merchant-of-Record, no signup.
Express middleware — humans pass; known AI crawlers get 402:
import express from "express";
import { createTollbooth } from "agent402-tollbooth";
const app = express();
app.use(createTollbooth({ price: "$0.002" }));
app.get("/article", (_req, res) => res.send("…your content…"));
app.listen(3000);Reverse proxy — wrap any existing site, any language, zero code changes:
TOLLBOOTH_UPSTREAM=https://your-site.com node tollbooth/index.jscurl -A "Mozilla/5.0" localhost:4021/article # human -> 200, free
curl -A "ClaudeBot/1.0" localhost:4021/article # bot -> 402 Payment Required-
Who pays: by default, requests whose
User-Agentmatches a known AI/LLM crawler (GPTBot, ClaudeBot, CCBot, PerplexityBot, Bytespider, Google-Extended, …). Classic search indexers (Googlebot, Bingbot) are not charged, so SEO stays free. Override withbotUserAgents, or a customcharge(req)predicate. -
Free rail (proof-of-work): works out of the box, no wallet. A crawler
solves a single-use, resource-bound sha256 puzzle and retries with
X-Pow-Solution: <token>:<nonce>— the same hardened scheme the main server uses (see Paying with Compute). -
Paid rail (x402 USDC): set
payToand supply averifyX402hook wired to the standard x402 stack — settlement is reused, not reinvented (see Paying with x402).
UA matching is the default, but it's evadable — so the tollbooth lets you stop detecting bots and instead make access cost something (opt-in, defaults unchanged):
-
mode:"bots"(default) ·"all"(charge everyone but afree()match) ·"strict"(charge anything that isn't a real-browser request). A more sophisticated bot gains nothing — it pays or solves a proof-of-work like everyone else. -
adaptive: true: proof-of-work difficulty rises with load (capped), so a high-volume scraper pays escalating CPU per request regardless of disguise. Detection is an arms race; economics isn't. -
Analytics:
gate.stats()and a/__tollbooth/statsendpoint show requests, how many were charged, proof-of-work solves, and USDC collected — so you can see how much of your traffic is bots and what it's worth.
You don't have to flip the meter on cold. Observe-only mode classifies every
request as bot vs. human and counts it, but never returns 402. Deploy for a
week, watch the dashboard fill, then flip the flag off to start enforcing — no
other code changes.
app.use(createTollbooth({ observe: true })); // or TOLLBOOTH_OBSERVE=trueThe dashboard grows a "Would charge" counter, and bots see a
X-Tollbooth-Observed: would-charge header for log filtering.
By default, stats live in process memory: fine for single-instance Node,
useless on the edge or across replicas. Pass a statsSink to make them
survive restarts and aggregate across instances:
// Cloudflare Workers: aggregate across every isolate using KV
import { createEdgeTollbooth, kvStatsSink } from "agent402-tollbooth/edge";
const gate = createEdgeTollbooth({
secret: env.TOLLBOOTH_SECRET,
statsSink: kvStatsSink(env.TOLLBOOTH_KV),
});
ctx.waitUntil(gate.flush()); // ensure deltas land in KV after the responseOn the Cloudflare Worker entry, /__tollbooth and /__tollbooth/stats
are auto-mounted before the gate (so they're free and unblockable). With KV
bound, the dashboard shows one consistent number across every colo.
On Next.js / Vercel Edge, middleware can't host endpoints itself, so a
companion route handler + dashboard page ship as drop-in snippets in
deploy/nextjs/middleware.js. The middleware writes via httpStatsSink to
the route handler, which persists into Vercel KV / Upstash.
Build your own sink by implementing { incr(field, n?), flush?(), snapshot() }
— e.g. a Durable Object for strongly-consistent counters, or pipe into Cloudflare
Analytics Engine.
Ready-to-copy templates: deploy/cloudflare/
(Cloudflare Workers), deploy/nextjs/
(Next.js / Vercel middleware), and deploy/docker/
(docker compose up -d reverse proxy). The reverse proxy also serves a live
operator dashboard at /__tollbooth (and JSON at /__tollbooth/stats).
The big platforms shipped pay-per-crawl as a closed, fiat, you-must-be-on-our-CDN feature. This is the open, crypto-native, run-it-yourself version, built on the same 402 + proof-of-work machinery as the rest of Agent402. It turns the project into both sides of the x402 economy: agents buy capabilities, and sites charge agents.
Full docs and config table: tollbooth/README.md. MIT licensed.
agent402.tools · synced from wiki/ in the main repo — edit there, not here.