tool-lab

An interactive Claude tool-use sandbox.

Define tools, send a user message, mock the tool responses — and watch the agent loop play out live. The thing you want when you're building a Claude agent and don't want to stand up real tool implementations yet.

Bring your own key. No backend. Requests go from your browser straight to Anthropic and nowhere else.

Live → tool-lab-bice.vercel.app

Why

Building an agent with Claude tool use means a lot of plumbing: write the schema, run the call, parse the tool_use block, plug in a tool implementation, feed the result back, run again. Most of that plumbing is what you'd skip through anyway when you're still iterating on which tools your agent needs and how the prompt steers them.

tool-lab takes the plumbing out: you write the tool schemas, hit run, and when Claude calls a tool you type the result by hand. The loop runs end to end with you in the role of every tool. Fast iteration on agent design.
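To make the hand-mocked loop concrete, here is a minimal sketch of one step of it, written as pure functions. The block shapes (tool_use, tool_result, tool_use_id, is_error, stop_reason) follow the Anthropic Messages API; the names pendingToolUses, toolResultMessage, ModelReply, and TypedResult are hypothetical and are not taken from this repo's source.

```typescript
// Content block shapes as used by the Anthropic Messages API.
type TextBlock = { type: "text"; text: string };
type ToolUseBlock = { type: "tool_use"; id: string; name: string; input: unknown };
type ToolResultBlock = {
  type: "tool_result";
  tool_use_id: string;
  content: string;
  is_error?: boolean;
};

// Hypothetical names for illustration only.
type ModelReply = { stop_reason: string; content: Array<TextBlock | ToolUseBlock> };
type TypedResult = { content: string; isError?: boolean };

// Which tool calls is the model waiting on? An empty array means the loop is done.
function pendingToolUses(reply: ModelReply): ToolUseBlock[] {
  if (reply.stop_reason !== "tool_use") return [];
  return reply.content.filter((b): b is ToolUseBlock => b.type === "tool_use");
}

// Turn the results a human typed (keyed by tool_use id) into the next user message.
function toolResultMessage(
  pending: ToolUseBlock[],
  typed: Map<string, TypedResult>,
): { role: "user"; content: ToolResultBlock[] } {
  const content = pending.map((call) => {
    const result = typed.get(call.id);
    if (!result) throw new Error(`no result typed for ${call.name} (${call.id})`);
    const block: ToolResultBlock = {
      type: "tool_result",
      tool_use_id: call.id,
      content: result.content,
    };
    if (result.isError) block.is_error = true; // simulate a failing tool
    return block;
  });
  return { role: "user", content };
}
```

Under these assumptions, a driver would call the model, append the assistant reply to the history, show one input per entry of pendingToolUses, append the message from toolResultMessage, and call the model again until pendingToolUses returns an empty array.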

What it does

Define tools — paste an array of { name, description, input_schema } in the live JSON editor. Invalid JSON disables Run.
Send a user message — pre-filled with a sensible sample, or type your own.
Run the loop — Claude streams a response. If it includes tool_use blocks, the page shows an input form for each one.
You play every tool — type whatever result you want, toggle is_error for failure cases, or hit fill suggestion for a pre-canned answer.
Continue — your results get sent back as tool_result blocks; Claude streams again. Loop continues until it returns plain text (end_turn).
BYOK, zero backend — requests go straight from your browser to Anthropic; the key lives in localStorage.
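As an illustration of the first item in the list above, the sketch below shows one way the editor contents could be checked before Run is enabled. The README only states that invalid JSON disables Run; the extra shape checks, and the names parseTools, ParseResult, and isRecord, are assumptions made for this example. It also treats description as required, matching the { name, description, input_schema } shape named above.

```typescript
// Tool definition shape as described above; input_schema is a JSON Schema object.
type Tool = {
  name: string;
  description: string;
  input_schema: { type: "object"; [key: string]: unknown };
};

// Hypothetical result type: either the parsed tools or a message for the editor.
type ParseResult = { ok: true; tools: Tool[] } | { ok: false; error: string };

function isRecord(value: unknown): value is Record<string, unknown> {
  return typeof value === "object" && value !== null && !Array.isArray(value);
}

function parseTools(source: string): ParseResult {
  let parsed: unknown;
  try {
    parsed = JSON.parse(source);
  } catch (err) {
    // Syntax errors are the case the README says disables Run.
    return { ok: false, error: err instanceof Error ? err.message : "invalid JSON" };
  }
  if (!Array.isArray(parsed)) {
    return { ok: false, error: "expected a JSON array of tools" };
  }
  // Assumed extra checks: each entry carries the three documented fields.
  for (let i = 0; i < parsed.length; i++) {
    const tool: unknown = parsed[i];
    if (!isRecord(tool) || typeof tool.name !== "string" || tool.name === "") {
      return { ok: false, error: `tool ${i}: missing name` };
    }
    if (typeof tool.description !== "string") {
      return { ok: false, error: `tool ${i}: missing description` };
    }
    if (!isRecord(tool.input_schema) || tool.input_schema.type !== "object") {
      return { ok: false, error: `tool ${i}: input_schema must have type "object"` };
    }
  }
  return { ok: true, tools: parsed as Tool[] };
}
```

A Run button could then be bound to parseTools(editorText).ok, with the error string shown next to the editor.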

Why this is useful

Design tools before you build them. See what the model actually asks for before you write the tool implementation.
Stress-test prompts. Make tools error, return garbage, return huge payloads — see how the agent recovers.
Demo agents. Drive a multi-step loop in front of someone with full control over what every tool "returns."

How it works

src/
  app/
    page.tsx          orchestration: state machine for the loop
    layout.tsx        metadata, OG card, JSON-LD
    globals.css       design tokens, dark theme
  components/
    ConfigPane.tsx    model · system · tools JSON · sample / reset
    ConversationView.tsx   the running message log
    InputArea.tsx     contextual: composer / tool-result inputs / stop
    KeyDialog.tsx     BYOK key entry
  lib/
    anthropic.ts      fetch-based streaming + SSE parser with tool_use support
    pricing.ts        tier-aware cost model
    sample.ts         the built-in sample (system, tools, user message)
    storage.ts        localStorage helpers
    types.ts          Message, Block, Tool, PendingToolUse

The SSE parser handles tool_use blocks specifically: it accumulates the input_json_delta events for each block and parses the full JSON at content_block_stop. Text streams live, while tool_use cards appear only once their input is complete.
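The accumulation step described above can be sketched as follows. This is not the code in lib/anthropic.ts; the class name ToolUseAccumulator and its push method are hypothetical. The event and delta shapes (content_block_start, content_block_delta with partial_json, content_block_stop, keyed by index) follow the Anthropic streaming format. Treating an empty accumulated string as an empty input object is an assumption of this sketch.

```typescript
// The subset of streaming events that matters for tool_use blocks.
type StreamEvent =
  | {
      type: "content_block_start";
      index: number;
      content_block:
        | { type: "text"; text: string }
        | { type: "tool_use"; id: string; name: string; input: unknown };
    }
  | {
      type: "content_block_delta";
      index: number;
      delta:
        | { type: "text_delta"; text: string }
        | { type: "input_json_delta"; partial_json: string };
    }
  | { type: "content_block_stop"; index: number };

type ToolUse = { id: string; name: string; input: unknown };

// Hypothetical helper: buffers partial JSON per block index.
class ToolUseAccumulator {
  private open = new Map<number, { id: string; name: string; json: string }>();

  // Returns a completed tool_use when its block closes, otherwise null.
  push(event: StreamEvent): ToolUse | null {
    switch (event.type) {
      case "content_block_start": {
        const block = event.content_block;
        if (block.type === "tool_use") {
          this.open.set(event.index, { id: block.id, name: block.name, json: "" });
        }
        return null;
      }
      case "content_block_delta": {
        const delta = event.delta;
        const pending = this.open.get(event.index);
        // Fragments are not valid JSON on their own, so only concatenate here.
        if (pending && delta.type === "input_json_delta") {
          pending.json += delta.partial_json;
        }
        return null;
      }
      case "content_block_stop": {
        const pending = this.open.get(event.index);
        if (!pending) return null; // a text block closed; nothing to emit
        this.open.delete(event.index);
        // Assumption: no fragments means the tool was called with no arguments.
        const input: unknown = pending.json === "" ? {} : JSON.parse(pending.json);
        return { id: pending.id, name: pending.name, input };
      }
    }
  }
}
```

Deferring JSON.parse until the block closes is what makes the tool_use card appear all at once: a partial fragment such as {"city": "Pa cannot be parsed, so there is nothing to render until the block is complete.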

Run locally

npm install
npm run dev
# open http://localhost:3000

You'll need an Anthropic API key — create one at console.anthropic.com.

Deploy

Static-friendly, no environment variables. One-click deploy on Vercel.

Read the story

The case for prototyping agents by hand-mocking tool responses before writing the real implementations:

Build the sandbox before you write a single tool — why "tool implementations are not the hard part of agent development; tool design is," with a worked example where 3 of 4 initial tools didn't survive the first sandbox session — and the fifteen-minute exercise that saves a day of rework.

A small suite

Five tools for seeing what Claude is doing, built together with a shared design language:

claudoscope — x-ray your Claude API calls
agent-replay — replay a static agent trace
prompt-lab — A/B test prompts side by side
tool-lab — interactive tool-use sandbox (this one)
context-lens — see a Claude prompt before you ship it

Tech

Next.js 16 · React 19 · TypeScript · Tailwind CSS v4 · Framer Motion

License

MIT — see LICENSE.
