feat: Hookdeck Outpost quickstarts and agent onboarding prompt (#815)
Merged
alexbouchardd merged 48 commits into feat/refactor-docs on Apr 12, 2026
Conversation
Add self-contained quickstarts for curl, TypeScript, Python, and Go against the managed API, with Settings → Secrets, env-based examples, and verification via Hookdeck Console and project logs. Nest Quickstarts nav under Hookdeck Outpost (above Self-Hosted) and add an agent prompt template page for dashboard copy/paste. Include TEMP-hookdeck-outpost-onboarding-status.md for GA tracking. Made-with: Cursor
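The env-based pattern the quickstarts follow can be sketched as below. The `/publish` path, payload shape, and helper name are illustrative assumptions, not the documented API contract; only `OUTPOST_API_BASE_URL` and `OUTPOST_API_KEY` come from this PR.

```typescript
// Sketch only: build a publish request from environment configuration.
// The `/publish` path and body shape are assumptions for illustration.
type PublishRequest = { url: string; headers: Record<string, string>; body: string };

function buildPublishRequest(env: Record<string, string | undefined>): PublishRequest {
  const base = env.OUTPOST_API_BASE_URL;
  const key = env.OUTPOST_API_KEY;
  if (!base || !key) {
    throw new Error("OUTPOST_API_BASE_URL and OUTPOST_API_KEY must be set");
  }
  return {
    url: `${base}/publish`,
    headers: {
      Authorization: `Bearer ${key}`,
      "Content-Type": "application/json",
    },
    // The curl quickstart notes an HTTP 202 acknowledgement on success.
    body: JSON.stringify({ topic: "user.created", data: { example: true } }),
  };
}
```

After publishing, the quickstarts verify delivery via the Hookdeck Console and project logs rather than inspecting the response body.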
- Claude Agent SDK runner with explicit --scenario/--scenarios/--all, per-run workspace
- Heuristic + LLM scoring vs scenario Success criteria; score-transcript 01-10
- Scenarios: basics, minimal apps, existing-app integration baselines
- CI slice (eval:ci), SCENARIO-RUN-TRACKER, prompt template "Files on disk" guidance
- Allow committing docs/**/.env.example under docs/.gitignore
- TEMP status and README updates

Made-with: Cursor
…tracker
- Agent prompt: language implies SDK; simplest path defaults to curl; option 2/3 framework mapping; warn on sdks.mdx vs per-language quickstarts.
- Curl quickstart: shell script notes (HTTP 202, portable body/status split).
- run-agent-eval: PreToolUse write guard, default EVAL_MAX_TURNS 80, local docs block aligned with prompt; scenario heuristic fix for publish data key escaping.
- Scenarios 01-10: realistic short user turns; success-criteria fixes where needed.
- SCENARIO-RUN-TRACKER: cleared run results for a fresh pass; action items reset.
- README and .env.example updates for the eval harness as applicable.

Made-with: Cursor
…lock
- Point local EVAL_LOCAL_DOCS guidance at the full curl quickstart instead
- Reword scenario 01 execution criteria to reference the quickstart/OpenAPI
Made-with: Cursor
- Expand concepts with the SaaS/platform flow; refine building-your-own-ui (API root, paths, no localhost:3333 in examples)
- Agent prompt: link concepts, UI guide, topics; tighten option-2 guidance
- Eval harness: local docs list includes concepts, building-your-own-ui, topics
- SCENARIO-RUN-TRACKER: scenario 05 assessment for 17-21-22 run, heuristic notes
- Minor scenario 05 doc tweak

Made-with: Cursor
GET /topics returns a JSON array of topic names (OpenAPI). The React snippet incorrectly treated items as objects with id and name, which misled readers and agent integrations. Use the string as key, value, and label to match the API and TypeScript SDK (topicsList → Array<string>). Made-with: Cursor
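The corrected mapping can be sketched in plain TypeScript (the actual change is a React snippet; `toTopicOptions` is a hypothetical helper name used here for illustration):

```typescript
// GET /topics (and the TypeScript SDK's topicsList) yield Array<string>,
// so the topic string itself serves as key, value, and label.
type TopicOption = { key: string; value: string; label: string };

function toTopicOptions(topics: string[]): TopicOption[] {
  // Wrong (the old snippet): treating items as { id, name } objects.
  // Right: items are plain strings.
  return topics.map((t) => ({ key: t, value: t, label: t }));
}
```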
- Add eval-harness.ts to parse eval-harness fenced JSON (git_clone + agentCwd). - Runner applies pre-steps per scenario, sets agent cwd and write guard to the run directory, passes scenario markdown once into runOneScenario. - Transcript meta includes evalHarness summary; document EVAL_SKIP_HARNESS_PRE_STEPS. Made-with: Cursor
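As a sketch, a scenario's fenced harness block might look like the following. The fence name and the two keys come from these commits; the exact schema is an assumption, and the clone URL shown is the scenario 09 baseline pinned later in this PR:

```eval-harness
{
  "git_clone": "https://github.com/fastapi/full-stack-fastapi-template",
  "agentCwd": "full-stack-fastapi-template"
}
```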
- Add ## Eval harness JSON (git_clone + agentCwd) for Next.js, FastAPI, Go baselines. - Turn 1 stays in-user voice (repo present) without naming the eval harness. - Align Automated eval and success criteria with pre-cloned workspace model. Made-with: Cursor
Expand the copy-paste agent template so existing apps with a product UI wire backend (BFF, server SDK) and frontend (calls own API only). Point to Concepts and Building your own UI before destination screens; allow API-only path when there is no customer UI. Made-with: Cursor
Pin scenario 09 to fastapi/full-stack-fastapi-template (React + Pydantic v2). Update scoreScenario09 baseline check, README index, TEMP onboarding status, and SCENARIO-RUN-TRACKER notes. Optional clone URL override: EVAL_FASTAPI_BASELINE_URL. Made-with: Cursor
Run 2026-04-09T22-16-54-750Z-scenario-09: heuristic 6/6, LLM pass. Point execution notes to prior Docker smoke on 20-48 stamp. Made-with: Cursor
Reword for customer-facing UI builders: clearer tenant/auth framing, configurable API base URL, less internal jargon and emphasis noise. Add implementation checklists for planning, destinations, activity, and safe rendering without duplicating the OpenAPI mapping tables. Made-with: Cursor
- Agent prompt: topic reconciliation, domain vs test publish, full-stack UI guidance; remove eval-flavored Turn 0 / next-run wording in the template.
- score-transcript: publish_beyond_test_only for 08/09/10 (domain publish).
- Scenarios + README: success criteria and Turn 1 nudges match the prompt.
- SCENARIO-RUN-TRACKER: scenario 09 review notes marked resolved.

Made-with: Cursor
Rewrite Turn 1 blockquotes as natural operator speech; drop Option 3, Turn 0, and prompt-section references. Align success-criteria wording with configured onboarding topics. Tracker references user-turn scripts. Made-with: Cursor
Add no_client_bundled_outpost_key and readme_or_env_docs checks to scoreScenario09 (align with full-stack success criteria). Made-with: Cursor
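A heuristic like no_client_bundled_outpost_key could look roughly like this; the function and path conventions below are hypothetical, not the real scoreScenario09 code:

```typescript
// Sketch: flag client-side source files that appear to reference the
// Outpost API key, which should stay server-side (BFF / server SDK).
function hasClientBundledKey(files: Record<string, string>): boolean {
  const isClientPath = (p: string) =>
    p.includes("/frontend/") || p.includes("/client/") || p.includes("/src/pages/");
  return Object.entries(files).some(
    ([path, source]) => isClientPath(path) && source.includes("OUTPOST_API_KEY"),
  );
}
```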
Write eval-run-started.json at scenario start; eval-failure.json on uncaught errors; eval-aborted.json on SIGTERM/SIGINT. Register signal handlers so interrupted runs leave a trace (SIGKILL still silent). Made-with: Cursor
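The interruption trace could be sketched as follows; the sidecar file name matches the commit, while the payload fields and function names are assumptions:

```typescript
import { writeFileSync } from "node:fs";
import { join } from "node:path";

// Sketch: leave a trace when a run is interrupted. SIGKILL cannot be
// trapped, so a kill -9 still leaves no sidecar.
function buildAbortPayload(signal: string, runDirectory: string) {
  return { signal, runDirectory, abortedAt: new Date().toISOString() };
}

function registerAbortSidecar(resultsDir: string, runDirectory: string): void {
  for (const signal of ["SIGTERM", "SIGINT"]) {
    process.on(signal, () => {
      writeFileSync(
        join(resultsDir, "eval-aborted.json"),
        JSON.stringify(buildAbortPayload(signal, runDirectory), null, 2),
      );
      process.exit(1); // exit after writing so the interrupted run still terminates
    });
  }
}
```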
Add docs/agent-evaluation/AGENTS.md (anti-leakage checklist), root AGENTS.md pointer, and a Cursor rule scoped to docs/agent-evaluation/. Document run sidecars, re-scoring, integration verification wording, and scenario 09 heuristic summary. Fix placeholder fixtures markdown. Made-with: Cursor
Restrict PreToolUse Read/Glob/Grep to the run directory (and docs/ when EVAL_LOCAL_DOCS). Block Bash that touches the monorepo root outside those areas; deny Agent unless EVAL_ALLOW_AGENT_TOOL. Split read vs write guard env vars. Write eval-started, eval-failure, and eval-aborted next to the run folder under results/runs/ so the agent cannot read harness metadata. SIGTERM/SIGINT abort payload includes runDirectory. Made-with: Cursor
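A read-guard predicate along these lines (names hypothetical, not the actual hook) keeps reads inside the run directory, with docs/ allowed only in local-docs mode:

```typescript
import { resolve, sep } from "node:path";

// Sketch of a path guard: a target is allowed only if it resolves inside
// the run directory (or docs/ when local docs mode is enabled). Appending
// `sep` prevents prefix tricks like /runs/r1-evil matching /runs/r1.
function isReadAllowed(
  target: string,
  runDir: string,
  opts: { localDocs?: boolean; docsDir?: string } = {},
): boolean {
  const within = (dir: string) => {
    const abs = resolve(target);
    const base = resolve(dir);
    return abs === base || abs.startsWith(base + sep);
  };
  if (within(runDir)) return true;
  if (opts.localDocs && opts.docsDir && within(opts.docsDir)) return true;
  return false;
}
```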
Describe sibling *.eval-*.json harness files and expanded PreToolUse permissions (read guard, bash, Agent tool). Made-with: Cursor
Record 2026-04-10 run, quickstart.sh artifact, execution smoke test, and sibling harness sidecar layout. Made-with: Cursor
Update README, OpenAPI contact URL, entrypoint migration hint, and example READMEs so public links match Outpost docs on Hookdeck. Made-with: Cursor
- Default EVAL_DOCS_URL to https://hookdeck.com/docs/outpost
- Replace the invalid destinations directory path with the overview + webhook mdoc
- Document placeholder examples in the agent prompt and fixtures

Made-with: Cursor
- Point scenario and script links at docs/content paths (.mdoc)
- Update SCENARIO-RUN-TRACKER for the latest heuristic-pass runs
- Revise README and AGENTS for the current layout
- Remove SKILL-UPSTREAM-NOTES (obsolete)

Made-with: Cursor
Log 2026-04-10T22-14-20-704Z-scenario-10 with heuristic/LLM/execution results and execution notes (Go baseline, signup smoke, Hookdeck probe). Made-with: Cursor
Add docs-agent-eval-ci.yml: scenarios 01+02 with EVAL_LOCAL_DOCS, heuristic + LLM judge, then execute-ci-artifacts.sh (curl + TypeScript) using OUTPOST_API_KEY. Trigger on docs content/apis, agent-evaluation harness (ignoring tracker/results README noise), TypeScript SDK, and workflow edits. Ignore .env.ci for local secret template; document secrets and execution in README. Made-with: Cursor
GitHub rejects paths + paths-ignore on the same event; drop paths-ignore. README: manual workflow_dispatch; note broader path matches. Made-with: Cursor
Node parseArgs treats a bare -- as starting positionals; --scenarios then failed with ERR_PARSE_ARGS_UNEXPECTED_POSITIONAL in CI. Made-with: Cursor
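For reference, this is the parseArgs behavior in question, with one possible workaround (a sketch; the runner's actual fix may differ). With strict parsing, the default, and no positionals allowed, everything after a bare `--` is treated as a positional and throws:

```typescript
import { parseArgs } from "node:util";

// Workaround sketch: strip a leading bare `--` before handing argv to
// parseArgs, so flags after it are still parsed as options.
function parseScenarioFlags(argv: string[]) {
  const args = argv[0] === "--" ? argv.slice(1) : argv;
  return parseArgs({
    args,
    options: {
      scenario: { type: "string" },
      scenarios: { type: "string" },
      all: { type: "boolean" },
    },
  });
}
```

Alternatively, setting `allowPositionals: true` accepts the tokens after `--` as positionals instead of throwing, at the cost of no longer treating `--scenarios` there as an option.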
- execute-ci-artifacts: EVAL_TEST_DESTINATION_URL fallback for the webhook URL; default OUTPOST_API_BASE_URL with `:=` (an empty .env no longer strips the version path); clearer errors on shell/TS failure
- Add smoke-test-execute-ci-artifacts.sh + `npm run smoke:execute-ci` (topics `*`, loads .env then .env.ci)
- CI execution step: OUTPOST_API_BASE_URL + OUTPOST_CI_PUBLISH_TOPIC
- README troubleshooting (404) and .env.example OUTPOST_CI_PUBLISH_TOPIC

Made-with: Cursor
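The `:=` detail matters because shell `${VAR=default}` applies only when the variable is unset, while `${VAR:=default}` also covers the empty string a blank .env line produces. The same unset-versus-empty distinction shows up in TypeScript (`??` vs `||`); the default URL below is illustrative:

```typescript
// Empty vs unset: `??` falls back only on null/undefined, so an empty
// OUTPOST_API_BASE_URL (e.g. from a blank .env line) would survive and
// strip the version path. `||` also falls back on "", matching shell `:=`.
function apiBaseUrl(env: Record<string, string | undefined>): string {
  const fallback = "https://example.test/api/v1"; // illustrative default
  return env.OUTPOST_API_BASE_URL || fallback;
}
```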
TL;DR
- Quickstarts and a copy-paste agent prompt template (`{{PLACEHOLDERS}}` for API base, topics, test destination, docs URL).
- `docs/agent-evaluation/`: 10 scenarios (basics → app stacks → integrate-into-existing), Claude Agent SDK runner, heuristic transcript scoring + LLM judge, lifecycle sidecars, read/bash sandbox, authoring rules (`AGENTS.md`, Cursor rule).
- CI slice runs scenarios against local docs (`EVAL_LOCAL_DOCS=1`), then executes generated curl + TypeScript against live Outpost; supports `workflow_dispatch` for manual runs.
- OpenAPI spec documents `DestinationSchemaField.key` so the published spec matches the API.

Stacking: This PR targets `feat/refactor-docs`. The broader docs platform / content restructure (Markdoc layout, nav, redirects migration, etc.) is not introduced here; it lives on the base branch. This branch adds quickstarts, the onboarding prompt, the eval harness, CI, and targeted doc updates on top of that foundation.

Goals
Hookdeck quickstarts and agent prompt
- `docs/content/quickstarts/` (curl, TypeScript, Python, Go, overview).
- `docs/content/quickstarts/hookdeck-outpost-agent-prompt.mdoc`: copy-paste template with explicit rules (no API key in chat, topics reconciliation, test destination, links into the rest of the docs).
- `{{DOCS_URL}}` and related copy aligned with production Hookdeck docs URLs where intended.
- Topics snippet aligned with the `string[]` API shape.

Agent evaluation (`docs/agent-evaluation/`)

- Scenarios `01`–`10` under `scenarios/`: basics (curl/TS/Python/Go), app templates (Next.js, FastAPI, Go HTTP), existing-app integration (08–10).
- Runner (`run-agent-eval.ts`), harness (`eval-harness.ts`), declarative Eval sections / pre-steps, sandboxed read/bash.
- Scoring: `score-transcript.ts` (per-scenario heuristics, including 08–10 `publish_beyond_test_only`), optional `llm-judge.ts` against each scenario's Success criteria.
- `SCENARIO-RUN-TRACKER.md`, `results/` layout + templates, `.env.example`, `AGENTS.md`, `.cursor/rules/agent-evaluation-authoring.mdc`.
- `execute-ci-artifacts.sh` runs generated curl + TypeScript against secrets; `smoke-test-execute-ci-artifacts.sh` + `npm run smoke:execute-ci` for local verification.

Local runs are opt-in per scenario (`--scenario`/`--scenarios`/`--all`); the full suite and 08–10 are slow by design (clones, installs, multi-turn agents).

Here are some examples of what was built:
Basic with some UI
Integration into SaaS templates
CI
- `.github/workflows/docs-agent-eval-ci.yml`: Docs agent eval (CI slice).
- Triggers: `push` to `main`, `pull_request` (same-repo only for the eval job), and `workflow_dispatch`.
- `ci-eval.sh` → `npm run eval:ci` (scenarios 01, 02 with heuristic + LLM judge), then `execute-ci-artifacts.sh` with `OUTPOST_API_KEY`, the webhook URL from `EVAL_TEST_DESTINATION_URL`, and explicit `OUTPOST_API_BASE_URL`/`OUTPOST_CI_PUBLISH_TOPIC`.

API / SDK docs

- `docs/apis/openapi.yaml`: document `DestinationSchemaField.key` (and related spec consistency).

How to review
- Read `hookdeck-outpost-agent-prompt.mdoc` and one language quickstart end-to-end; check placeholders and links.
- Read `docs/agent-evaluation/README.md` + `AGENTS.md`; spot-check a scenario file's Success criteria vs `score-transcript.ts` / judge behavior.
- CI needs `ANTHROPIC_API_KEY`, `EVAL_TEST_DESTINATION_URL`, and `OUTPOST_API_KEY`; optional manual `workflow_dispatch` after merge if needed.
- Diff against `feat/refactor-docs` in the PR "Files changed" tab for the exact delta (this description highlights the onboarding/eval work; other touched files should match reviewer expectations).

Follow-ups (optional)

- Keep `SCENARIO-RUN-TRACKER.md` updated as scenarios or the prompt change.