What are people at the top of the game building with AI agents, and how are they doing it?
Are they Claudemaxxing with 8 terminals open at once? Or adversarially testing Opus 4.7 generated code with OpenAI Codex? Do they define suites and swarms of sub-agents, or use AGENTS.md and agent skills?
What do they love about building with agents? What do they hate? What tips and tricks do they use to supercharge their workflows?
Thomas Wiecki (PyMC Labs) and Hugo Bowne-Anderson (Vanishing Gradients) are on a mission to find out. Think Excel World Championships meets Eurovision.
This repo turns each episode into browsable, forkable artifacts: markdown skills, workflow writeups, and tool references you can read, copy, and adapt to your own setup.
You can install the skills in this repo with:
npx skills add https://github.com/hugobowne/show-us-your-agent-skills
Install one skill only:
npx skills add https://github.com/hugobowne/show-us-your-agent-skills --skill explain
Check for updates:
npx skills check
npx skills update
These skills are snapshots from the corresponding live streams. Creators often iterate on their own versions afterwards, so check each skill's README for instructions to pull the latest from the creator when one is linked.
Field notes: Wes McKinney, Jeremiah Lowin, Randy Olson.
| Skill | What it does | Guest | Watch |
|---|---|---|---|
| explain | Agent narrates what it just did, like a teammate handing off. | Jeremiah Lowin (Prefect, FastMCP) | 00:46:14 |
| github-reply | Replies to GitHub contributors in your voice, no "Great work, but rejected" sandwiches. | Jeremiah Lowin (Prefect, FastMCP) | 00:54:08 |
| ship-it | Re-trains "ship it" to mean open a PR, not merge. | Jeremiah Lowin (Prefect, FastMCP) | 00:54:52 |
| high-signal-chart-workflow | Turns a one-line idea into a Tufte-style chart, with an LLM-as-judge verifier loop. | Randy Olson (Goodeye Labs, r/dataisbeautiful) | 01:12:37 |
| 8-bit-video-gen | Turns guest headshots into short 8-bit pixel-art video clips for livestream intros and cutaways. | Show Us Your Agent Skills | Episode 1 |
| Workflow | What it does | Guest | Watch |
|---|---|---|---|
| agentic-software-factory | Run several agent projects in parallel while background review agents read every commit and maintain a fix queue. | Wes McKinney (Posit, pandas) | 00:27:14 |
| second-brain | Feed a personal agent memory with daily voice memos and use an editable memory substrate for asynchronous work. | Jeremiah Lowin (Prefect, FastMCP) | 00:35:50 |
Field notes: Hilary Mason, Bryan Bischof, Eric Ma, Tomasz Tunguz.
| Skill | What it does | Guest | Watch |
|---|---|---|---|
| prompt-refinement | Interview the user's intent, ask for three variations at different magnitudes of change, score against a rubric you wrote up front. | Hilary Mason (Hidden Door) | 01:01:00 |
| marimo-pair | A coding agent drives a reactive Marimo notebook through a bash bridge into the Python kernel, for human-in-the-loop EDA. | Eric Ma (Moderna) | 00:11:57 |
| Workflow | What it does | Guest | Watch |
|---|---|---|---|
| agentic-eda | Human-in-the-loop EDA: agent renders the next plot, human picks the next question, every claim backed by an artifact. | Eric Ma (Moderna) | 00:23:27 |
| eval-driven-charts | Build an agent-facing chart library by generalising eval failures into features; the package can never regress on an eval it once passed. | Bryan Bischof (Theory Ventures) | 01:25:11 |
| weekly-gremlins | Three agent personas pull from a bad-ideas backlog, pitch and critique each other, and write design docs for moonshots no roadmap would schedule. | Hilary Mason (Hidden Door) | 01:14:20 |
| local-first-agents | Default to a local model and thin harness, reaching for cloud inference only for named exceptions. | Tomasz Tunguz (Theory Ventures) | 02:07:42 |
Episode 3: Matthew Honnibal, Eleanor Berger, Nico Gerold, Alan Nichol, Vincent Warmerdam, Paul Iusztin
Field notes: Matthew Honnibal, Eleanor Berger, Nico Gerold, Alan Nichol, Vincent Warmerdam, Paul Iusztin.
| Skill | What it does | Guest | Watch |
|---|---|---|---|
| try-except | Reads a Python codebase and tightens every try/except so the try covers only what can fail and the except catches the right exception. |
Matthew Honnibal (spaCy, Explosion) | 00:12:09 |
| pre-mortem | Reads production code, finds where it is fragile, and writes post-mortems for bugs that have not happened yet but a plausible change could introduce. | Matthew Honnibal (spaCy, Explosion) | 00:14:10 |
| mutation-testing | Measures test-suite strength by introducing deliberate bugs one at a time and reporting which ones no test caught. | Matthew Honnibal (spaCy, Explosion) | 00:14:10 |
| here-now | Publishes HTML pages, files, and whole sites to live URLs without leaving the terminal. | Eleanor Berger (Jimini Health) | 00:45:55 |
| anki-connect | Drives Anki through the AnkiConnect API, gating every note- or card-modifying operation behind explicit confirmation. | Eleanor Berger (Jimini Health) | 00:49:46 |
| impeccable | Hands a coding agent a full frontend design language so it builds production-grade interfaces instead of generic ones. | Eleanor Berger (Jimini Health) | 00:50:02 |
| youtube-watch-later-gist-summaries | Reads your YouTube Watch Later playlist, summarises every video from its transcript, and publishes each summary as a secret gist. | Eleanor Berger (Jimini Health) | 00:52:57 |
| thread-postmortem | Introspects a thread that went sideways, traces each misstep to the instruction behind it, and proposes edits biased toward deletion. | Nico Gerold (Sourcegraph, Amp) | 01:59:04 |
| remotion-video | Encodes a builder's design judgment for programmatic video, so Claude turns a few minutes of recorded audio into a finished explainer. | Alan Nichol (Rasa) | 02:46:00 |
| research | Builds and queries a persistent LLM-curated research wiki from Obsidian, Readwise, NotebookLM, GitHub repos, and supplied sources. | Paul Iusztin (Decoding AI) | 02:19:52 |
| Workflow | What it does | Guest | Watch |
|---|---|---|---|
| personal-agent-harness | Run a personal agent on isolated spare hardware, reachable through Discord or WhatsApp, with autonomy granted gradually. | Eleanor Berger (Jimini Health) | 00:47:50 |
Vincent Warmerdam's segment uses notebooks as a shared canvas for humans and agents; his Marimo Pair skill shipped in Episode 2 (marimo-pair).
Field notes: Hamel Husain, Chris Fonnesbeck, Doug Turnbull.
| Workflow | What it does | Guest | Watch |
|---|---|---|---|
| skill-scepticism | Review shared agent skills before trusting, adapting, replacing, or rejecting them. | Hamel Husain (Parlance Labs) | 00:22:32 |
| plan-review-implementation-review | Ask an agent for a plan, audit it with review plans, implement only after the plan is clean, then audit the finished code with review implementation. |
Chris Fonnesbeck (PyMC Labs) | 01:05:53 |
| auto-research-agentic-search | Let an agent experiment with search-code patches while hidden validation decides what survives. | Doug Turnbull | 01:41:07 |
Friday, June 19, 2026, 9:00 AM - 10:30 AM AEST (GMT+10), live on YouTube.
Joined by John Berryman (Arcturus Labs, early engineer on GitHub Copilot, O'Reilly author), Isaac Flath (Kentro Tech, ex-Answer.AI), and Matt Palmer (Conductor, ex-Replit).
Vanishing Gradients is a podcast, workshop series, blog, and newsletter focused on what you can build with AI right now. Over 70 episodes with expert practitioners from Google DeepMind, Netflix, Stanford, and elsewhere. Hundreds of hours of free, hands-on workshops. All independent, all free. Subscribe on Substack.
