presentation-forge

Build presentations by talking to an agent. You describe what you want; the agent maintains a small set of plain-text files that fully describe the deck, and renders the final .pptx from them. The PPTX is just a build artifact — your real source of truth is the text.

This is not a CLI you run by hand. It's a set of agent skills your AI assistant uses on your behalf. You stay in the conversation; the agent edits the files, generates the images, and produces the deck.

What's in this repo

Three agentskills.io-compatible skills your agent installs once and then uses together:

| Skill | What it does for you |
| --- | --- |
| `presentation-forge` | Owns the deck spec (`story.md`, `slides.yaml`, `images.yaml`, `theme.yaml`, `selections.yaml`) and produces draft + final PPTX. |
| `image-generator` | Generates AI image variants from a YAML brief using Azure AI Foundry models. |
| `pptx-render` | Composes slides onto your corporate `.potx`/`.pptx` template so the deck looks native. |

Install once:

gh skill install tkubica12/presentation-forge presentation-forge
gh skill install tkubica12/presentation-forge image-generator
gh skill install tkubica12/presentation-forge pptx-render

Image generation needs Azure AI Foundry credentials — the image-generator skill walks you through .env setup.

Recommended repository structure

For a shared presentation repo, we recommend separating reusable talk blueprints, concrete deliveries, and visual assets:

repo/
  talks/
    <talk-name>/
      story.md
      slides.yaml
      images.yaml
      theme.yaml
      selections.yaml
      build/
  deliveries/
    <delivery-name>/
      story.md
      slides.yaml
      images.yaml
      theme.yaml
      selections.yaml
      build/
  pptx-assets/
    design-templates/
      <template>.potx
    slide-libraries/   # reserved for future reuse/import workflows
    brand-assets/
      <logos, icons, source graphics>

Use talks/ for reusable source talks you expect to adapt many times.
Use deliveries/ for customer, event, or internal one-off decks derived from a talk or created for one situation.
Use pptx-assets/design-templates/ for corporate .potx / .pptx templates.
Keep pptx-assets/slide-libraries/ in the repo layout now even though slide import/reuse is not implemented yet.

How a deck gets built — the conversation flow

Each step is a chat with the agent. Each step ends with a file you and the agent can both read and edit. You never need to know the CLI; the agent handles it.

1. The story — `story.md`

You tell the agent what the talk is about. The agent asks a handful of clarifying questions and writes a tight prose narrative.

"Help me draft a 10-minute internal pitch on Aurora Coffee's Q3 launch. Audience is execs. Goal is to get budget approval."

story.md is prose, not slides: the audience, the central claim, the 3–5 beats that carry it, the call to action, and any sources to ground it in. You're done when you can read it aloud and it flows.

Multilingual tip. If the slides will be presented in a different language than story.md itself, fill in the ## Localization section the scaffold provides — target language, tone, terms to keep untranslated, translation philosophy, phrasing preferences. Keep story.md in whatever language is most comfortable for you to edit (usually English) and let this section steer the slide output.

2. The slide structure — `slides.yaml`

You ask the agent to turn the story into slides. Together you decide, slide by slide:

Layout — cover (hero title), bullets, bullets-with-image, full-bleed-image, quote, two-column, image-grid, … Discuss what fits each beat.
Title and bullets — short and scannable; the agent drafts, you push back.
Where images belong — not every slide needs art. Decide which slides carry a visual and give that visual a stable name (an image_ref).
Speaker notes — if you want any; they end up in PPTX notes.

Iterate in chat: "slide 4 should be a comparison, not bullets", "drop slide 7, merge into 8", "add a quote slide after the intro".
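One consistency check worth running between steps 2 and 3 is that every image_ref named in slides.yaml has a matching brief in images.yaml. The Python sketch below assumes both files have already been parsed into plain lists and dicts. The `id` and `image_ref` keys follow the field names this README uses, but the exact nesting and the helper name `missing_briefs` are illustrative assumptions, not the skill's actual schema.

```python
def missing_briefs(slides, briefs):
    """Return image_refs used by slides that have no brief, in slide order."""
    missing = []
    for slide in slides:
        ref = slide.get("image_ref")  # slides without art simply omit the key
        if ref is not None and ref not in briefs and ref not in missing:
            missing.append(ref)
    return missing


# Hypothetical parsed content of slides.yaml and images.yaml.
slides = [
    {"id": "intro", "layout": "cover", "title": "Aurora Coffee Q3", "image_ref": "hero"},
    {"id": "why-now", "layout": "bullets", "title": "Why now", "bullets": ["..."]},
    {"id": "ritual", "layout": "full-bleed-image", "title": "The ritual", "image_ref": "pour-over"},
]
briefs = {"hero": {"description": "..."}}

print(missing_briefs(slides, briefs))  # ['pour-over']
```

An empty result means every visual named in the slide structure is ready to be briefed in step 3.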

3. The image briefs — `images.yaml`

For every image_ref you used, you tell the agent what the image should look like. Three layers:

General visual style at the top (common_requirements) — palette, mood, lighting, framing. Treat this as the "house style" for the whole deck. Spend time here; it affects every image.
Per-image description — what this specific image shows.
Variations — 2–4 alternate takes per image so you have something to choose between (e.g. "close-up vs wide shot vs top-down").

The agent fans these out across multiple AI models so you typically get several candidates per image.
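The three layers and the fan-out can be pictured as a simple cross product. This is a minimal Python sketch under stated assumptions: the dict shapes, the `fan_out` name, and the way the prompt is joined are all hypothetical, and the model names are placeholders rather than real deployments.

```python
from itertools import product


def fan_out(common_requirements, images, models):
    """Yield one generation job per (image, model, variation) combination."""
    for (ref, brief), model in product(images.items(), models):
        for index, variation in enumerate(brief["variations"]):
            # House style is appended to every prompt, so it shapes every image.
            prompt = " ".join([brief["description"], variation, common_requirements])
            yield {"image_ref": ref, "model": model, "variation": index, "prompt": prompt}


common = "Warm palette, soft morning light, shallow depth of field."
images = {
    "hero": {
        "description": "A barista handing a cup across a wooden counter.",
        "variations": ["Close-up on the hands.", "Wide shot of the whole bar."],
    },
}
models = ["model-a", "model-b"]  # placeholder names (assumption)

jobs = list(fan_out(common, images, models))
print(len(jobs))  # 1 image x 2 variations x 2 models = 4
```

The multiplication is the reason to keep variations to a handful per image: each extra variation adds one candidate per model.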

4. The draft deck

You ask the agent to build the draft. It generates all the images (cached, resumable — fine to interrupt and continue later) and produces build/draft.pptx. The draft contains every image variant as a separate slide, each labeled with its variant tag.
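One common way to make generation cached and resumable is to derive each output file name from a hash of everything that influences the image, then skip any job whose file already exists. The sketch below illustrates that idea only; the hashing scheme, file naming, and function names are assumptions, not a description of how the image-generator skill actually stores its cache.

```python
import hashlib
import json
import tempfile
from pathlib import Path


def cache_path(cache_dir, job):
    """Derive a stable file path from everything that affects the image."""
    payload = json.dumps(job, sort_keys=True).encode("utf-8")
    digest = hashlib.sha256(payload).hexdigest()[:16]
    return Path(cache_dir) / f"{job['image_ref']}-{digest}.png"


def pending_jobs(cache_dir, jobs):
    """Return only the jobs whose output file does not exist yet."""
    return [job for job in jobs if not cache_path(cache_dir, job).exists()]


with tempfile.TemporaryDirectory() as tmp:
    jobs = [
        {"image_ref": "hero", "model": "model-a", "prompt": "warm, close-up"},
        {"image_ref": "hero", "model": "model-a", "prompt": "warm, wide shot"},
    ]
    cache_path(tmp, jobs[0]).write_bytes(b"")  # pretend the first image exists
    print(len(pending_jobs(tmp, jobs)))  # 1
```

Under this scheme an interrupted run resumes where it stopped, and editing one brief changes only that image's hash, so the others are not regenerated.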

You open draft.pptx in PowerPoint and review.

5. Picking variants — `selections.yaml`

You walk through the draft and tell the agent which variant you want per slide. Easiest way: copy the variant label from the draft slide and paste it into chat.

"For 'hero', take the gpt-image-2 v02 i00 variant."
"For 'pour-over', none of these work — make it warmer, top-down, less clinical, and regenerate."
"Slide 4's title is too long — tighten it."

The agent handles each kind of feedback differently:

| You say | Agent does |
| --- | --- |
| Pick a specific variant | Updates `selections.yaml`. |
| Reject all variants for one image | Edits that image's brief in `images.yaml`, regenerates just that one. |
| Tweak slide text | Edits `slides.yaml`, rebuilds the draft (no image regen needed). |

Loop until you're happy. Selections persist across rebuilds — unchanged slides keep their pick.
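Persistence across rebuilds can be sketched as a lookup keyed by slide ID: a pick survives as long as its variant label still exists in the new draft. The data shapes and the `carry_over` helper below are hypothetical; only the idea that selections are keyed by slide ID and hold a variant label comes from this README.

```python
def carry_over(selections, available):
    """Keep each previous pick whose variant still exists after a rebuild.

    selections: slide-id -> picked variant label
    available:  slide-id -> variant labels present in the new draft
    """
    kept, needs_review = {}, []
    for slide_id, labels in available.items():
        pick = selections.get(slide_id)
        if pick in labels:
            kept[slide_id] = pick
        else:
            needs_review.append(slide_id)
    return kept, needs_review


selections = {"intro": "gpt-image-2 v02 i00", "ritual": "gpt-image-2 v01 i00"}
available = {
    "intro": ["gpt-image-2 v01 i00", "gpt-image-2 v02 i00"],
    "ritual": ["gpt-image-2 v03 i00"],  # hypothetical: brief rewritten, old variant gone
}
kept, needs_review = carry_over(selections, available)
print(kept)          # {'intro': 'gpt-image-2 v02 i00'}
print(needs_review)  # ['ritual']
```

The same lookup shows why a renamed slide ID loses its pick: the old key no longer matches anything in the new draft.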

6. The final deck

You ask the agent to build the final deck.

build/final.pptx is rendered with the picked variant per slide, composed onto your corporate template. Layouts, masters, fonts, colors, decorative shapes — all inherited from the template. The deck looks native to your brand.

If you have final.pptx already open in PowerPoint, the agent writes to final-updated.pptx instead so nothing is lost.
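The fallback can be sketched as a small path decision. The lock check below is a best-effort assumption: on Windows, opening a file that PowerPoint holds open for writing typically raises PermissionError, but the skill may detect this differently. The function names are hypothetical.

```python
from pathlib import Path


def is_locked(path):
    """Best-effort check: try to open the file for appending without writing."""
    if not path.exists():
        return False
    try:
        with open(path, "a"):
            return False
    except PermissionError:
        return True


def choose_output(build_dir, locked=is_locked):
    """Return final.pptx, or final-updated.pptx when final.pptx is locked."""
    final = Path(build_dir) / "final.pptx"
    return final.with_name("final-updated.pptx") if locked(final) else final


# Simulate both situations by injecting the lock check.
print(choose_output("build", locked=lambda p: False).name)  # final.pptx
print(choose_output("build", locked=lambda p: True).name)   # final-updated.pptx
```

Passing the lock check in as a parameter keeps the decision testable without needing PowerPoint running.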

What lives in each file

talks/
  my-talk/
    story.md          # the narrative, in prose
    slides.yaml       # ordered slides: layout, title, bullets, image_ref, notes
    images.yaml       # image briefs: house style + per-image + variations
    theme.yaml        # which corporate template + brand tokens to use
    selections.yaml   # your picked variant per slide
    build/
      images/         # generated images (cached)
      draft.pptx      # all variants — for review
      final.pptx      # the deck you present

Everything except build/ is text you can read, version-control, and hand off. The agent is the one that writes these files, but you can always open them and read what's there.

Using your corporate template

Your corporate .potx or .pptx template is the structural backbone of the deck — not just a stylesheet. The agent opens it as the starting PowerPoint file and adds your slides on top of it. That means everything your brand team baked into the template comes through automatically:

Slide masters and layouts — including all decorative shapes, logo placements, footers, page-number positions.
Theme — color scheme, font scheme, format scheme.
Default text styles — heading and body fonts, sizes, paragraph spacing.
Slide dimensions — 16:9, 4:3, custom A4, whatever the template uses.
Background graphics on layouts — e.g. the wireframe globe in the corner of a "section" layout, the giant quote-mark glyph behind a "quote" layout. These show up automatically on the right slide kinds because the agent picks the matching template layout by name.

Open the resulting final.pptx in PowerPoint and look at View → Slide Master: you'll see your template's masters there, untouched.

Where to put the template

Drop the .potx or .pptx anywhere on disk. For a shared presentation repo, we suggest pptx-assets/design-templates/. Both .potx (PowerPoint template) and .pptx (regular presentation) work; .potx is normalized internally before use.

Telling the agent about it

Point theme.yaml at the template path and map your logical layout names to the layout names that exist in the template. The agent will help you discover the names (open the template in PowerPoint → View → Slide Master, or ask the agent to inspect it):

template: ../../pptx-assets/design-templates/your-corporate-template.potx

layouts:
  cover:              "Title Slide 1"
  section-divider:    "Section Slide 1"
  bullets:            "Title and Content"
  bullets-with-image: "Photo Slide 1"
  full-bleed-image:   "Photo full bleed lower title"
  quote:              "Quote"
  two-column:         "Two Column Bullet text"
  image-grid:         "Three Filmstrip Photos"
  # ... only the layouts you actually use need a mapping

metadata:
  author: "you@company.com"

If you prefer Windows-style separators, this template line works too:

template: ..\..\pptx-assets\design-templates\your-corporate-template.potx

You only need to map the layouts your deck actually uses. The left-hand side is our fixed vocabulary (cover, bullets, quote, …); the right-hand side is the exact layout name as it appears in your template — names must match character-for-character. Both / and \ work in theme.yaml paths on Windows.
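Two checks follow from these rules: the template path should resolve the same way with either separator, and every mapped layout name must match a template layout exactly, including case. The sketch below assumes the template's layout names have already been read into a list; the helper names are hypothetical, and this is not the skill's own validation code.

```python
from pathlib import PurePosixPath, PureWindowsPath


def normalize_template_path(raw):
    """Accept either separator style and return a forward-slash path."""
    # PureWindowsPath treats both "/" and "\" as separators.
    return PurePosixPath(PureWindowsPath(raw).as_posix())


def unmatched_layouts(mapping, template_layout_names):
    """Return logical names whose mapped layout is not in the template."""
    names = set(template_layout_names)
    return sorted(logical for logical, name in mapping.items() if name not in names)


print(normalize_template_path(r"..\..\pptx-assets\design-templates\t.potx"))
# ../../pptx-assets/design-templates/t.potx

mapping = {"cover": "Title Slide 1", "quote": "quote"}
in_template = ["Title Slide 1", "Quote", "Title and Content"]
print(unmatched_layouts(mapping, in_template))  # ['quote'] because "quote" != "Quote"
```

A non-empty result points at the mappings to fix in theme.yaml before the final build.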

What if I don't supply a template

Builds still work, using PowerPoint's plain default master. No brand, no decorations, generic fonts and colors. Fine for a smoke test; not what you want to present from. Always wire up a real template before building the final deck.

A few principles worth knowing

The PPTX is disposable. If you hand-edit final.pptx, your edits are lost on next build. Tell the agent to change the spec instead.
Slide IDs are stable. Selections are keyed by slide-id, so don't ask the agent to rename one. Edit the content of a slide freely.
Iterate cheaply. Tweaking text doesn't re-run image generation. Tweaking one image doesn't re-run the others. The agent has commands for both.
Be opinionated about house style. Vague briefs in images.yaml produce vague images. Spend time on common_requirements.

License

MIT — see LICENSE.

The pptx-render skill is a verbatim vendored copy of microsoft/hve-core (also MIT, © Microsoft Corporation) — see skills/pptx-render/NOTICE and skills/pptx-render/LICENSE-microsoft for provenance.

This repo does not vendor Anthropic's pptx skill. That skill's license forbids redistribution outside Anthropic's services.
