Idea Execution Framework

Vision

Make executing ideas with an AI efficient and transparent, with plans, decisions, artifacts, and learnings versioned and human‑auditable. The AI Agent collaborates with the human to clarify the idea and drive execution in observable steps.

Maybe for some ideas human involvement can be minimized in later phases. For example:

high involvment during early idea elaboration
lower during planning/prototypin
low during execution within agreed scope

But this may depend on the idea.

Core Principles

Git repo as shared project state/blackboard.
Persist all plans, decisions, artifacts, and learnings in the repo.
Human provides the vision and makes high-level decisions, AI Agent collaborates.
Memory is in repo: both working artifacts and distilled interaction learnings live as files here.
Repo is the single source of truth and synchronization medium; plans and specs live here and drive delegation.
Skills explains to Agent how to perform certain types of tasks (see next chapter)

Operating Model

Roles

Human: sets idea, constraints, core principles, requirements, approves major decisions, provides clarifications
AI Agent: a critical partner who helps think through and clarify the idea, expands high-level ideas into actionable detail, turns ideas into plans, executes tasks, and keeps the repo in sync.
At certain points, tasks may be executed in parallel by different agents, potentially using different LLM models or CLIs, each with their own context or specialization (e.g., coding, ops, research).

Agent

Agent is started from the IDE extension (e.g. Github copilot) or CLI (Codex, Claude Code or Github) from devcontainer

Skills

Skill Name	Description
Browser usage	When agent needs real browser to do a task it use playwright mcp (later try try conver mcp to cli)
Exection of task at given time	Use github action that call agent CLI with a task to do
Email	TODO

Iteration rhythm

Updates to the repo happen whenever useful (often after meaningful exchanges) and at least once per focused work cycle to keep the repo the single source of truth.
Phases are flexible and can overlap:
- Phase A – Idea Elaboration (high human involvement): Human states goal/idea; AI Agent probes, challenges, and expands; fast back-and-forth in README; capture assumptions and decisions.
- Phase B – Planning & Prototyping (moderate human involvement): Agent drafts approaches, proposes tasks, executes small spikes; human reviews key choices.
- Phase C – Execution (low human involvement): Agent works autonomously within agreed scope; updates repo regularly; escalates only for major decisions or boundary changes.

Agent work loop

flowchart TD
	A@{ shape: circle, label: "Start" } --> B[Pick first task from Agent TODO]
	B --> C[Work on task]
	C -->|Finished or cannot handle| E[Read memory and think about the next move]
	C -->|Needs clarification| H[Ask Human]
    E -->|Needs clarification| H
	E --> G{Cycles < x?}
	G -->|Yes| B
	G -->|No| I[Human review]

Work on the first task from the Agent TODO list until either:
- the task is finished, or
- the agent encounters a problem it cannot handle. In both cases, update memory (Interaction Log, Learnings, and Decisions if applicable).
Read memory and think about the next move - update memory including TODO
Repeat steps 1–2 up to x cycles, then return status to human.

At any point, the agent may decide that further work requires a clarifying question to the human.

Repository Structure & Memory

Start README-first. Add additional files and folders only when the project grows; keep the repo the single source of truth.

README.md
- vision
- core principles
- skills
- operating model
- repository structure & memory
- iteration rhythm
- TODOs (Lead Agent, Human)
- roadmap
- decision notes (template)
- learnings (template)

Memory approach: we keep Decisions, Learnings, and an Interaction Log inline here first to maximize transparency and minimize overhead. When those sections grow, we’ll split them into DECISIONS.md, LEARNINGS.md, and LOG.md. Retrieval isn’t needed right now—we rely on long‑context models over the repo files (or the whole repo when practical). Summarization may help with periodic roll‑ups later, but it’s intentionally deferred for now.

Decisions

Template: Date – [Context/Question] → Decision: [What was decided]. Rationale: [Why]. Impact: [What this affects].

Learnings

Template: Date – [What happened/was tried] → Learning: [What we discovered]. Application: [How this changes our approach].

Interaction Log

2025-09-30 – Designed the high-level Agent work loop and added a Mermaid diagram under “Iteration rhythm”. Outcome: keep the loop intentionally high-level, rely on LLM judgment for “cannot handle,” and cap cycles to x (default 5); README updated accordingly.

TODO – Lead Agent

How to give browser to agent?

TODO – Human

Review if framework is ready for Phase B (Planning & Prototyping)
- Think what should be the structure of the projects that will be used to execute ideas according to the Idea Execution Framework
Consider applying framework to a second small project for additional validation

Roadmap

Phase 1: Execute Phase A for "Idea Execution Framework" (this repo is the pilot) till Phase B or till Phase 2 below.
Phase 2: When the framework feels solid, start a second project executed according to it.

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
.devcontainer		.devcontainer
AGENTS.md		AGENTS.md
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Idea Execution Framework

Vision

Core Principles

Operating Model

Roles

Agent

Skills

Iteration rhythm

Agent work loop

Repository Structure & Memory

Decisions

Learnings

Interaction Log

TODO – Lead Agent

TODO – Human

Roadmap

About

Uh oh!

Releases

Packages

marcingurbisz/idea-execution-framework

Folders and files

Latest commit

History

Repository files navigation

Idea Execution Framework

Vision

Core Principles

Operating Model

Roles

Agent

Skills

Iteration rhythm

Agent work loop

Repository Structure & Memory

Decisions

Learnings

Interaction Log

TODO – Lead Agent

TODO – Human

Roadmap

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Packages