Mini-Coding-Agent

This folder contains a small standalone coding agent:

code: mini_coding_agent.py
CLI: mini-coding-agent

It is a minimal local agent loop with:

workspace snapshot collection
stable prompt plus turn state
structured tools
approval handling for risky tools
transcript and memory persistence
bounded delegation

The model backend is currently based on Ollama.

Stay tuned for a more detailed tutorial to be linked here

Requirements

You need:

Python 3.10+
uv
Ollama installed
an Ollama model pulled locally

This project has no Python runtime dependency beyond the standard library. uv is only used for environment management and the CLI entry point.

Install Ollama

Install Ollama on your machine so the ollama command is available in your shell.

Official installation link: ollama.com/download

Then verify:

ollama --help

Start the server:

ollama serve

In another terminal, pull a model. Example:

ollama pull qwen3.5:4b

Qwen 3.5 model library:

ollama.com/library/qwen3.5

The default in this project is qwen3.5:4b. If you have sufficient memory, it is worth trying a larger model such as qwen3.5:9b or another larger Qwen 3.5 variant. The agent just sends prompts to Ollama's /api/generate endpoint.

Project Setup

Clone the repo or your fork:

git clone https://github.com/rasbt/mini-coding-agent.git
cd mini-coding-agent
uv sync

If you forked it first, use your fork URL instead:

git clone https://github.com/<your-github-user>/mini-coding-agent.git
cd mini-coding-agent
uv sync

That creates the local environment and installs the CLI entry point.

Basic Usage

Start the agent:

cd mini-coding-agent
uv run mini-coding-agent

The mini-coding-agent command is the CLI entry point for the project. Run it without a prompt to open the interactive REPL, or pass a quoted prompt to run a single request and exit.

Examples:

uv run mini-coding-agent
uv run mini-coding-agent "Summarize the repository layout."
uv run mini-coding-agent --cwd /path/to/project --approval auto

By default it uses:

model: qwen3.5:4b
approval: ask

For a concrete usage example, see EXAMPLE.md.

Approval Modes

Risky tools such as shell commands and file writes are gated by approval.

--approval ask prompts before risky actions (default and recommended)
--approval auto allows risky actions automatically (convenient but riskier)
--approval never denies risky actions

Example:

uv run mini-coding-agent --approval auto

Resume Sessions

The agent saves sessions under the target workspace root in:

.mini-coding-agent/sessions/

Resume the latest session:

uv run mini-coding-agent --resume latest

Resume a specific session:

uv run mini-coding-agent --resume 20260401-144025-2dd0aa

Interactive Commands

Inside the REPL, slash commands are handled directly by the agent instead of being sent to the model as a normal task.

/help shows the list of available interactive commands
/memory prints the distilled session memory, including the current task, tracked files, and notes
/session prints the path to the current saved session JSON file
/reset clears the current session history and distilled memory but keeps you in the REPL
/exit exits the interactive session
/quit exits the interactive session; alias for /exit

Main CLI Flags

uv run mini-coding-agent --help

CLI flags are passed before the agent starts. Use them to choose the workspace, model connection, resume behavior, approval mode, and generation limits.

Important flags:

--cwd sets the workspace directory the agent should inspect and modify; default: .
--model selects the Ollama model name, such as qwen3.5:4b; default: qwen3.5:4b
--host points the agent at the Ollama server URL (usually not needed); default: http://127.0.0.1:11434
--ollama-timeout controls how long the client waits for an Ollama response (usually not needed); default: 300 seconds
--resume resumes a saved session by id or uses latest; default: start a new session
--approval controls how risky tools are handled: ask, auto, or never; default: ask
--max-steps limits how many model and tool turns are allowed for one user request; default: 6
--max-new-tokens caps the model output length for each step; default: 512
--temperature controls sampling randomness; default: 0.2
--top-p controls nucleus sampling for generation; default: 0.9

Example

See EXAMPLE.md

Notes & Tips

The agent expects the model to emit either <tool>...</tool> or <final>...</final>.
Different Ollama models will follow those instructions with different reliability.
If the model does not follow the format well, use a stronger instruction-following model.
The agent is intentionally small and optimized for readability, not robustness.

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
tests		tests
.gitignore		.gitignore
EXAMPLE.md		EXAMPLE.md
LICENSE		LICENSE
README.md		README.md
mini_coding_agent.py		mini_coding_agent.py
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Mini-Coding-Agent

Requirements

Install Ollama

Project Setup

Basic Usage

Approval Modes

Resume Sessions

Interactive Commands

Main CLI Flags

Example

Notes & Tips

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Mini-Coding-Agent

Requirements

Install Ollama

Project Setup

Basic Usage

Approval Modes

Resume Sessions

Interactive Commands

Main CLI Flags

Example

Notes & Tips

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages