Release v0.1.2 for templates/template_sia.
Publication
- Version: 0.1.2
- GitHub release: https://github.com/docxology/template_sia/releases/tag/v0.1.2
- DOI: https://doi.org/10.5281/zenodo.20932066
- Zenodo: https://zenodo.org/records/20932066
- PDF SHA-256: 6e6d19d04182628bb825471cf8094b5c32d2c491d2c646652ec7e2439ba80773
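The PDF SHA-256 listed above can be used to check that a downloaded copy of the manuscript matches this release. The following is a minimal sketch, not part of template_sia itself: the helper names `sha256_of_file` and `matches_release_digest` are hypothetical, and the path to the PDF is whatever location the file was saved to locally.

```python
import hashlib

# Digest copied from the "PDF SHA-256" entry of this release.
EXPECTED_SHA256 = "6e6d19d04182628bb825471cf8094b5c32d2c491d2c646652ec7e2439ba80773"


def sha256_of_file(path, chunk_size=1 << 20):
    """Return the hex SHA-256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        # iter() with a sentinel stops when read() returns empty bytes (EOF).
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_release_digest(path, expected=EXPECTED_SHA256):
    """Hypothetical helper: True if the file at `path` has the expected digest."""
    return sha256_of_file(path) == expected.strip().lower()
```

Calling `matches_release_digest` with the local path of the downloaded PDF returns `True` only if the file is byte-identical to the one the digest was computed from.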
Abstract
This exemplar documents template_sia, a deterministic implementation of the Self-Improvement Agent (SIA) harness contract described in . The default pipeline replays fixture-backed generations for the mini_classify task; opt-in live mode runs bounded target subprocesses and optional Ollama-backed meta/feedback steps.
Run snapshot. Task mini_classify, run 1, 3 generations, live=false. Final accuracy=0.8333 over 6 held-out samples. These values are injected by scripts/z_generate_manuscript_variables.py after analysis.
Keywords: self-improvement agents, benchmark harness, reproducible evaluation, agent loops