Release v0.1.1 for templates/template_sia.
Publication
- Version: 0.1.1
- GitHub release: https://github.com/docxology/template_sia/releases/tag/v0.1.1
- DOI: https://doi.org/10.5281/zenodo.20693012
- Zenodo: https://zenodo.org/records/20693012
- PDF SHA-256:
e350973a772b52bd6971747464fa40b671a8b3a0ffb2e3a867201defc340d7f5
Abstract
Abstract
This exemplar documents template_sia, a deterministic implementation of the Self-Improvement Agent (SIA) harness contract described in . The default pipeline replays fixture-backed generations for the mini_classify task; opt-in live mode runs bounded target subprocesses and optional Ollama-backed meta/feedback steps.
Run snapshot. Task mini_classify, run 1, 3 generation(s), live=false. Final accuracy=0.8333 over 6 held-out samples. Values are injected by scripts/z_generate_manuscript_variables.py after analysis.
Keywords: self-improvement agents, benchmark harness, reproducible evaluation, agent loops