Releases: docxology/template_autoscientists
A Deterministic Testbed for Self-Organizing Agent-Team Coordination (v1.0.1)
Release v1.0.1 for templates/template_autoscientists.
Publication
- Version: 1.0.1
- GitHub release: https://github.com/docxology/template_autoscientists/releases/tag/v1.0.1
- DOI: https://doi.org/10.5281/zenodo.20693001
- Zenodo: https://zenodo.org/records/20693001
- PDF SHA-256:
972bc4e0a82c97a88442ffc4370a17dd3323de96f27f91294dcb19dab3934a29
Abstract
Abstract
Recent work on AutoScientists coordinates self-organizing teams of language-model agents through a small set of shared mechanisms: a champion-and-experiment-log shared state, a registry of retired dead-end directions, effect-size ranking of candidate directions, noise-band confirmation of claimed improvements, and stagnation-driven reorganization of teams. This exemplar provides a deterministic, standalone reference implementation of those mechanisms and studies them honestly as a testbed rather than as a performance claim.
We make the comparison fair by holding the total number of objective evaluations fixed: coordinated teams partition a single sequential experiment budget rather than adding parallel compute. Under that matched budget, coordination cannot — and in our results does not — beat a single-thread baseline on the final champion metric; we report the actual numbers and claim no speedup. What the testbed does demonstrate are two distinct, independently measurable benefits. First, noise-robustness: because the objective is stochastic, a single observed gain can be a draw of evaluation noise, so we separate the reported champion metric from the clean noise-free ground truth and show that noise-band confirmation shrinks the gap between them by roughly an order of magnitude — with confirmation on, the final champion's reported metric sits
AutoScientists v1.0.0 — public exemplar standalone snapshot
Deterministic coordination-mechanism testbed (arXiv:2605.28655): shared champion/experiment-log state, dead-end registry, effect-size ranking, noise-band confirmation, stagnation-driven reorganization. 79 tests, src/ coverage 99.59%, no mocks. Snapshot of the template_autoscientists exemplar from docxology/template (canonical dev home). Double-published: GitHub + Zenodo concept DOI 10.5281/zenodo.20533669 (version 10.5281/zenodo.20533670). MIT License.