Skip to content

chore(release): v0.5.1 — finalize LLM-reduction roadmap (#33)#52

Merged
wesleysimplicio merged 1 commit into
masterfrom
claude/eloquent-ptolemy-0Y7d2
May 31, 2026
Merged

chore(release): v0.5.1 — finalize LLM-reduction roadmap (#33)#52
wesleysimplicio merged 1 commit into
masterfrom
claude/eloquent-ptolemy-0Y7d2

Conversation

@wesleysimplicio
Copy link
Copy Markdown
Owner

Resumo

Release v0.5.1 que fecha o roadmap de redução de dependência de LLM (#33). As 4 alavancas estão implementadas, testadas e em produção; esta release marca formalmente o fechamento e separa a validação empírica restante.

Publicado no PyPI: https://pypi.org/project/simplicio-cli/0.5.1/

O que muda

Alavancas entregues (#33)

Alavanca Onde
D — content-addressed completion cache simplicio/_cache.py, simplicio cache stats|clear
C — static verify-loop fixers simplicio/pipeline_fixers.py
A — declarative plan recipes simplicio/scratch/recipes.py
B — mechanical task executors (py/ts/go/rust/php) simplicio/scratch/codegen/*

Validação

  • ruff check nos arquivos tocados → passou
  • pytest tests/python/test_package_metadata.py2 passed
  • python -m build → sdist + wheel 0.5.1
  • twine check (packaging 26.x) → PASSED (wheel + sdist)
  • wheel instalado em venv limpo → __version__ == 0.5.1, importlib.metadata.version == 0.5.1
  • Publicação PyPI confirmada (latest = 0.5.1, simple index lista whl + tar.gz)

A suíte completa (~489 testes) depende de stacks ML pesadas (torch/sentence-transformers) não instaladas neste ambiente; rodei o subconjunto de metadata/packaging diretamente afetado pelo bump + o build/twine. O código de produção é idêntico ao já publicado e testado em 0.5.0 (mesmo commit base), então o diff de risco é só versão + changelog.

Validação empírica diferida

O gate restante de release — baseline real de 50 goals com codegen desligado (B/codegen pass-rate + latência) — é caro (chamadas LLM reais, horas) e foi separado em #51, conforme a própria recomendação de #33. O audit local (bench/results_issue_closure_audit.*) continua reportando esse gate como aberto por design — nenhuma evidência parcial é declarada completa.

Refs #33, #51.

https://claude.ai/code/session_01By9Mb3TTWTukAGLfUFt9JH


Generated by Claude Code

All four LLM-reduction levers ship and are covered by the test suite:
- D content-addressed completion cache
- C static verify-loop fixers
- A declarative plan recipes
- B mechanical task executors (python/typescript/go/rust/php)

The remaining empirical 50-goal codegen-disabled LLM baseline is split
into a dedicated release-validation issue and does not block this
release. The repo-local closure audit still reports that gate as open by
design, so partial evidence is never claimed as complete.

https://claude.ai/code/session_01By9Mb3TTWTukAGLfUFt9JH
@wesleysimplicio wesleysimplicio marked this pull request as ready for review May 31, 2026 10:32
@wesleysimplicio wesleysimplicio merged commit 29cca87 into master May 31, 2026
1 check passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants