chore(release): agent-runtime 0.45.0#176
Merged
Merged
Conversation
Cuts the 58-commit backlog on main into a published release. Headline surface: - runToolLoop / streamToolLoop — bounded turn-level tool-dispatch loop (#137) - RSI agent tree: recursive Agent.act, Supervisor keystone, runProgram, the adaptive-driver channel (#139/#151/#165) - optimization API collapsed onto agent-eval selfImprove; the runtime keeps the CODE-surface ImprovementDriver you pass as driver (#172) - deployable benchmark adapters: AppWorld, commit0, aec-bench, EnterpriseOps-Gym; runBenchmarks over one ADAPTERS registry (#153/#156/#157) - agent-eval floor raised to >=0.83.0 (#175)
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Cuts the 58-commit backlog on
main(v0.44.0..HEAD) into a published release. This single release unblocks three things at once:runToolLoop/streamToolLoop(feat: bounded turn-level tool-dispatch loop (runToolLoop / streamToolLoop) #137) ship, so agent-app's hand-rolledrunAppToolLoopcollapses to a 1:1 re-export.Agent.act/Supervisor machinery +improvementDriver(the optimization API collapse onto agent-evalselfImprove, refactor(improvement): collapse optimization API onto agent-eval selfImprove #172) become importable socompareDriverscan race baseline-vs-RSI.Headline surface
runToolLoop/streamToolLoop— bounded turn-level tool-dispatch loop (feat: bounded turn-level tool-dispatch loop (runToolLoop / streamToolLoop) #137)Agent.act, Supervisor keystone,runProgram, adaptive-driver channel (feat(loops): runtime steer-firewall + dynamicLoopRunner analyze forwarding (RSI Gen-1) #139/feat(loops): recursive execution atom — budget-conserving Scope + Supervisor keystone #151/feat(runtime): gen-6 architecture consolidation — one recursive agent tree, observable + deep-cleaned #165)selfImprove; runtime keeps the CODE-surfaceImprovementDriver(refactor(improvement): collapse optimization API onto agent-eval selfImprove #172)runBenchmarksover one ADAPTERS registry (feat(bench): wire real benchmark adapters — aec-bench, commit0, programbench, appworld + shared harness #153/feat(bench): benchmark unifier — runBenchmarks over one ADAPTERS registry, AgentProfile the only knob #156/feat(bench): EnterpriseOps-Gym adapter — deployable SQL state-checker (the gate's non-coding middle-band domain) #157)>=0.83.0(docs: sync optimization docs to selfImprove + bump agent-eval floor to 0.83 #175)Verification (full publish.yml verify gate, run locally on this tree)
INSTALL(frozen) ✓ LINT ✓ TYPECHECK ✓ TEST ✓ BUILD ✓ VERIFY:PACKAGE ✓Tag
v0.45.0will be pushed after merge to trigger the OIDC publish (version-lock: tag == package.json).