Releases: praneethtota/Adaptive-Utility-Agent
Releases · praneethtota/Adaptive-Utility-Agent
v1.2.0
Resilience, security, the extended plugin system, and the operations toolkit
(#37–#55 block), plus a full pre-release audit.
Added
- Retry with exponential backoff (#39): per-specialist transport-level
retry onConnectError/ReadTimeout/429/502/503/504 with configurable
base_delay_ms,max_delay_ms,±25%jitter, andretryable_status_codes.
Non-transient codes (400/500) are never retried.max_retries: 0disables it. - Circuit breaker per specialist (#37): CLOSED/OPEN/HALF_OPEN state machine
with sliding failure window, automatic HALF_OPEN probe after
recovery_timeout_s, and per-specialist status atGET /health/ready. - Degraded-mode failover (#38): open circuits are excluded from routing;
responses carrydegraded_modeanddegraded_specialistsso callers can
detect partial availability. - Bearer token auth wiring:
security.auth_enabledactivates HMAC-SHA256
token verification middleware (15 scopes, revocation). Public endpoints
(health/docs/version) pass through; zero overhead when disabled. - mTLS wiring:
security.mtls.key_file/cert_file/ca_fileare passed to
uvicorn; presence ofca_filerequires client certs (mutual TLS). - Extended plugin system (#51): four new Protocol interfaces —
ContradictionDetectorPlugin,AssertionStorePlugin,
RoutingStrategyPlugin,ScoringComponentPlugin. - Custom utility function (#53):
FullUtilityScorerPlugin.score_full()
bypasses the linear form (Axiom A5) for quadratic, multiplicative,
Cobb-Douglas, Rawlsian-min, and threshold-gate models. - Extended middleware (#52):
on_chunk(SSE token interception),
before_batch/after_batch, andon_errorhooks onAUAMiddleware. - Compatibility matrix (#55): model-format × hardware × backend matrix in
aua/compat.py;aua doctorcheck group 6 andaua doctor --compat-matrix. - Operations toolkit:
aua testbuilt-in suites (#54),aua loadtest
(#50), persistent batch queue (#56), model registry + version pinning (#46),
experiment tracking via MLflow/W&B (#47), shadow mode (#48), regression gate
(#49), multi-tenancy isolation (#44). - ArbiterAgent live pipeline: the four-check arbitration (logical,
mathematical, cross-session, empirical via SymPy/arXiv/PubMed) is now the
default; a simplified LLM-only path is available viaarbitration_mode: "llm". - tau softmax routing and T_min promotion gate wired into the router.
- Hardware tiers
gaming-pcandh100-clusteradded.
Fixed
- Version source of truth corrected to 1.2.0 (was 1.1.0).
arbitration_modeis now validated at config load time (was accepted
silently; only the runtime PATCH endpoint validated).- Retry and circuit-breaker numeric fields are validated at load time
(max_retries >= 0,max_delay_ms >= base_delay_ms, thresholds>= 1). infer_model_format()is backend-aware: Ollama/llama.cpp tags without a
suffix resolve to GGUF (previously every Ollama user — including the default
aua init --tier macbookscaffold — saw "model format unknown" warnings).- Normalised the
aua.versionschema field across tier templates. - Packaging: explicit
aua/templates/prompts/*.txtinclude; removed stale
committed build artifacts.
Install / upgrade:
pip install --upgrade "adaptive-utility-agent==${GITHUB_REF_NAME#v}"
pip install --upgrade adaptive-utility-agentalso works — this tag is marked latest.
v1.1.0
The AUA-Veritas production backport plus the completed expert path.
Added
- Persistence & search (V-P1.1): message-level keyword search with async
background indexing, startup backfill, and DB fallback (GET /search,
POST /conversations/{id}/messages) - Context backups (V-P1.2/1.4): 6-section structured handoff notes,
token/message/time-gap triggers, 6-hour coverage job
(POST /context/backup/run-coverage-job) - Correction lifecycle (V-P1.3/2.1/2.4): explicit
correction:prefix,
implicit detection with Accept/Reject (POST /corrections/confirm-implicit),
CRUD + evidence history (PATCH/DELETE /corrections/{id},
GET /corrections/evidence), arbiter findings surfaced asreview_notes - Self-maintenance (V-P1.5/1.6/2.3/3.1): crash sentinel + auto-reporting,
remote model config with remote→cache→builtin fallback, update management
(GET /version/check,POST /update/skip), structured bug reports
(POST /bug-report) - Analytics suite (V-P2.2):
GET /analytics,/reliability,/usage,
/pricing - Projects & local models (V-P3.2/3.3): conversation grouping,
Ollama-class model registration and specialist tagging - Dynamic domain ontology (V-P3.4): 10 L0 roots, alias map + edit-distance
resolution, 4-gate candidate promotion, hourly maintenance job
(GET /domain-tree) - Session IDs (#15): session/trace/request IDs on every request —
client-supplied honored, UUIDs generated, returned as headers on every
response, propagated to specialists/hooks/audit/logs - Secrets (#19):
secrets:config block (env|vault|aws|gcp) and live
Vault + AWS Secrets Manager integration tests in CI - YAML extension wiring (F-09/F-10/F-11):
plugins:,hooks:,
middleware:,state:, andsecurity:config blocks now parse with
strict validation and wire at startup;GET /extensionsreports what the
running server loaded - Tutorial: Concepts section, bring-your-own-model walkthrough, complete
config reference, troubleshooting guide, How-to 18 (production ops)
Fixed
- Audit log writes failed silently (missing
request_id/routing_mode
columns) POST /projectsfailed on an injectedidcolumn- Keyword extraction dropped years/numbers (dead code path)
- Crash reporter could self-report the current session
- Hook YAML format and 9 plugin constructor examples in the tutorial matched
a contract the loader never had
Install / upgrade:
pip install --upgrade "adaptive-utility-agent==${GITHUB_REF_NAME#v}"
pip install --upgrade adaptive-utility-agentalso works — this tag is marked latest.