v0.3.0-alpha
Pre-release
Pre-release
·
442 commits
to main
since this release
KORA v0.3.0-alpha
This prerelease adds KORA's initial runtime-path benchmark evidence flow.
Highlights
- Initial runtime-path benchmark harness:
python3 -m kora run runtime_integrated_benchmark -- --offline
- Runtime benchmark JSON output path:
python3 -m kora run runtime_integrated_benchmark -- --offline --json-out /tmp/kora_runtime_integrated_benchmark.json
- Markdown evidence packet/report generator:
python3 examples/runtime_integrated_benchmark/report.py --input /tmp/kora_runtime_integrated_benchmark.json --md-out /tmp/kora_runtime_integrated_benchmark.md
- Telemetry-connected summary path:
python3 -m kora telemetry --input /tmp/kora_runtime_integrated_benchmark.json --json-out /tmp/kora_runtime_integrated_benchmark.telemetry.json --md-out /tmp/kora_runtime_integrated_benchmark.telemetry.md
- Reviewer-facing reproduction guide.
- Release-readiness checklist.
- Docs cross-link audit and claim-boundary review.
- Release validation packet and approval checkpoint.
Current bounded benchmark claim
In a reproducible 100-task deterministic-heavy benchmark workload, KORA-controlled execution avoided 80 of 100 simulated model invocations versus a naive direct baseline.
Expected counters
total_tasks: 100deterministic_route_count: 80fallback_or_model_candidate_route_count: 20simulated_baseline_model_invocations: 100kora_controlled_model_invocations: 20avoided_simulated_model_invocations: 80avoided_simulated_model_invocation_rate: 0.8deterministic_outputs_checked: 80mismatch_count: 0runtime_path_execution_status: oktelemetry_event_count: 100
Expected telemetry counters
total_llm_calls: 20events_ok: 100events_fail: 0events_skipped: 0stage_counts: ADAPTER 20 / DETERMINISTIC 80
Non-claims
This prerelease does not claim:
- production cost reduction proof
- real API-cost reduction proof
- production benchmark proof
- full runtime-integrated benchmark evidence
- broad workload superiority proof
- energy reduction evidence
- formal government validation
- signed partner validation
- guaranteed adoption or funding
Artifact policy
No release assets are uploaded for this prerelease. Raw generated benchmark JSON/Markdown artifacts are not uploaded. Generated outputs should be reproduced locally in /tmp or another user-provided output path.