DHMS Agent Harness v0.3.2 — Reproducibility Package
DHMS Agent Harness v0.3.2 - Reproducibility Package
Overview
DHMS Agent Harness v0.3.2 adds reproducibility packaging for the v0.3.1
mock/local multi-case report. External developers can clone the repository and
reproduce the multi-case report without OpenClaw, DeepSeek, provider API keys,
or real agent execution.
This release is mock/local only. No new real OpenClaw or DeepSeek confirmations
were run for this release.
v0.3.2 builds on:
v0.2.1-agent-harness-evidence-seal- evidence-sealed prototypev0.3.1-schema-report-polish- schema and report polish
Reproduction Command
Run from the repository root:
python3 cli.py test-agent-suite \
--suite cases/agent_core \
--run-all-cases \
--mock-agent \
--report \
--output reports/reproducibility/v0.3.1_mock_all_casesReference Artifacts
The reproducibility package includes:
docs/reproducibility/v0.3.1-mock-local-multicase.mddocs/reproducibility/artifacts/v0.3.1_mock_all_cases/execution_summary.jsondocs/reproducibility/artifacts/v0.3.1_mock_all_cases/suite_agent_report.md
Only lightweight reference artifacts are committed. The package does not commit
HTML output, logs, secrets, or real OpenClaw/DeepSeek outputs.
Expected Reproduction Summary
The mock/local run should report:
total_cases=6taxonomy_summary:A=5,B=1,C=0execution_summary.jsonexistssuite_agent_report.mdexists- no real tool execution
- no side effects
Validation Scope
v0.3.2 validation is mock/local only. It does not require a real model, API key,
OpenClaw, DeepSeek, or a real LLM Judge.
Limitations
This release does not claim:
- new real model validation
- new real OpenClaw or DeepSeek confirmations
- full-suite production validation
- production certification
- multi-model certification
- system-level sandbox proof
- real LLM Judge validation
- HTTP Adapter availability
No real LLM Judge was used, and the HTTP Adapter remains not implemented.
Release Status
Tag: v0.3.2-reproducibility-package