Releases · SuarezPM/apohara-compliance

09 Jun 20:02

v9.9.9-rerun-1.4.0-1781035154

4bb9d0c

v9.9.9-rerun-1.4.0-1781035154

chore(release): v1.4.0 prep — bump version + CHANGELOG (entries for v…

Assets 15

09 Jun 22:44

github-actions

v2.3.0

c748f3b

v2.3.0 Latest

Latest

v2.3.0 — Argument-Value Provenance (ADR-7)

A causal proxy (post-hoc, verbatim-flow) for injection->consequence
detection. ADDITIVE, OPT-IN, BYTE-IDENTICAL passthrough when the new
flag is empty.

Headline (TEST split, 192 positives, FROZEN dev/test, PREREG-verified):
  - v2.2 corr: 138/192 (71.9%)
  - v2.3 -P:   100/192 (52.1%)  (delta = -38, the FP-killer result)
  - FAILED FP: 28.7% -> 13.9%  (halved)
  - BENIGN FP: 5/352 -> 0/352  (all killed)

The 52.1% is a POST-HOC SUBSTRING-MATCH PROXY, not causation. See
docs/adr/ADR-7-argument-value-provenance.md for the full honest ceiling.

Refs:
  - tests/corpus/PREREG-v2.3.md (frozen, SHA 5e62e9e2... UNCHANGED post-scan)
  - tests/corpus/PROOF-v2.3-argument-value-provenance.md
  - tests/corpus/v2.3-argument-value-provenance-report.json (schema-validated)
  - BENCHMARK.md v2.3 section
  - CHANGELOG.md [2.3.0] entry

Assets 15

apohara-compliance-scanner-aarch64-apple-darwin

sha256:a6162a44e37ca3541d25c3aa0937ebe288889ab44f5a2bc89135d67def383195

4.23 MB 2026-06-09T22:44:35Z
apohara-compliance-scanner-aarch64-apple-darwin.pem

sha256:6bb784943f8ab1d769095fbbd80fedaa710486c6b55442dee115cd10a7b1e249

3.34 KB 2026-06-09T22:44:35Z
apohara-compliance-scanner-aarch64-apple-darwin.sig

sha256:2c94a7b25521efd344f3e7be8c08976040cf06fec6457e3b48b32a4a350073c9

96 Bytes 2026-06-09T22:44:35Z
apohara-compliance-scanner-x86_64-apple-darwin

sha256:653569136258715569c502f5db6ef9fd94b04fee6446b78bb3ab76587867a37a

4.48 MB 2026-06-09T22:44:35Z
apohara-compliance-scanner-x86_64-apple-darwin.pem

sha256:d13a8462b4f36ef279081d1714b7739cc4ed8f5bb499aff78b11ebc89810f011

3.34 KB 2026-06-09T22:44:35Z
apohara-compliance-scanner-x86_64-apple-darwin.sig

sha256:66a54e3903c4545f85e7d866af319336a5c82f50d4e4c453ba38d496ef2a4d78

96 Bytes 2026-06-09T22:44:35Z
apohara-compliance-scanner-x86_64-pc-windows-msvc.exe

sha256:3c07a46c9a9966f4f4456e47807fc6e0c509b89da5b300f073405a61880adc6e

4.03 MB 2026-06-09T22:44:35Z
apohara-compliance-scanner-x86_64-pc-windows-msvc.exe.pem

sha256:20edd3c6e2ece8a6923ebd3ce49784141adb1f8996be04a7e8b63b87977d76bd

3.34 KB 2026-06-09T22:44:35Z
apohara-compliance-scanner-x86_64-pc-windows-msvc.exe.sig

sha256:4e453225139a939f817aec27a3f73b1ef2b017384218a0f6c21ca5067b7f327f

96 Bytes 2026-06-09T22:44:35Z
apohara-compliance-scanner-x86_64-unknown-linux-gnu

sha256:566678a484f00bed5d70228b01cdae55e1b9a0d41a9fbca9b1785827025f9fb8

5.27 MB 2026-06-09T22:44:35Z
Source code (zip)

2026-06-09T22:40:38Z
Source code (tar.gz)

2026-06-09T22:40:38Z

09 Jun 21:09

SuarezPM

v2.2.0

a61f832

apohara-compliance v2.2.0 — Real-Trajectory Efficacy (ADR-6, bound triple)

v2.2.0 — Real-Trajectory Efficacy (ADR-6, bound triple)

Additive — no scanner change; the engine is run with the same frozen rules
(blob SHA dcd1ac6, frozen BEFORE scanning) over a corpus of real successful
indirect-injection trajectories from last-generation frontier models, and a
live current-frontier cross-check. The number is reported as a bound triple

its representation overlap-miss, and the correlation-not-causation ceiling
is stated as a co-headline of equal prominence.

Added

Eval harness (scripts/eval/wrap_agentdojo_trace.py + friends) that
transcribes AgentDyn traces via an apohara-agnostic wrapper to the REAL
release binary — never the scanner crate, never the rules, never the wrapper
(the measurement is BY construction, not a fit).
Download bound triple on AgentDyn (5353cf7, agentdojo 0.1.35, benchmark
v1.2.2; attack important_instructions; last-gen models, date-labeled;
open-ended suites): post-hoc AGT-TRJ detection on 236 real successes
169 / 236 (71.6 %); failed-injection (RESISTED) FP 659 / 2295
(28.7 %); benign FP 5 / 352 (1.4 %) ⇒ precision-on-success
169/833 ≈ 20 %.
Live current-frontier cross-check via OpenRouter (suite workspace, attack
important_instructions_no_model_name; same frozen rules + wrapper + binary;
current-frontier models, date-labeled): attack-success TOTAL
0 / 80 (0.0 %) — each model 0 / 16; live post-hoc detection 0 / 0 —
UNDEFINED; failed-injection FP 0 / 80; benign FP 0 / 15. Real
usage: 224 API calls, all HTTP 200; 698,959 tokens (under the 1 M cap);
key never logged.
Overlap-miss (model-independent, 236 positives): marker <information>
covered 232/236; role-mapped structured sink covered 180/236; BOTH
178/236; NEITHER 2/236. Covered sink roles: url=170, recipient=60, amount=59, command=34. MISSED arg-keys (OUTSIDE the frozen role map — the
iban-analog): path (161), subject (114), otp (87), title (79), body (68), recipients (68), repo_name (54), password (33). Reported as-is, NEVER
closed — a retro-fit converts the measurement into a fit.
Reports (strict-schema-validated, numbers/IDs-only — no example text):
- tests/corpus/v2.2-real-trajectory-report.json (the bound triple +
  live usage, validated by scripts/eval/validate_v22_report.py and wired
  into scripts/verify.sh).
PREREG + PROOF (committed):
- tests/corpus/PREREG-v2.2-real-trajectory.md (rules frozen at
  dcd1ac6e1d7ed8dce4b5b516296e8ce5a3e0582a BEFORE any scan; verified
  unchanged post-scan).
- tests/corpus/PROOF-v2.2-real-trajectory.md.
CAVEAT (stated): the live run used suite=workspace (the standard
AgentDojo suite), NOT AgentDyn's harder open-ended suites (shopping /
github / dailylife) where last-gen models reached 14–22 % ASR — because
the current-frontier OpenRouter IDs are not in AgentDyn's model registry.
So the live 0/80 is on the easier standard suite; current-frontier
behaviour on the harder open-ended attack is UNMEASURED (a documented
follow-up).

Notes

Honesty invariants unchanged: every finding is is_candidate: true, every
formatter line is CANDIDATE — prefixed, SARIF level is never error.
The single-action engine is byte-identical to v2.1; the additive trajectory
pass is unchanged. The synthetic precision/recall gate still
1.0000 / 1.0000 / FP = 0; the AgentDojo prose-rule recall still
23 / 35 (0.657); the AGT-TRJ rules fire on the synthetic positive and
zero on the FinBot negative control.

Claim ceiling (verbatim, ADR-6)

"deterministic, post-hoc, representation-aware injection → consequence
CANDIDATE CORRELATION surfacer; mechanism + representation proven on
synthetic positives; post-hoc recognition MEASURED on real successful
trajectories (169/236, last-gen open-ended) with an explicit model-independent
overlap-miss; ALSO fires on resisted (28.7 %) + benign (1.4 %) — a correlation
surfacer, NOT a success / causation discriminator (precision-on-success ≈
20 %); NOT efficacy / recall / prevention; recognisable-in-log ≠
would-have-prevented."

Build info

Target: x86_64-unknown-linux-gnu (Linux only)
Binary: apohara-compliance-scanner-x86_64-unknown-linux-gnu
Source commit: a61f8327d5b86a21ed513f120eb3f7bafd0c9ea4
Built: 2026-06-09 via local cargo build --release --locked

Limitations of this local build

Linux x86_64 only. The other 3 release targets (aarch64-apple-darwin,
x86_64-apple-darwin, x86_64-pc-windows-msvc) require cross-compile
setup or macOS/Windows runners that aren't available in this local build.
No cosign signatures (keyless OIDC signing requires GH Actions).
No GH artifact attestations (build provenance requires GH Actions).

The canonical multi-target release workflow is at
.github/workflows/release.yml.

Assets 16

09 Jun 21:09

SuarezPM

v2.1.0

1b170e1

apohara-compliance v2.1.0 — Representation-Aware Taint + Evasion Robustness

v2.1.0 — Representation-Aware Taint + Evasion Robustness + Cleanups (ADR-5)

Additive — the v2.0 trajectory pass is unchanged; representation + vocabulary +
a structural shell pass are added; the single-action engine is byte-identical
to v1.4 (AgentDojo recall 23 / 35 UNCHANGED). The gap closed: the v2.0
representation/vocab gap (AgentDojo's structured tool-call sinks did not
overlap the v2.0 taint_source / taint_sink vocab).

Added

Representation-aware taint (ADR-5): the parser now emits a reserved
sink: action carrying a deterministic canonical role string
(recipient= / amount= / url= / command=, with const SINK_GRAMMAR
enforcing an authority boundary). The sink: channel is excluded from the
single-action loop by a one-line starts_with("sink:") guard, so the new
representation cannot produce a single-action false positive (proven by
the C1 FP-safety + C2 grammar-disjointness tests).
Taxonomy-derived generic injection-marker vocabulary for AGT-TRJ (OWASP
ASI02:2026 / AITG-APP-02 / documented IPI canary families — each marker
cited in detection-rules.yaml).
Structural shlex shell pass → AGT-MIS-004 catches flag-reordered
destructive commands a substring scan cannot (e.g. rm -r -f / rm -fr /
quoted-arg variants); folded into AGT-MIS-004.
A3 session-only normalization (Unicode / casing / homoglyph) in the session
value picker (relevant_input). Documented deferred gap: parse_repo
builds actions directly and is NOT normalized — covers the session channel
(30/101 gate paths, 0/56 repo-file). Repo-file normalization is a documented
follow-up (ADR-5 M4).
Synthetic positive (trj-representation-aware-positive.jsonl) fires
AGT-TRJ-001 + AGT-TRJ-003 via the real binary; the
trj-structured-sink-benign-trap and the FinBot direct-injection fixture
(negative control) fire zero.
Pre-registration: frozen rules SHA ac88825 (verified unchanged
post-scan). Repo-file normalization deferred to a future PR.

Notes

Honesty invariants unchanged.
The synthetic positive is a constructive existence proof that the engine
can fire on a structured representation — it is authored to fire, so it
is not an independent measurement. Real-trace generalisation is
UNPROVEN at v2.1 (stated plainly in ADR-5).
"Real-world efficacy is still UNPROVEN — stated plainly. v2.1 closes the
gap in the engine's vocabulary and representation (structured sinks +
generic markers now exist and fire on a synthetic trajectory), but there
is no committed real trajectory corpus to exercise it: the AgentDojo
corpus is flat bait (no trajectories) and v2.1 defers all live capture
(A10). So the structured-sink representation is measured on the synthetic
positive only; real-trace generalisation remains the deferred gap. A
deterministic offline matcher will never catch a determined obfuscator
(the documented ceiling)."

Build info

Target: x86_64-unknown-linux-gnu (Linux only)
Binary: apohara-compliance-scanner-x86_64-unknown-linux-gnu
Source commit: 1b170e19eeba8cf9fe06cbc5daacfeb4e9cee843
Built: 2026-06-09 via local cargo build --release --locked

Limitations of this local build

Linux x86_64 only. The other 3 release targets (aarch64-apple-darwin,
x86_64-apple-darwin, x86_64-pc-windows-msvc) require cross-compile
setup or macOS/Windows runners that aren't available in this local build.
No cosign signatures (keyless OIDC signing requires GH Actions).
No GH artifact attestations (build provenance requires GH Actions).

The canonical multi-target release workflow is at
.github/workflows/release.yml.

Assets 16

09 Jun 21:08

SuarezPM

v2.0.0

661820e

apohara-compliance v2.0.0 — Trajectory Taint-Correlation Detection

v2.0.0 — Trajectory Taint-Correlation Detection (ADR-4)

Additive — a new deterministic taint engine runs AFTER the single-action
loop AND after the ADR-2 sequence pass. It expresses the injection →
consequence dataflow the single-action engine cannot: a TAINTED source
(an action on the untrusted-data tool-result: channel carrying injection
markers, AND not a doc/comment quote) FOLLOWED BY a genuine sensitive
real-action sink (exfil / destructive / financial) later in the same action
stream (forward-correlated: the taint persists across intervening steps).

Added

New module crates/scanner/src/taint.rs — the deterministic
taint-correlation engine. Self-contained by design (ADR-4 OQ1): copies
the small CompiledStep / step_match shape from sequence.rs rather
than sharing a helper, to keep zero blast-radius on the CRITICAL
matching.rs and the live sequence.rs AGT-MEM-001 path.
New rules (rule count 17 → 20): AGT-TRJ-001 (injection + sensitive sink,
base), AGT-TRJ-002 (exfil sink family), AGT-TRJ-003 (destructive sink
family).
A10 live capture (pre-registration + smoke): the committed AgentDojo
corpus + a bounded live capture on AgentDojo banking-suite with
MiniMax-M3 (OpenRouter adapter), attack important_instructions,
10 attacked pairs + 2 benign. Real-world result: 0 / 10 attack-success
on MiniMax (the model refused every indirect injection); 28 API calls,
65,550 tokens; real-usage proof.
Synthetic positive (trj-agentdojo-async-injection.jsonl + friends) fires
AGT-TRJ-001 / 002 / 003 via the real binary; the FinBot direct-injection
fixture (negative control) and benign-trajectory traps fire zero.
Pre-registration: tests/corpus/PREREG-v2-agentdojo.md (frozen before
scanning). Proof: tests/corpus/PROOF-v2-minimax.md (the real-world
0 / 10 + 65,550 tokens).
Added: 8 commits 2610a0b..9e1a78a on v2.0-trajectory-taint (Ralph
v0 → F4, AMENDMENT-A feasibility F5A, deslop).

Notes

Honesty invariants unchanged: every finding is is_candidate: true, every
formatter line is CANDIDATE — prefixed, SARIF level is never error.
No new runtime dependency; the detection core stays deterministic and
offline; the synthetic precision/recall gate still
1.0000 / 1.0000 / FP = 0.
Real-world efficacy is UNPROVEN at v2.0 (stated plainly in ADR-4 and
the PROOF). Two measured reasons: (1) MiniMax-M3 resisted all 10
injections, so no real positive trace exists; (2) a verified
representation/vocab gap — AgentDojo's <INFORMATION>… marker and
structured tool-call sinks (send_money(…)) do not overlap apohara's
text-pattern taint_source / taint_sink vocabulary, so even a
successful trace would very likely not fire. apohara is a post-hoc
transcript scanner (recognisable-in-log ≠ would-have-prevented), and its
rules are vocab-scoped to shell/coding agents. Per the pre-registration
the rules were NOT retro-fitted to AgentDojo.

Build info

Target: x86_64-unknown-linux-gnu (Linux only)
Binary: apohara-compliance-scanner-x86_64-unknown-linux-gnu
Source commit: 661820e055ad6c46ab433f0ba32044a2e1e669a7
Built: 2026-06-09 via local cargo build --release --locked

Limitations of this local build

Linux x86_64 only. The other 3 release targets (aarch64-apple-darwin,
x86_64-apple-darwin, x86_64-pc-windows-msvc) require cross-compile
setup or macOS/Windows runners that aren't available in this local build.
No cosign signatures (keyless OIDC signing requires GH Actions).
No GH artifact attestations (build provenance requires GH Actions).

The canonical multi-target release workflow is at
.github/workflows/release.yml.

Assets 16

09 Jun 19:51

github-actions

v1.4.0-r1

4bb9d0c

v1.4.0-r1

chore(release): v1.4.0 prep — bump version + CHANGELOG (entries for v…

Assets 15

09 Jun 21:08

github-actions

v1.4.0

4bb9d0c

v1.4.0

chore(release): v1.4.0 prep — bump version + CHANGELOG (entries for v…

Assets 15

06 Jun 19:58

github-actions

v1.1.0

8524ea9

v1.1.0

apohara-compliance v1.1.0 — scan-otlp + ASI06 + supply-chain hardenin…

Assets 15

05 Jun 19:59

github-actions

v1.0.0

5b51e55

v1.0.0

apohara-compliance v1.0.0

Fase 3 (v1.0 'validated + live'): --llm-assist triage emitter, scan-action +
PreToolUse hook, adoption doc + offline guard, crate publishability fix,
CI x86_64-apple-darwin cross-compile fix.

Assets 15

Releases: SuarezPM/apohara-compliance

v9.9.9-rerun-1.4.0-1781035154

Uh oh!

v2.3.0

Uh oh!

apohara-compliance v2.2.0 — Real-Trajectory Efficacy (ADR-6, bound triple)

v2.2.0 — Real-Trajectory Efficacy (ADR-6, bound triple)

Added

Notes

Claim ceiling (verbatim, ADR-6)

Build info

Limitations of this local build

Uh oh!

apohara-compliance v2.1.0 — Representation-Aware Taint + Evasion Robustness

v2.1.0 — Representation-Aware Taint + Evasion Robustness + Cleanups (ADR-5)

Added

Notes

Build info

Limitations of this local build

Uh oh!

apohara-compliance v2.0.0 — Trajectory Taint-Correlation Detection

v2.0.0 — Trajectory Taint-Correlation Detection (ADR-4)

Added

Notes

Build info

Limitations of this local build

Uh oh!

v1.4.0-r1

Uh oh!

v1.4.0

Uh oh!

v1.1.0

Uh oh!

v1.0.0

Uh oh!