Skip to content

Release v1744

Choose a tag to compare

@github-actions github-actions released this 14 Jun 04:23
· 3 commits to main since this release
1d12e88

Automated release from CI pipeline

Changes:
refactor(beyond-sota): ADR-155 M2 — host-verifiable §8 closeout (7 de-magic, 9 boundary tests, native-conv honest-null) (#1059)

  • refactor(train): ADR-155 M2 §8 — de-magic train non-tch tuning constants + boundary tests

Lift bare numeric literals used as thresholds / guard epsilons in the
non-tch (host-verifiable) train surface into named, documented consts and
pin each set with a *_consts_unchanged_from_literals test. Values are
bit-identical to the prior inline literals — cleanup, no behaviour change.

De-magicked (const + pin test):

  • metrics_core.rs: VISIBILITY_THRESHOLD (0.5), MIN_REFERENCE_EXTENT (1e-6),
    OKS_FALLBACK_SIGMA (0.07)
  • ruview_metrics.rs: NUM_KEYPOINTS (17), VISIBILITY_THRESHOLD (0.5),
    PCK_THRESHOLD (0.2), MIN_BBOX_DIAG (1e-3), MIN_DURATION_MINUTES (1e-6)
  • subcarrier.rs: SPARSE_BASIS_SIGMA (0.15), SPARSE_BASIS_THRESHOLD (1e-4),
    SPARSE_REGULARIZATION_LAMBDA (0.1), SPARSE_COO_PRUNE_EPS (1e-8),
    SPARSE_SOLVER_TOL (1e-5 f64), SPARSE_SOLVER_MAX_ITERS (500)
  • eval.rs: MIN_POSITIVE_MPJPE (1e-10)
  • domain.rs: LAYER_NORM_EPS (1e-5)
  • virtual_aug.rs: BOX_MULLER_U1_FLOOR (1e-10), MIN_ROOM_SCALE (1e-10)

Boundary / characterization tests (pin CURRENT behaviour):

  • visibility_threshold_boundary_is_inclusive (>= 0.5 at the edge)
  • degenerate_extent_below_floor_is_unscoreable ((0,0,0.0)/0.0, not perfect)
  • tracking_zero_duration_does_not_divide_by_zero
  • oks_short_array_is_bounded_at_keypoint_count (16 rows, no panic)
  • compute_interp_weights_single_target_is_index_zero (target_sc==1)
  • sparse_interp_single_target_is_finite
  • domain_gap_infinite_when_in_domain_perfect_but_cross_nonzero
  • domain_gap_unity_when_everything_perfect
  • augment_frame_zero_room_scale_passes_amplitude_finite

Doc-only (no behaviour change):

  • rapid_adapt.rs: correct module-doc O(eps) -> O(eps^2) for central differences
  • geometry.rs: add # Panics to DeepSets::encode (documents existing assert!)

train --no-default-features: 191 lib (was 176), 303 total (was 288), 0 failed.

Co-Authored-By: claude-flow ruv@ruv.net

  • feat(nn): ADR-155 M2 §3 — pure-Rust LinearHead::try_new input guard + de-magic softplus threshold

ADR-155 §3 found rf_encoder.rs has no adversarial checkpoint-deserialization
assert — its assert_eq!s in LinearHead::new are construction-time API contracts
on programmer-supplied vectors. This adds the honest, in-scope improvement the
M2 task allows: a pure-Rust fallible constructor so weights from an untrusted /
deserialized checkpoint can be shape-validated without panicking.

  • Add RfHeadError (WeightShape / BiasShape / VarWeightShape) + Display + Error.
  • Add LinearHead::try_new returning Result<Self, RfHeadError>; on success the
    head is byte-identical to LinearHead::new. new() is unchanged (still asserts;
    now documents # Panics and points to try_new) — no behaviour change for
    existing callers.
  • De-magic softplus's bare 20.0 overflow threshold into
    SOFTPLUS_LINEAR_THRESHOLD (value unchanged) + pin test.

Tests: try_new_accepts_valid_and_rejects_each_bad_shape (valid == new forward;
each bad shape → typed error), softplus_threshold_unchanged_from_literal.

nn --no-default-features lib: 37 passed (was 35), 0 failed.

Co-Authored-By: claude-flow ruv@ruv.net

  • perf(nn): ADR-155 M2 §4 — native-conv bench-first → MEASURED-INCONCLUSIVE (no perf change shipped)

The §8 "native-conv naive-loop rewrite" backlog item: DensePoseHead::
apply_conv_layer is a pure-Rust 6-nested-loop conv (benchable on this host, not
tch/ort-gated). Bench-first per the §0 PROOF discipline.

  • Add committed criterion bench benches/native_conv_bench.rs measuring forward()
    through the naive conv on representative single-layer configs (--no-default-
    features; no ort download).
  • Prototyped a bit-identical range-clamped variant (hoist the per-tap in-bounds
    branch by pre-clamping kh/kw ranges; same ic→kh→kw MAC order ⇒ bit-identical).
    MEASURED before/after on this host: ~35% faster on padding-heavy small-channel
    maps (4.40→2.84 ms) but a ~3% regression on channel-heavy maps (11.09→11.48
    ms), all inside a ±20% run-to-run noise floor. Verdict: INCONCLUSIVE — the
    benefit is not robustly positive, so the rewrite is NOT shipped and NOT a
    fabricated speedup. Reverted to the naive loop; honestly deferred (ADR-155 §8).
  • Add native_conv_matches_reference: a hand-computed characterization anchor
    (1×1 = scalar MAC; same-padded 3×3 ones = truncated-window sums 9/6/4) pinning
    CURRENT conv behaviour for any future rewrite.

nn --no-default-features lib: 38 passed (was 37), 0 failed. No behaviour change.

Co-Authored-By: claude-flow ruv@ruv.net

  • docs(adr-155): M2 §8.2 — enumerated host-verifiable P3 backlog clearance + CHANGELOG

Replace the §8 bulk "~40 lower-severity findings" line with the real, enumerated
M2 resolution (§8.2): 7 de-magicked (const + pin == prior literal), 9 boundary
tests, 1 input guard (rf_encoder try_new), 2 doc-only, 1 perf bench-first
MEASURED-INCONCLUSIVE (not shipped). Mark native-conv + rf_encoder RESOLVED;
state which §8 items stay data-gated (GraphPose-Fi/INT4/CSI-JEPA) or tch-gated
(proof/trainer/model panic sites, metrics *_v2 dead code) and ONNX read-lock
upstream-gated — blocked, not dropped. Declare the non-tch-verifiable subset of
§8 cleared.

Validation: train --no-default-features 303 passed (was 288); nn lib 38 (was 35);
workspace --no-default-features 3,293 passed, 0 failed; Python proof VERDICT PASS,
hash f8e76f21…46f7a UNCHANGED bit-exact.

Co-Authored-By: claude-flow ruv@ruv.net

Docker Image:
ghcr.io/ruvnet/RuView:1d12e8831a6deabc1c9bf32bc62a3e08cf952a3e