feat: PMAT-546 Architecture parity + gate job fix + 18 new tests by noahgift · Pull Request #733 · paiml/aprender

noahgift · 2026-04-12T12:12:20Z

Summary

5 new Architecture enum variants: FalconH1, Mamba, Moonshine, OpenElm, Rwkv7
2 new model-family YAML contracts: gptneox.yaml, opt.yaml
Provable contract model-family-parity-v1.yaml with 5 falsification conditions
18 new tests (6 parity + 10 rosetta inference + 2 import inference)
Extended architecture inference in both rosetta inspector and import pipeline
Top-level gate job in ci.yml — fixes org ruleset blocking auto-merge
chown post-step prevents root-owned files from Docker containers
Advisory exemptions: 10 new wasmtime 27 + rand 0.10 (test-only deps)
Toyota Way: fixed cargo fmt in 5 pre-existing crates

Five-Whys (Architecture Parity)

#	Why	Answer
1	Why 5 YAML families with no enum?	Created from HF families, not from import pipeline
2	Why the mismatch?	Authored at different times
3	Why no cross-validation?	Independent deliverables
4	Why no enforcement?	Invariant was implicit
5	Root cause	Nobody wrote the parity contract. Fixed.

Five-Whys (Gate Job)

#	Why	Answer
1	Why can't PR #731 auto-merge?	Org ruleset requires "gate" check
2	Why not matching?	Reusable workflow produces "ci / gate" (prefixed)
3	Why prefixed?	GitHub adds caller job name as namespace
4	Why doesn't ruleset match?	Exact context name required
5	Root cause	ci.yml lacked a top-level `gate` job. Fixed.

Test Results

aprender-core: 13,023 passed, 0 failed (+18 new)
aprender-contracts: 1,371 passed, 0 failed
cargo fmt --check: clean
cargo deny check advisories: clean

Test plan

cargo test -p aprender-core --lib — 13,023 pass
cargo test -p aprender-contracts --lib — 1,371 pass
cargo clippy -p aprender-core --lib -- -D warnings — clean
cargo fmt --all -- --check — clean
cargo deny check advisories — clean
CI: workspace-test (70 crates)
CI: gate (new top-level job)
CI: security (with advisory exemptions)

🤖 Generated with Claude Code

…falsification tests Five-Whys: Architecture enum had 14 non-Auto variants but only 12 matching model-family YAML contracts. 5 YAML families (falcon_h1, mamba, moonshine, openelm, rwkv7) had no Architecture variant. 2 enum variants (GptNeoX, Opt) had no YAML contract. Root cause: implicit assumption that these would stay in sync without enforcement. Changes: - Add 5 Architecture variants: FalconH1, Mamba, Moonshine, OpenElm, Rwkv7 - Create 2 YAML contracts: gptneox.yaml, opt.yaml - Create provable contract: model-family-parity-v1.yaml (5 falsification conds) - Add 6 parity tests in converter_types_tests_parity.rs - Update from_model_type(), is_llm(), display_name(), map_name() for all - Fix pre-existing cargo fmt in 5 crates (Toyota Way) - Update 1 pre-existing test (mamba/rwkv now recognized, not unknown) - 13,011 aprender-core tests pass, 1,371 contract tests pass Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

Transitive dep via quickcheck (test-only). Not in production path. Advisory: rand 0.10.0 is unsound with a custom logger using rand::rng(). Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

10 new wasmtime advisories published 2026-04-09 affect wasmtime 27.0.0 (test-only dep via aprender-test-lib). Not in production path. Upgrade to wasmtime 43 tracked in PR #731. New exemptions: RUSTSEC-2026-{0085,0086,0088,0089,0091,0092,0093,0094,0095,0096} Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

Extend RosettaStone::infer_architecture_from_tensors() to detect: - Mamba (mixer.in_proj/out_proj patterns) - RWKV (rwkv.blocks.* patterns) - GPT-NeoX (gpt_neox.* prefix, fused query_key_value) - OPT (model.decoder.layers.* prefix) - BERT (bert.* prefix) Previously only detected: GPT-2, Qwen2, LLaMA, generic transformer. 10 new falsification tests in tests_arch_inference.rs. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

Add Mamba (mixer.*) and RWKV (rwkv.blocks.*) detection to infer_architecture_from_names() in the import pipeline. Previously only the rosetta inspector detected these patterns. 2 new tests. aprender-core now 13,025 tests. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

Five-Whys: PR #731 all checks pass but auto-merge blocked. 1. Why blocked? Org ruleset "Green Main" requires check named "gate" 2. Why not matching? Reusable workflow produces "ci / gate" (prefixed) 3. Why prefixed? GitHub adds caller job name ("ci") as namespace 4. Why doesn't ruleset match? Rulesets require exact context name 5. Why not fixed before? PR #726 merged via admin bypass Fix: Add top-level "gate" job that checks ci + workspace-test results. Also add chown post-step from PR #731 to prevent root-owned files. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

noahgift and others added 6 commits April 12, 2026 14:11

fix: exempt RUSTSEC-2026-0097 (rand 0.10 unsound with custom logger)

025b4de

Transitive dep via quickcheck (test-only). Not in production path. Advisory: rand 0.10.0 is unsound with a custom logger using rand::rng(). Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

noahgift force-pushed the feat/pmat-next branch from 2796527 to b42e790 Compare April 12, 2026 12:42

noahgift changed the title ~~feat: PMAT-546 Architecture↔model-family parity — 5 new variants + 6 tests~~ feat: PMAT-546 Architecture parity + gate job fix + 18 new tests Apr 12, 2026

noahgift merged commit a33a28b into main Apr 12, 2026
17 of 20 checks passed

noahgift deleted the feat/pmat-next branch April 12, 2026 14:14

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: PMAT-546 Architecture parity + gate job fix + 18 new tests#733

feat: PMAT-546 Architecture parity + gate job fix + 18 new tests#733
noahgift merged 6 commits intomainfrom
feat/pmat-next

noahgift commented Apr 12, 2026 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

noahgift commented Apr 12, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Five-Whys (Architecture Parity)

Five-Whys (Gate Job)

Test Results

Test plan

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

noahgift commented Apr 12, 2026 •

edited

Loading