Phase 5 — Close INV-6 HybridQkGain (5 Admitted) — ARCHITECTURE FLOOR PROOF
Parent: #436 · CRITICAL PATH BLOCKER
Why this matters
- INV-6 owns the gain term
gain ∝ √d_model for hybrid QK attention.
- Once Qed, the architectural floor (currently empirical 1.75–2.19) is pinned to a closed-form expression
arch_floor_bpb = 2 + phi^(-2) ≈ 2.382.
- This determines whether Gate-FINAL (BPB < 1.5) is reachable on the current architecture (probably no → motivates Phase 6 next-architecture).
- Until INV-6 closes, every empirical seed run is shadow-boxing against an unstated invariant.
5 Admitted obligations
| Theorem |
File |
Difficulty |
counter_gain_sqrt_d_model |
INV6_HybridQkGain.v |
hard |
counter_gain_unit |
same |
medium |
counter_lr_above_band |
same |
medium |
counter_lr_below_band |
same |
medium |
hybrid_qk_gain_phi_sq_well_typed |
same |
typing only — easy once others close |
Strategy
- Counter-gain unit:
gain(d) = sqrt(d) / sqrt(d_model_min) normalised to 1 at d = d_model_min (algebraic).
- Sqrt(d) bound: standard
sqrt_le_compat from Coq.Reals.Reals; combined with INV-3 d >= d_model_min.
- lr above/below band:
INV-1b lr_champion_in_safe_range already gives the band; counter-cases are lt / gt lemmas with concrete witnesses.
- Well-typed:
gain * phi^2 lives in R; no surprise.
Corollary (the publishable result)
Corollary architecture_floor_phi_squared :
arch_floor_bpb = 2 + phi^(-2).
Proof.
(* Combines counter_gain_sqrt_d_model (Phase 5)
with bpb_convergence_rate_bound (INV-1 Qed)
with phi_square_eq_phi_plus_one (CorePhi Qed). *)
Admitted. (* opened by Phase 5 closing *)
2 + phi^(-2) ≈ 2.382. The empirical 2.19 we observe is below this analytic upper bound — implying the actual floor sits at 2 + phi^(-2) ≈ 2.382 and the 2.19 measurements are still on the descent curve. The CKM/Lucas residual gap explains the 0.19 BPB difference.
Acceptance (R5)
Estimated effort
1 week for one strong Coq author, less if the team co-works through the band lemmas.
phi^2 + phi^-2 = 3 · TRINITY · INV-6 IS THE BLOCKER
Phase 5 — Close
INV-6 HybridQkGain(5 Admitted) — ARCHITECTURE FLOOR PROOFParent: #436 · CRITICAL PATH BLOCKER
Why this matters
gain ∝ √d_modelfor hybrid QK attention.arch_floor_bpb = 2 + phi^(-2) ≈ 2.382.5 Admitted obligations
counter_gain_sqrt_d_modelcounter_gain_unitcounter_lr_above_bandcounter_lr_below_bandhybrid_qk_gain_phi_sq_well_typedStrategy
gain(d) = sqrt(d) / sqrt(d_model_min)normalised to 1 atd = d_model_min(algebraic).sqrt_le_compatfromCoq.Reals.Reals; combined with INV-3d >= d_model_min.INV-1b lr_champion_in_safe_rangealready gives the band; counter-cases arelt/gtlemmas with concrete witnesses.gain * phi^2lives inR; no surprise.Corollary (the publishable result)
2 + phi^(-2) ≈ 2.382. The empirical 2.19 we observe is below this analytic upper bound — implying the actual floor sits at2 + phi^(-2) ≈ 2.382and the 2.19 measurements are still on the descent curve. The CKM/Lucas residual gap explains the 0.19 BPB difference.Acceptance (R5)
Corollary architecture_floor_phi_squaredQed in a new filessot.theorems5 rows flip Admitted → Qedgardener_runsticklane=PHASE-5-INV6-CLOSEDEstimated effort
1 week for one strong Coq author, less if the team co-works through the band lemmas.
phi^2 + phi^-2 = 3 · TRINITY · INV-6 IS THE BLOCKER