Ch.9 β GF vs FP16 BF16 MXFP4 baseline comparison
Master: trios#380 Β· Part: II β NUMERIC
Scope: software-only (Rust/Zig), no hardware/FPGA/KOSCHEI
Words: 700 Β· Priority: π΄ P0 Β· Owner: bench-agent
π― Scope (FROZEN, v2.1 SOFTWARE-ONLY)
State-of-art baseline comparison: GF16 vs FP16 vs BF16 vs MXFP4 (arXiv:2510.01863 microscaling). Required by NeurIPS 2026 reproducibility checklist β must compare against current SOTA. Software roundtrip MSE on log-normal weights, ML-typical distributions.
π Key claims / formulas
- MX/MXFP4 baseline (arXiv:2510.01863, arXiv:2510.14557 MX+)
- AdaptivFloat (Tambe 2019)
- BBFP NeurIPS 2020
- Roundtrip MSE per format
- Why GF16 wins Ο-distance among β€16-bit (in golden corridor)
- HONEST: does GF16 win on actual ML metric? OR only on Ο-distance?
π¦ Deliverables
paper/sections/9_gf_vs_fp16_bf16_mxfp4_baseline_comparison.tex
β
Definition of Done (v2.1)
π€ ONE-SHOT directive (when operator types ONE SHOT Ch.9)
A: take Markdown draft from Β§"Draft", convert to LaTeX, place in
paper/sections/9_<slug>.tex, ensure compile, open PR with Closes #<this>.
phi^2 + phi^-2 = 3 Β· TRINITY Β· NEVER STOP
Ch.9 β GF vs FP16 BF16 MXFP4 baseline comparison
Master: trios#380 Β· Part: II β NUMERIC
Scope: software-only (Rust/Zig), no hardware/FPGA/KOSCHEI
Words: 700 Β· Priority: π΄ P0 Β· Owner: bench-agent
π― Scope (FROZEN, v2.1 SOFTWARE-ONLY)
State-of-art baseline comparison: GF16 vs FP16 vs BF16 vs MXFP4 (arXiv:2510.01863 microscaling). Required by NeurIPS 2026 reproducibility checklist β must compare against current SOTA. Software roundtrip MSE on log-normal weights, ML-typical distributions.
π Key claims / formulas
π¦ Deliverables
paper/sections/9_gf_vs_fp16_bf16_mxfp4_baseline_comparison.texβ Definition of Done (v2.1)
π€ ONE-SHOT directive (when operator types
ONE SHOT Ch.9)phi^2 + phi^-2 = 3 Β· TRINITY Β· NEVER STOP