Skip to content

πŸ“– Ch.9 β€” GF vs FP16 BF16 MXFP4 baseline comparison (700w Β· πŸ”΄ P0 Β· II β€” NUMERIC)Β #399

@gHashTag

Description

@gHashTag

Ch.9 β€” GF vs FP16 BF16 MXFP4 baseline comparison

Master: trios#380 Β· Part: II β€” NUMERIC
Scope: software-only (Rust/Zig), no hardware/FPGA/KOSCHEI
Words: 700 Β· Priority: πŸ”΄ P0 Β· Owner: bench-agent

🎯 Scope (FROZEN, v2.1 SOFTWARE-ONLY)

State-of-art baseline comparison: GF16 vs FP16 vs BF16 vs MXFP4 (arXiv:2510.01863 microscaling). Required by NeurIPS 2026 reproducibility checklist β€” must compare against current SOTA. Software roundtrip MSE on log-normal weights, ML-typical distributions.

πŸ”‘ Key claims / formulas

  • MX/MXFP4 baseline (arXiv:2510.01863, arXiv:2510.14557 MX+)
  • AdaptivFloat (Tambe 2019)
  • BBFP NeurIPS 2020
  • Roundtrip MSE per format
  • Why GF16 wins Ο†-distance among ≀16-bit (in golden corridor)
  • HONEST: does GF16 win on actual ML metric? OR only on Ο†-distance?

πŸ“¦ Deliverables

  • paper/sections/9_gf_vs_fp16_bf16_mxfp4_baseline_comparison.tex

βœ… Definition of Done (v2.1)

  • Word count 700 Β±10%
  • All claims cited (primary literature)
  • No woody-shop / KOSCHEI / FPGA references
  • Compiles in main.tex via tectonic
  • L1 (no .sh) Β· L3 Β· L4 Β· L8

πŸ€– ONE-SHOT directive (when operator types ONE SHOT Ch.9)

A: take Markdown draft from Β§"Draft", convert to LaTeX, place in
paper/sections/9_<slug>.tex, ensure compile, open PR with Closes #<this>.

phi^2 + phi^-2 = 3 Β· TRINITY Β· NEVER STOP

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions