v0.7.0: Phase 8 — Developer Experience, Debugging & Multimodal
What's New
Tensor Debugger (M45)
- Binary trace recording (124-byte fixed-size entries) with per-op stats (min/max/mean/std)
- NaN/Inf sentinel detection with automatic halt
- Compile-time NaN risk analysis (log/sqrt/div patterns)
- Trace diffing for non-determinism diagnosis
- Chrome tracing export
- @no_trace and @trace_breakpoint decorators
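As a rough illustration of the per-op stats and NaN/Inf sentinel described above, the following Rust sketch computes min/max/mean/std over a tensor's values and counts non-finite entries. The names `OpStats`, `summarize`, and `should_halt` are hypothetical and are not taken from the project; the real 124-byte trace entry layout is not shown in these notes, and the use of population standard deviation is an assumption.

```rust
/// Hypothetical per-op summary (not the project's actual trace entry layout).
#[derive(Debug, Clone, Copy)]
pub struct OpStats {
    pub min: f32,
    pub max: f32,
    pub mean: f32,
    pub std: f32,
    /// Number of NaN or +/-Inf values seen.
    pub non_finite: usize,
}

/// Summarizes the finite values of a tensor and counts NaN/Inf separately,
/// so a single bad value does not poison min/max/mean/std.
/// Assumption: population standard deviation (divide by n, not n - 1).
pub fn summarize(values: &[f32]) -> OpStats {
    let mut min = f32::INFINITY;
    let mut max = f32::NEG_INFINITY;
    let mut sum = 0.0f64;
    let mut sum_sq = 0.0f64;
    let mut finite = 0usize;
    let mut non_finite = 0usize;

    for &v in values {
        if !v.is_finite() {
            non_finite += 1;
            continue;
        }
        min = min.min(v);
        max = max.max(v);
        sum += v as f64;
        sum_sq += (v as f64) * (v as f64);
        finite += 1;
    }

    if finite == 0 {
        // No finite values at all: report NaN statistics.
        return OpStats { min: f32::NAN, max: f32::NAN, mean: f32::NAN, std: f32::NAN, non_finite };
    }

    let n = finite as f64;
    let mean = sum / n;
    // Clamp at zero to guard against tiny negative values from rounding.
    let var = (sum_sq / n - mean * mean).max(0.0);
    OpStats { min, max, mean: mean as f32, std: var.sqrt() as f32, non_finite }
}

/// Sentinel check: a debugger could halt execution when this returns true.
pub fn should_halt(stats: &OpStats) -> bool {
    stats.non_finite > 0
}
```

Calling `summarize` on each op's output and checking `should_halt` afterwards is one way a sentinel of this kind could be wired in; how the actual debugger does it may differ.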
Reproducibility Mode (M46)
- --deterministic flag with compile-time non-determinism detection
- 4 non-determinism categories: GPU atomics (auto-fixed), algorithm selection (auto-fixed), implicit RNG (error), external (warning)
- Deterministic kernel variant selection (sort-based reduction, fixed cuBLAS)
- RNG seed tracking (ExplicitSeed/Derived/Implicit)
- Graph hash computation for checkpoint fingerprinting
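The four categories and their handling listed above can be pictured as a small enum-to-disposition mapping. This is only a sketch: the type and function names (`NonDeterminism`, `Disposition`, `blocks_compilation`) are hypothetical, and only the category-to-outcome pairs come from these notes.

```rust
/// The four non-determinism categories named in the release notes.
/// Type and variant names are hypothetical.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub enum NonDeterminism {
    GpuAtomics,
    AlgorithmSelection,
    ImplicitRng,
    External,
}

/// How a finding is handled under the deterministic mode.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub enum Disposition {
    AutoFixed,
    Error,
    Warning,
}

impl NonDeterminism {
    /// Maps each category to the handling stated in the notes.
    pub fn disposition(self) -> Disposition {
        match self {
            NonDeterminism::GpuAtomics => Disposition::AutoFixed,
            NonDeterminism::AlgorithmSelection => Disposition::AutoFixed,
            NonDeterminism::ImplicitRng => Disposition::Error,
            NonDeterminism::External => Disposition::Warning,
        }
    }
}

/// Returns true if any finding is a hard error.
/// Assumption: only errors stop compilation; warnings and auto-fixes do not.
pub fn blocks_compilation(findings: &[NonDeterminism]) -> bool {
    findings
        .iter()
        .any(|f| f.disposition() == Disposition::Error)
}
```

Under this reading, a graph containing only GPU atomics and external calls would still compile, while any implicit RNG use would be rejected until an explicit seed is supplied.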
Multimodal Primitives (M48)
- PatchEmbed config with compile-time validation (image_size % patch_size == 0)
- MelSpectrogram with compile-time mel filterbank (hz-to-mel triangular filters)
- CrossAttention config with Q/K dim matching and head divisibility
- Modality classification heuristic (Vision/Audio/Text by tensor rank and dtype)
- @multimodal decorator validation
- 7 preprocessing FFI stubs (patch_embed, mel, cross_attention, resize, normalize, stft, resample)
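The validation rules and the hz-to-mel conversion mentioned above can be sketched as follows. All function names are hypothetical, and the checks run at runtime here rather than at compile time as in the project. The mel formula shown is the common HTK-style variant, which is an assumption; the notes do not say which mel scale is used.

```rust
/// Checks that the image divides evenly into patches and returns the
/// patch count for a square image. Hypothetical helper, runtime sketch only.
pub fn num_patches(image_size: usize, patch_size: usize) -> Result<usize, String> {
    if patch_size == 0 {
        return Err("patch_size must be non-zero".to_string());
    }
    if image_size % patch_size != 0 {
        return Err(format!(
            "image_size {} is not divisible by patch_size {}",
            image_size, patch_size
        ));
    }
    let per_side = image_size / patch_size;
    Ok(per_side * per_side)
}

/// Checks head divisibility for attention and returns the per-head dimension.
pub fn head_dim(embed_dim: usize, num_heads: usize) -> Result<usize, String> {
    if num_heads == 0 || embed_dim % num_heads != 0 {
        return Err(format!(
            "embed_dim {} is not divisible by num_heads {}",
            embed_dim, num_heads
        ));
    }
    Ok(embed_dim / num_heads)
}

/// Hz to mel, HTK-style formula (assumed variant).
pub fn hz_to_mel(hz: f64) -> f64 {
    2595.0 * (1.0 + hz / 700.0).log10()
}

/// Inverse of `hz_to_mel`.
pub fn mel_to_hz(mel: f64) -> f64 {
    700.0 * (10f64.powf(mel / 2595.0) - 1.0)
}
```

For a triangular filterbank, filter center frequencies are typically spaced evenly on the mel axis and mapped back to Hz with `mel_to_hz`; whether the project follows exactly that layout is not stated here.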
Stats
- 678 unit tests passing
- Clippy clean
Full Changelog: v0.5.0...v0.7.0