Skip to content

v0.7.0: Phase 8 — Developer Experience, Debugging & Multimodal

Choose a tag to compare

@bwiemz bwiemz released this 18 Mar 22:11
· 1895 commits to main since this release

What's New

Tensor Debugger (M45)

  • Binary trace recording (124-byte fixed-size entries) with per-op stats (min/max/mean/std)
  • NaN/Inf sentinel detection with automatic halt
  • Compile-time NaN risk analysis (log/sqrt/div patterns)
  • Trace diffing for non-determinism diagnosis
  • Chrome tracing export
  • @no_trace and @trace_breakpoint decorators

Reproducibility Mode (M46)

  • --deterministic flag with compile-time non-determinism detection
  • 4 non-determinism categories: GPU atomics (auto-fixed), algorithm selection (auto-fixed), implicit RNG (error), external (warning)
  • Deterministic kernel variant selection (sort-based reduction, fixed cuBLAS)
  • RNG seed tracking (ExplicitSeed/Derived/Implicit)
  • Graph hash computation for checkpoint fingerprinting

Multimodal Primitives (M48)

  • PatchEmbed config with compile-time validation (image_size % patch_size)
  • MelSpectrogram with compile-time mel filterbank (hz-to-mel triangular filters)
  • CrossAttention config with Q/K dim matching and head divisibility
  • Modality classification heuristic (Vision/Audio/Text by rank+dtype)
  • @multimodal decorator validation
  • 7 preprocessing FFI stubs (patch_embed, mel, cross_attention, resize, normalize, stft, resample)

Stats

  • 678 unit tests passing
  • Clippy clean

Full Changelog: v0.5.0...v0.7.0