v1.13.0

github-actions released this 16 Jun 22:46

1.13.0 (2026-06-16)

Features

  • compute: bf16 dispatch + parity tests for fused norm kernels (ADR 075 L4) (4d1466a)
  • compute: native bf16 GPU elementwise + AdamW dispatch (ADR 075 L4) (897d1d6)
  • compute: tiny-matrix batched GEMM kernel for small attention shapes (ADR 075 L3) (6554b8c)
  • gpuapi: bf16 fused norm methods on KernelRunner (ADR 075 L4) (f3db3ca)
  • gpuapi: bf16 kernel methods on KernelRunner (ADR 075 L4) (c6a4d3d)
  • kernels: bf16 forward-only fused norm CUDA kernels (ADR 075 L4) (c62c6e7)
  • kernels: bf16 GPU elementwise + AdamW CUDA kernels (ADR 075 L4) (dc3ed14)