Skip to content

v1.4.0-rc2

Pre-release
Pre-release

Choose a tag to compare

@imezx imezx released this 19 Nov 04:58
· 42 commits to main since this release

# What's new?


# What's changed?

  • (Changes) Several small fixes & improvements
    • Fixes incorrectness to dump & load State to optimizer & trainer.
    • Corrected Softmax backward pass to use the efficient Jacobian-vector product.
    • Added strict assertion to shapes for Math operations.
    • Softmax and Activations now reuse internal buffers to reduce memory allocation during training loops.
    • Fused Kernels: NN.Linear now combines matrix multiplication and bias addition into a single parallel block, reducing thread synchronization overhead significantly.