v0.6.0: Optimized kernels and upgraded dependencies for enhanced performance in PyTorch 2.4+

zhangyuxuann released this 19 Aug 07:35

· 106 commits to main since this release

a40f8f6

What's Changed

Optimized the custom LayerNorm kernel, further boosting end-to-end inference and training speed.
Integrated a custom Triton-based implementation of the TriangleAttention operator (triattention), improving computational efficiency.
Integrated the cuEquivariance operator from NVIDIA/cuEquivariance to accelerate equivariant operations, with notable efficiency gains in the TriangleAttention and TriangleMultiplication modules.
Upgraded the container image and dependencies to resolve efficiency bottlenecks in PyTorch 2.4 and later versions; Supported Biotite 1.2 and above.

New Contributors

@chaitjo made their first contribution in #175

Full Changelog: v0.5.5...v0.6.0

Contributors

chaitjo

Assets 2