v0.6.0: Optimized kernels and upgraded dependencies for enhanced performance in PyTorch 2.4+
What's Changed
- Optimized the custom LayerNorm kernel, further boosting end-to-end inference and training speed.
- Integrated a custom Triton-based implementation of the TriangleAttention operator (triattention), improving computational efficiency.
- Integrated the cuEquivariance operator from NVIDIA/cuEquivariance to accelerate equivariant operations, with notable efficiency gains in the TriangleAttention and TriangleMultiplication modules.
- Upgraded the container image and dependencies to resolve efficiency bottlenecks in PyTorch 2.4 and later versions; Supported Biotite 1.2 and above.
New Contributors
Full Changelog: v0.5.5...v0.6.0