Skip to content

v1.3.1 - Turing GPU Hotfix & Linux Support

Latest

Choose a tag to compare

@BlackSnowSkill BlackSnowSkill released this 23 May 07:53

This release brings a critical stability hotfix for pre-Ampere NVIDIA GPUs and introduces comprehensive Linux/Ubuntu installation instructions.

🛠️ Changes & Improvements:
Intellectual GPU Detection (Critical Hotfix): Added automatic CUDA compute capability verification. SageAttention is now safely auto-disabled for Turing (RTX 20xx / GTX 16xx) and older GPUs (compute capability < 8.0) when set to "auto".
Why: SageAttention does not officially support pre-Ampere GPUs. Running it on Turing with new PyTorch environments caused infinite Triton JIT compilation errors (MLIR type mismatch) and severe performance degradation (up to 2.0x speed drop). The node now automatically and smoothly falls back to stable native PyTorch SDPA.
🐧 Linux Setup Guide:** Added step-by-step setup guides for Linux/Ubuntu environments in both English and Russian READMEs.
🔍 Triton JIT Troubleshooting:** Documented solutions for common PyTorch Inductor compiler errors (such as PassManager::run failed under Ubuntu 24.04).