feat(v0.2.2): Ampere-optimized SGEMM with cp.async pipeline (18 TFLOPS) #37

Merged
m96-chan merged 8 commits into main from feature/v0.2.2-3090ti-tuning
Dec 13, 2025

Commits

Commits on Dec 12, 2025

Commits on Dec 13, 2025