
2.2.0+ regresses SDPA performance on Windows #125070

Open
Xemorr opened this issue Apr 26, 2024 · 0 comments
Labels
module: multi-headed-attention module: windows Windows support for PyTorch triaged This issue has been looked at by a team member, and triaged and prioritized into an appropriate module

Comments


Xemorr commented Apr 26, 2024

🐛 Describe the bug

In PyTorch < 2.2.0, SDPA (scaled_dot_product_attention) supports Flash Attention v1 on Windows. In PyTorch >= 2.2.0, it does not support any Flash Attention backend on Windows.
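A minimal way to see which SDPA backends a given build exposes is to query the backend flags and run SDPA directly. This is a sketch, not part of the original report; it assumes a torch 2.x install, where these `torch.backends.cuda` query functions exist:

```python
# Sketch: inspect which SDPA backends are enabled in this PyTorch build.
# Note: these flags report whether a backend is *enabled*, not whether the
# wheel was actually compiled with it (e.g. Windows wheels without Flash).
import torch
import torch.nn.functional as F

print(torch.__version__)
print("flash enabled:        ", torch.backends.cuda.flash_sdp_enabled())
print("mem-efficient enabled:", torch.backends.cuda.mem_efficient_sdp_enabled())
print("math enabled:         ", torch.backends.cuda.math_sdp_enabled())

# SDPA itself always runs: when Flash is unavailable it silently falls back
# to the (slower) math kernel, which is the performance regression reported here.
q = k = v = torch.randn(1, 8, 128, 64)
out = F.scaled_dot_product_attention(q, k, v)
print(out.shape)  # torch.Size([1, 8, 128, 64])
```

Comparing the flag output between 2.1.2 and 2.2.0+ on the same Windows machine would make the backend difference explicit.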

Versions

This is a report of a regression between 2.1.2 and 2.2.0+
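One way to pin down the regression rather than relying on timings is to force the Flash backend and see whether dispatch fails. The sketch below is an assumption-laden repro, not from the report: it uses the `torch.backends.cuda.sdp_kernel` context manager (present in the 2.1/2.2 era, later superseded by `torch.nn.attention.sdpa_kernel`) and requires a CUDA device:

```python
# Sketch: force Flash-only SDPA to surface the regression directly.
# Expectation (unverified here): 2.1.2 on Windows runs Flash Attention v1,
# while 2.2.0+ Windows builds raise a "no available kernel" RuntimeError.
import torch
import torch.nn.functional as F

if torch.cuda.is_available():
    # Flash Attention requires half-precision inputs on CUDA.
    q = k = v = torch.randn(1, 8, 128, 64, device="cuda", dtype=torch.float16)
    try:
        with torch.backends.cuda.sdp_kernel(
            enable_flash=True, enable_math=False, enable_mem_efficient=False
        ):
            F.scaled_dot_product_attention(q, k, v)
        print("Flash Attention kernel ran")
    except RuntimeError as e:
        print("Flash Attention unavailable:", e)
else:
    print("CUDA not available; backend comparison needs a GPU")
```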

cc @peterjc123 @mszhanyi @skyline75489 @nbcsm @vladimir-aubrecht @iremyux @Blackhex @cristianPanaite

@mikaylagawarecki mikaylagawarecki added module: windows Windows support for PyTorch module: multi-headed-attention triaged This issue has been looked at by a team member, and triaged and prioritized into an appropriate module labels Apr 26, 2024
Projects
None yet
Development

No branches or pull requests

2 participants