Skip to content

[Unity][CUTLASS] Fixed stacked attention offload when QKV reshape uses the same shape expression#14728

Merged
masahi merged 2 commits intoapache:unityfrom
masahi:cutlass-stacked-attention-fix
Apr 27, 2023
Merged

[Unity][CUTLASS] Fixed stacked attention offload when QKV reshape uses the same shape expression#14728
masahi merged 2 commits intoapache:unityfrom
masahi:cutlass-stacked-attention-fix

Commits

Commits on Apr 26, 2023