
[Feature question] key_padding_mask in flash attention #424

Open
kanghui0204 opened this issue Aug 6, 2023 · 0 comments
kanghui0204 commented Aug 6, 2023

https://github.com/Dao-AILab/flash-attention/blob/main/flash_attn/modules/mha.py#L461
Hi experts, I saw that the flash-attention MHA implementation doesn't support the key_padding_mask feature. If we want to support it, does flash-attention have an API for this, or how can we do it with flash-attention?
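
For reference, one common way to get the effect of key_padding_mask is to unpad the batch and call the variable-length kernel. The sketch below is a minimal, unofficial example assuming the helpers `unpad_input` / `pad_input` from `flash_attn.bert_padding` and `flash_attn_varlen_qkvpacked_func` from the `flash_attn` package; exact return signatures can differ between versions, so please check against the version you have installed.

```python
# Sketch: emulate key_padding_mask by removing padded tokens and using the
# variable-length flash-attention kernel (not an official MHA-module API).
import torch
from flash_attn.bert_padding import unpad_input, pad_input
from flash_attn import flash_attn_varlen_qkvpacked_func


def mha_with_key_padding_mask(qkv, key_padding_mask, dropout_p=0.0, causal=False):
    """qkv: (batch, seqlen, 3, nheads, headdim).
    key_padding_mask: (batch, seqlen) bool, True for valid (non-padded) tokens."""
    batch, seqlen = qkv.shape[:2]
    # Drop padded tokens; cu_seqlens marks per-sequence boundaries for the varlen kernel.
    # (Some versions of unpad_input return extra values; adjust the unpacking accordingly.)
    qkv_unpad, indices, cu_seqlens, max_seqlen = unpad_input(qkv, key_padding_mask)
    out_unpad = flash_attn_varlen_qkvpacked_func(
        qkv_unpad, cu_seqlens, max_seqlen, dropout_p=dropout_p, causal=causal
    )
    # Scatter outputs back to the padded (batch, seqlen, nheads, headdim) layout.
    return pad_input(out_unpad, indices, batch, seqlen)
```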
