A simple PyTorch implementation of Flash MultiHead Attention
artificial-intelligence
transformer
attention
artificial-neural-networks
attention-mechanisms
attentionisallyouneed
gpt4
flash-attention
Updated Feb 5, 2024 - Jupyter Notebook