An simple pytorch implementation of Flash MultiHead Attention
artificial-intelligence transformer attention artificial-neural-networks attention-mechanisms attentionisallyouneed gpt4 flash-attention
-
Updated
Feb 5, 2024 - Jupyter Notebook