Reference implementation of "Softmax Attention with Constant Cost per Token" (Heinsen, 2024)
attention
attention-mechanism
attention-model
linear-attention
linear-attention-model
heinsen-attention
-
Updated
Jun 6, 2024 - Python