This repository contains an unofficial implementation of the Selective Attention Transformer, an architecture that aims to make attention mechanisms in deep learning models more efficient and effective by selectively focusing on the most relevant parts of the input. For more details, refer to the arXiv paper.

This implementation is based on the original paper and is not affiliated with its authors.
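To make the idea concrete, below is a minimal sketch of one plausible form of selective attention, written in PyTorch. It assumes a mechanism in which each token emits non-negative "drop" scores for earlier tokens, the scores accumulate causally, and the accumulated score is subtracted from the attention logits before the softmax. The function name `selective_attention` and all details here are illustrative assumptions, not this repository's actual API or a definitive rendering of the paper's method.

```python
# Minimal sketch of a selective-attention step (illustrative, not this repo's API).
# Assumption: per-token selection scores accumulate causally and are subtracted
# from the attention logits before the softmax, so tokens flagged as irrelevant
# by earlier tokens receive less attention mass.
import math

import torch
import torch.nn.functional as F


def selective_attention(q, k, v):
    """q, k, v: (batch, seq, dim). Causal attention with a selection penalty."""
    b, n, d = q.shape
    device = q.device

    logits = q @ k.transpose(-2, -1) / math.sqrt(d)  # (batch, seq, seq)
    causal = torch.triu(torch.ones(n, n, dtype=torch.bool, device=device), diagonal=1)
    logits = logits.masked_fill(causal, float("-inf"))

    # Selection scores: s[i, j] is how strongly token i votes to drop earlier
    # token j. For simplicity the attention logits double as selection scores;
    # a real implementation might use a separate head or projection.
    s = F.relu(logits.masked_fill(causal, 0.0))
    s = s.masked_fill(torch.eye(n, dtype=torch.bool, device=device), 0.0)  # no self-votes

    # penalty[i, j] = sum of votes against token j from tokens strictly before i.
    penalty = s.cumsum(dim=-2) - s

    return torch.softmax(logits - penalty, dim=-1) @ v
```

For example, `selective_attention(torch.randn(2, 16, 64), torch.randn(2, 16, 64), torch.randn(2, 16, 64))` returns a `(2, 16, 64)` tensor.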
Key features:

- Selective Attention Mechanism: focuses attention on the most relevant parts of the input (see the sketch above).
- Scalable Architecture: designed to work efficiently with large datasets.
- Modular Design: easy to integrate with existing deep learning frameworks.
- Key-Value Cache (kv_cache): caches key and value tensors during autoregressive inference so they are not recomputed for earlier tokens (see the caching sketch after this list).
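Here is a minimal sketch of how such a key-value cache works during decoding. The class name `KVCache` and the function `decode_step` are illustrative, not this repository's actual interface: keys and values computed for earlier tokens are stored and reused, so each new token only pays for one attention row instead of re-encoding the whole prefix.

```python
# Minimal KV-cache sketch for autoregressive decoding (illustrative names,
# not this repo's interface).
import math

import torch


class KVCache:
    def __init__(self):
        self.k = None  # (batch, tokens_so_far, dim)
        self.v = None

    def append(self, k_new, v_new):
        """k_new, v_new: (batch, 1, dim) for the newly decoded token."""
        self.k = k_new if self.k is None else torch.cat([self.k, k_new], dim=1)
        self.v = v_new if self.v is None else torch.cat([self.v, v_new], dim=1)
        return self.k, self.v


def decode_step(q_new, k_new, v_new, cache):
    """One decoding step: the new token attends over all cached tokens."""
    k, v = cache.append(k_new, v_new)  # grow the cache by one token
    logits = q_new @ k.transpose(-2, -1) / math.sqrt(q_new.shape[-1])
    return torch.softmax(logits, dim=-1) @ v  # (batch, 1, dim)
```

No causal mask is needed inside `decode_step` because the query is always the latest token, and the cache only ever contains its predecessors.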