This is an unofficial implementation of the paper BOAT: Bilateral Local Attention Vision Transformer.
The Swin variant is based on Swin Transformer
The CSwin variant is based on CSwin Tranformer
Please check corresponding folders for more installation, training and evaluation instructions.
This is developped based on CSWin Transformer and Swin-transformer
@article{BOAT,
author = {Tan Yu and Gangming Zhao and Ping Li and Yizhou Yu},
title = {{BOAT:} Bilateral Local Attention Vision Transformer},
journal = {CoRR},
volume = {abs/2201.13027},
year = {2022},
url = {https://arxiv.org/abs/2201.13027},
}