Skip to content

Latest commit

 

History

History
41 lines (25 loc) · 1.49 KB

README.md

File metadata and controls

41 lines (25 loc) · 1.49 KB

BOAT: Bilateral Local Attention Vision Transformer

This is an unofficial implementation of the paper BOAT: Bilateral Local Attention Vision Transformer.

The Swin variant is based on Swin Transformer

The CSwin variant is based on CSwin Tranformer

Please check corresponding folders for more installation, training and evaluation instructions.

Pre-trained models

BOAT-Swin-Tiny

BOAT-Swin-Small

BOAT-Swin-Base

BOAT-CSwin-Tiny

BOAT-CSwin-Small

BOAT-CSwin-Base

Acknowledgement

This is developped based on CSWin Transformer and Swin-transformer

If you use this code for your research, please consider citing:

@article{BOAT,
  author    = {Tan Yu and Gangming Zhao and Ping Li and Yizhou Yu},
  title     = {{BOAT:} Bilateral Local Attention Vision Transformer},
  journal   = {CoRR},
  volume    = {abs/2201.13027},
  year      = {2022},
  url       = {https://arxiv.org/abs/2201.13027},
}