This is the implementation of the paper:
"LambdaNetworks: Modeling Long-Range Interactions Without Attention," I. Bello et al., 2021
- Python 3
- PyTorch, torchvision
python train.py --help
# An example training command
python train.py \
--gpu 0 \
--lr 1e-2 \
--weight-decay 1e-4 \
--num-epoch 100 \
--batch-size 256
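
The flags in the command above suggest a conventional command-line interface. The parser in `train.py` is not shown here, so the following is only a minimal sketch of how such flags could be declared with `argparse`. The flag names are taken from the example command; the function name `build_parser`, the types, the defaults (copied from the example values), and the help strings are all assumptions. Run `python train.py --help` for the options the script actually supports.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    # Hypothetical parser: flag names mirror the example command,
    # but types, defaults, and help texts are assumptions, not the
    # contents of the real train.py.
    parser = argparse.ArgumentParser(description="Training options (sketch)")
    parser.add_argument("--gpu", type=int, default=0,
                        help="index of the GPU to use")
    parser.add_argument("--lr", type=float, default=1e-2,
                        help="learning rate")
    parser.add_argument("--weight-decay", type=float, default=1e-4,
                        help="weight decay coefficient")
    parser.add_argument("--num-epoch", type=int, default=100,
                        help="number of training epochs")
    parser.add_argument("--batch-size", type=int, default=256,
                        help="mini-batch size")
    return parser


# Parse an explicit argument list instead of sys.argv for illustration.
args = build_parser().parse_args(["--lr", "1e-2", "--batch-size", "256"])
```

Note that `argparse` converts hyphens in flag names to underscores, so `--weight-decay` is read back as `args.weight_decay` and `--num-epoch` as `args.num_epoch`.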