An example of Scatterbrain implementation (combining local attention and Performer)

Last update: Jan 02, 2023

Overview

We use the template from https://github.com/ashleve/lightning-hydra-template. Please read the instructions there to understand the repo structure.

Implementation & Experiments

The example Scatterbrain implementation, which combines local attention with Performer, is in the file src/models/modules/attention/sblocal.py.
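To make the combination concrete, here is a minimal pure-Python sketch of the idea. It is not the code in sblocal.py: it approximates the unnormalized attention matrix with Performer-style positive random features, then adds a sparse correction on a local band so that in-band entries are exact. The function names, the projection matrix `W`, and the `window` parameter are illustrative assumptions, and the sketch omits the usual 1/sqrt(d) scaling and any numerical stabilization.

```python
import math
import random


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def softmax_attention(Q, K, V):
    """Exact (unscaled) softmax attention, used as an O(n^2) reference."""
    out = []
    for q in Q:
        weights = [math.exp(dot(q, k)) for k in K]
        total = sum(weights)
        out.append([sum(w * v[c] for w, v in zip(weights, V)) / total
                    for c in range(len(V[0]))])
    return out


def performer_features(x, W):
    """Positive random features with E[phi(q) . phi(k)] = exp(q . k)."""
    half_sq_norm = dot(x, x) / 2.0
    scale = 1.0 / math.sqrt(len(W))
    return [math.exp(dot(w, x) - half_sq_norm) * scale for w in W]


def scatterbrain_attention(Q, K, V, W, window):
    """Low-rank (Performer) estimate plus an exact correction on a local band.

    Assumes self-attention shapes: len(Q) == len(K) == len(V).
    """
    n, d_v, m = len(K), len(V[0]), len(W)
    phi_q = [performer_features(q, W) for q in Q]
    phi_k = [performer_features(k, W) for k in K]

    # Low-rank summaries, computed once and shared by every query.
    kv = [[sum(phi_k[j][r] * V[j][c] for j in range(n)) for r in range(m)]
          for c in range(d_v)]
    k_sum = [sum(phi_k[j][r] for j in range(n)) for r in range(m)]

    out = []
    for i, q in enumerate(Q):
        num = [dot(phi_q[i], kv[c]) for c in range(d_v)]
        den = dot(phi_q[i], k_sum)
        # Sparse part: replace the estimate by the exact value inside the band.
        for j in range(max(0, i - window), min(n, i + window + 1)):
            correction = math.exp(dot(q, K[j])) - dot(phi_q[i], phi_k[j])
            den += correction
            for c in range(d_v):
                num[c] += correction * V[j][c]
        out.append([x / den for x in num])
    return out


if __name__ == "__main__":
    random.seed(0)
    n, d, m = 8, 4, 16
    Q = [[random.gauss(0, 0.3) for _ in range(d)] for _ in range(n)]
    K = [[random.gauss(0, 0.3) for _ in range(d)] for _ in range(n)]
    V = [[random.gauss(0, 1.0) for _ in range(d)] for _ in range(n)]
    W = [[random.gauss(0, 1.0) for _ in range(d)] for _ in range(m)]
    exact = softmax_attention(Q, K, V)
    approx = scatterbrain_attention(Q, K, V, W, window=1)
    print(max(abs(a - b) for ra, rb in zip(exact, approx) for a, b in zip(ra, rb)))
```

When `window` covers the whole sequence, the correction cancels the random-feature estimate everywhere and the result matches exact softmax attention up to rounding. With a small window, only the out-of-band entries rely on the Performer estimate.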

T2T-ViT inference on ImageNet

To run the T2T-ViT inference on ImageNet experiment:

Download the pretrained weights from the [T2T-ViT repo](https://github.com/yitu-opensource/T2T-ViT/releases):

mkdir -p checkpoints/t2tvit
cd checkpoints/t2tvit
wget https://github.com/yitu-opensource/T2T-ViT/releases/download/main/81.7_T2T_ViTt_14.pth.tar

Convert the weights to the format compatible with our implementation of T2T-ViT:

# cd to scatterbrain path
python scripts/convert_checkpoint_t2t_vit.py checkpoints/t2tvit/81.7_T2T_ViTt_14.pth.tar

Download the ImageNet dataset (just the validation set will suffice). Below, /path/to/imagenet refers to the directory that contains the train and val directories.

Run the inference experiments:

python run.py experiment=imagenet-t2tvit-eval.yaml model/t2tattn_cfg=full datamodule.data_dir=/path/to/imagenet/ eval.ckpt=checkpoints/t2tvit/81.7_T2T_ViTt_14.pth.tar  # 81.7% acc
python run.py experiment=imagenet-t2tvit-eval.yaml model/t2tattn_cfg=local datamodule.data_dir=/path/to/imagenet/ eval.ckpt=checkpoints/t2tvit/81.7_T2T_ViTt_14.pth.tar  # 80.6% acc
python run.py experiment=imagenet-t2tvit-eval.yaml model/t2tattn_cfg=performer datamodule.data_dir=/path/to/imagenet/ eval.ckpt=checkpoints/t2tvit/81.7_T2T_ViTt_14.pth.tar  # 77.8-79.0% acc (there's randomness)
python run.py experiment=imagenet-t2tvit-eval.yaml model/t2tattn_cfg=sblocal datamodule.data_dir=/path/to/imagenet/ eval.ckpt=checkpoints/t2tvit/81.7_T2T_ViTt_14.pth.tar  # 81.1% acc
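The four runs above differ only in the `model/t2tattn_cfg` override, so they can be generated from a single template. The helper below is a small sketch for doing that from Python. The function names `build_eval_command` and `has_val_split` are hypothetical and not part of the repo, and the script only builds and prints the commands rather than running them.

```python
import shlex
from pathlib import Path

# Attention variants evaluated in the commands above.
ATTENTION_CONFIGS = ["full", "local", "performer", "sblocal"]


def build_eval_command(attn_cfg, data_dir, ckpt):
    """Return the run.py invocation for one attention variant as an argv list."""
    return [
        "python", "run.py",
        "experiment=imagenet-t2tvit-eval.yaml",
        f"model/t2tattn_cfg={attn_cfg}",
        f"datamodule.data_dir={data_dir}",
        f"eval.ckpt={ckpt}",
    ]


def has_val_split(data_dir):
    """Check that data_dir contains a val directory (enough for inference)."""
    return (Path(data_dir) / "val").is_dir()


if __name__ == "__main__":
    data_dir = "/path/to/imagenet/"
    ckpt = "checkpoints/t2tvit/81.7_T2T_ViTt_14.pth.tar"
    if not has_val_split(data_dir):
        print(f"warning: no val directory found under {data_dir}")
    for cfg in ATTENTION_CONFIGS:
        print(shlex.join(build_eval_command(cfg, data_dir, ckpt)))
```

Each argv list can be passed to `subprocess.run(cmd, check=True)` from the repository root to launch the corresponding evaluation.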

Requirements

Python 3.8+, PyTorch 1.9+, torchvision, torchtext, pytorch-fast-transformers, munch, einops, timm, hydra-core, hydra-colorlog, python-dotenv, rich, pytorch-lightning, lightning-bolts.

We provide a Dockerfile that lists all the required packages.
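If you are not using the Dockerfile, a quick way to see which of the listed packages are missing from the current environment is to query installed distribution metadata. This is a sketch under assumptions: the distribution names are taken to be the PyPI names of the packages listed above (with PyTorch published as `torch`), the helper name `missing_distributions` is hypothetical, and the PyTorch 1.9+ version bound is not checked.

```python
import sys
from importlib import metadata

# Assumed PyPI distribution names for the requirements listed above.
REQUIRED_DISTRIBUTIONS = [
    "torch", "torchvision", "torchtext", "pytorch-fast-transformers",
    "munch", "einops", "timm", "hydra-core", "hydra-colorlog",
    "python-dotenv", "rich", "pytorch-lightning", "lightning-bolts",
]


def missing_distributions(names):
    """Return the subset of names with no installed distribution metadata."""
    missing = []
    for name in names:
        try:
            metadata.version(name)
        except metadata.PackageNotFoundError:
            missing.append(name)
    return missing


if __name__ == "__main__":
    if sys.version_info < (3, 8):
        print("Python 3.8+ is required")
    for name in missing_distributions(REQUIRED_DISTRIBUTIONS):
        print(f"missing: {name}")
```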

Citation

If you use this codebase, or otherwise find our work valuable, please cite:

@inproceedings{chen2021scatterbrain,
  title={Scatterbrain: Unifying Sparse and Low-rank Attention},
  author={Beidi Chen and Tri Dao and Eric Winsor and Zhao Song and Atri Rudra and Christopher R\'{e}},
  booktitle={Advances in Neural Information Processing Systems (NeurIPS)},
  year={2021}
}

Owner

HazyResearch
