Skyformer: Remodel Self-Attention with Gaussian Kernel and Nyström Method (NeurIPS 2021)

Last update: Sep 20, 2022

Skyformer

This repository is the official implementation of Skyformer: Remodel Self-Attention with Gaussian Kernel and Nyström Method (NeurIPS 2021).

Requirements

To install requirements in a conda environment:

conda create -n skyformer python=3.6
conda activate skyformer
pip install -r requirements.txt

Note: Specific requirements for data preprocessing are not included here.

Data Preparation

Processed files can be downloaded here, or generated with the following steps:

Requirements

tensorboard>=2.3.0
tensorflow>=2.3.1
tensorflow-datasets>=4.0.1

Download the TFDS files for pathfinder and then set _PATHFINER_TFDS_PATH to the unzipped directory (following https://github.com/google-research/long-range-arena/issues/11)
Download lra_release.gz (7.7 GB).
Unzip lra_release.gz and put the extracted files under ./data/.

cd data
wget https://storage.googleapis.com/long-range-arena/lra_release.gz
tar zxvf lra_release.gz

Create a directory lra_processed under ./data/.

mkdir lra_processed
cd ..

The directory structure should then be as follows (assuming the root dir is code):

./data/lra_processed
./data/long-range-arena-main
./data/lra_release
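Before running the preprocessing scripts, it can help to confirm that this layout is in place. The following is a minimal sketch, not part of the repository: the helper name `missing_data_dirs` is hypothetical, and the directory names are taken from the steps above (with `lra_processed` spelled as in the `mkdir` command).

```python
from pathlib import Path

# Directory names taken from the steps above; adjust if your layout differs.
EXPECTED_DIRS = ("lra_processed", "long-range-arena-main", "lra_release")


def missing_data_dirs(data_root="./data", expected=EXPECTED_DIRS):
    """Return the expected sub-directories that do not exist under data_root."""
    root = Path(data_root)
    return [name for name in expected if not (root / name).is_dir()]


if __name__ == "__main__":
    missing = missing_data_dirs()
    if missing:
        print("Missing under ./data:", ", ".join(missing))
    else:
        print("Data layout looks complete.")
```

Run it from the repository root so that the relative path ./data resolves correctly.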

Create train, dev, and test dataset pickle files for each task.

cd preprocess
python create_pathfinder.py
python create_listops.py
python create_retrieval.py
python create_text.py
python create_cifar10.py
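The five commands above can also be driven from a single script that stops at the first failure. This is only a sketch under stated assumptions: the helper names `preprocess_commands` and `run_preprocessing` are hypothetical, and it assumes each script runs without arguments from inside the preprocess directory, as the commands above suggest.

```python
import subprocess
import sys

# Script names copied from the commands above, in the same order.
PREPROCESS_SCRIPTS = [
    "create_pathfinder.py",
    "create_listops.py",
    "create_retrieval.py",
    "create_text.py",
    "create_cifar10.py",
]


def preprocess_commands(python=sys.executable):
    """Return one argument list per preprocessing script."""
    return [[python, script] for script in PREPROCESS_SCRIPTS]


def run_preprocessing(cwd="preprocess", dry_run=False):
    """Run each script in order; with dry_run=True only print the commands."""
    for cmd in preprocess_commands():
        print("Running:", " ".join(cmd))
        if not dry_run:
            # check=True raises CalledProcessError if a script exits non-zero,
            # so later scripts are not run after a failure.
            subprocess.run(cmd, cwd=cwd, check=True)
```

Calling `run_preprocessing()` from the repository root runs all five scripts; `run_preprocessing(dry_run=True)` only lists them.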

Note: most of the source code comes from the LRA repo.

Run

Modify the configuration in config.py and run:

python main.py --mode train --attn skyformer --task lra-text

mode: train, eval
attn: softmax, nystrom, linformer, reformer, performer, informer, bigbird, kernelized, skyformer
task: lra-listops, lra-pathfinder, lra-retrieval, lra-text, lra-image
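To run the same attention variant on every task, the command line above can be generated programmatically. The sketch below is an illustration only: the helper name `sweep_commands` is hypothetical, and it uses only the `--mode`, `--attn`, and `--task` flags and the task names listed above.

```python
# Task names copied from the list above.
TASKS = ["lra-listops", "lra-pathfinder", "lra-retrieval", "lra-text", "lra-image"]


def sweep_commands(attn="skyformer", mode="train", tasks=TASKS):
    """Build one main.py command line per task for a fixed attention type."""
    commands = []
    for task in tasks:
        argv = ["python", "main.py", "--mode", mode, "--attn", attn, "--task", task]
        # None of the arguments contain spaces, so a plain join is safe here.
        commands.append(" ".join(argv))
    return commands


if __name__ == "__main__":
    for line in sweep_commands():
        print(line)
```

The printed lines can be pasted into a shell or a job script; pass `mode="eval"` to generate the evaluation commands instead.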

Reference

@inproceedings{Skyformer,
    title={Skyformer: Remodel Self-Attention with Gaussian Kernel and Nystr\"om Method}, 
    author={Yifan Chen and Qi Zeng and Heng Ji and Yun Yang},
    booktitle={NeurIPS},
    year={2021}
}

Owner

Qi Zeng