Unofficial PyTorch implementation of Attention Free Transformer (AFT) layers by Apple Inc.

Last update: Dec 12, 2022

Overview

aft-pytorch

Unofficial PyTorch implementation of the layers from "An Attention Free Transformer" by Zhai et al. (Apple Inc.).

Installation

You can install aft-pytorch via pip:

pip install aft-pytorch

Usage

You can import the AFT-Full or AFT-Simple layer (as described in the paper) from the package like so:

`AFTFull`

import torch
from aft_pytorch import AFTFull

layer = AFTFull(
    max_seqlen=20,
    dim=512,
    hidden_dim=64
)

# a batch of 32 sequences, each with 10 timesteps of 512 features
x = torch.rand(32, 10, 512)
y = layer(x) # [32, 10, 512]

`AFTSimple`

import torch
from aft_pytorch import AFTSimple

layer = AFTSimple(
    max_seqlen=20,
    dim=512,
    hidden_dim=64
)

# a batch of 32 sequences, each with 10 timesteps of 512 features
x = torch.rand(32, 10, 512)
y = layer(x) # [32, 10, 512]
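For intuition about what AFT-Simple computes, here is a minimal pure-Python sketch of the formula from the paper, Y_t = sigmoid(Q_t) * sum over t' of softmax(K)_t' * V_t', with the softmax taken over the time axis and all products element-wise. It is an illustration under stated assumptions, not the code in this package: the function name `aft_simple` is hypothetical, and the learned linear projections that produce Q, K and V, as well as the output projection, are left out.

```python
import math


def aft_simple(q, k, v):
    """Illustrative AFT-Simple for one sequence.

    q, k, v are lists of T rows with d features each. This is a sketch of
    the paper's formula only: the learned projections that would produce
    q, k, v and the final output projection are omitted.
    """
    seq_len, dim = len(k), len(k[0])
    context = []
    for j in range(dim):
        # Softmax over time for feature j (shifted by the max for stability).
        col_max = max(k[t][j] for t in range(seq_len))
        weights = [math.exp(k[t][j] - col_max) for t in range(seq_len)]
        total = sum(weights)
        # Weighted average of the values for feature j.
        context.append(sum(w * v[t][j] for t, w in enumerate(weights)) / total)
    # Every position shares the same context; sigmoid(q_t) gates it per position.
    return [
        [context[j] / (1.0 + math.exp(-q_t[j])) for j in range(dim)]
        for q_t in q
    ]


# With all-zero keys the softmax is uniform, so the context is the mean of v.
y = aft_simple(
    q=[[0.0, 0.0], [0.0, 0.0]],
    k=[[0.0, 0.0], [0.0, 0.0]],
    v=[[2.0, 4.0], [4.0, 8.0]],
)
```

Because the pooled context does not depend on the position, the cost grows linearly with sequence length. AFT-Full differs by adding a learned pairwise position bias to the keys before the exponential, which makes the context position-dependent.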

These layers are plug-and-play with your existing networks and Transformers: you can swap a self-attention layer for one of the layers in this package with minimal changes.

TODO

Add full AFT architecture
Add variants such as AFTConv and AFTLocal

Contributing

If you like this repo, please leave a star! If you have any corrections or suggestions, feel free to open a PR or issue.

Credits

@misc{attention-free-transformer,
title = {An Attention Free Transformer},
author = {Shuangfei Zhai and Walter Talbott and Nitish Srivastava and Chen Huang and Hanlin Goh and Ruixiang Zhang and Josh Susskind},
year = {2021},
URL = {https://arxiv.org/pdf/2105.14103.pdf}
}

License

MIT

Owner

Rishabh Anand
