Unofficial PyTorch implementation of TokenLearner by Google AI

Last update: Dec 20, 2022

Overview

tokenlearner-pytorch

Unofficial PyTorch implementation of TokenLearner by Ryoo et al. from Google AI.

Installation

You can install TokenLearner via pip:

pip install tokenlearner-pytorch

Usage

The TokenLearner class is exported by the tokenlearner_pytorch package. As in the paper, the layer can be used with a Vision Transformer, MLP-Mixer, or Video Vision Transformer. It takes a channels-last tensor of shape [B, H, W, C] and returns S learned tokens of shape [B, S, C].

import torch
from tokenlearner_pytorch import TokenLearner

tklr = TokenLearner(S=8)  # S = number of learned tokens
x = torch.rand(512, 32, 32, 3)  # [B, H, W, C], channels last
y = tklr(x)  # [512, 8, 3]
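To make the shape change above concrete, here is a minimal sketch of the idea behind TokenLearner as described in the paper: each of the S tokens is produced by a learned spatial attention map that weights the input features before spatial average pooling. The class name TokenLearnerSketch and the single 1x1 convolution are assumptions made for illustration; the layer shipped in tokenlearner_pytorch may be structured differently.

```python
import torch
import torch.nn as nn


class TokenLearnerSketch(nn.Module):
    """Illustrative only: one attention map per learned token (hypothetical name)."""

    def __init__(self, C: int, S: int) -> None:
        super().__init__()
        # Assumption: a single 1x1 conv + sigmoid yields S spatial attention maps.
        self.attn = nn.Sequential(nn.Conv2d(C, S, kernel_size=1), nn.Sigmoid())

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        B, H, W, C = x.shape                      # channels-last input
        maps = self.attn(x.permute(0, 3, 1, 2))   # [B, S, H, W]
        maps = maps.flatten(2)                    # [B, S, H*W]
        feats = x.reshape(B, H * W, C)            # [B, H*W, C]
        # Attention-weighted sum over all positions, divided by H*W,
        # i.e. a spatial average of the weighted features.
        return torch.bmm(maps, feats) / (H * W)   # [B, S, C]


x = torch.rand(2, 32, 32, 3)
tokens = TokenLearnerSketch(C=3, S=8)(x)  # 1024 spatial positions reduced to 8 tokens
```

The point of the reduction is that any layer applied afterwards operates on S tokens instead of H*W positions.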

You can also use TokenLearner and TokenFuser together with Multi-head Self-Attention as done in the paper:

import torch
import torch.nn as nn
from tokenlearner_pytorch import TokenLearner, TokenFuser

mhsa = nn.MultiheadAttention(3, 1)
tklr = TokenLearner(S=8)
tkfr = TokenFuser(H=32, W=32, C=3, S=8)

x = torch.rand(512, 32, 32, 3) # a batch of images

y = tklr(x)
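y = y.transpose(0, 1)  # [8, 512, 3]; nn.MultiheadAttention expects (seq, batch, embed) by default
y, _ = mhsa(y, y, y)  # ignore attn weights
y = y.transpose(0, 1).contiguous()  # back to [512, 8, 3]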

out = tkfr(y, x)  # [512, 32, 32, 3]
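As an alternative to reordering dimensions by hand, nn.MultiheadAttention can be constructed with batch_first=True (available in PyTorch 1.9 and later, to my knowledge), so that it consumes [B, S, C] tensors directly. The sketch below uses a random tensor as a stand-in for the TokenLearner output and is not taken from this repository.

```python
import torch
import torch.nn as nn

# batch_first=True makes the layer accept (batch, seq, embed) tensors directly.
mhsa = nn.MultiheadAttention(embed_dim=3, num_heads=1, batch_first=True)

# Stand-in for TokenLearner output (assumption): [B, S, C] = [4, 8, 3].
tokens = torch.rand(4, 8, 3)

attended, _ = mhsa(tokens, tokens, tokens)  # ignore attn weights
```

The output keeps the [B, S, C] layout of the input, so it can be passed on without any further reshaping.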

TODO

Add support for temporal dimension T
Implement TokenFuser with ViT
Implement TokenFuser with ViViT

Contributions

If I've made any errors or you have any suggestions, feel free to open an issue or PR. All contributions are welcome!

License

MIT

Owner

Rishabh Anand