External Attention Network

Last update: Dec 11, 2022

Related tags

Deep Learning EANet

Overview

Beyond Self-attention: External Attention using Two Linear Layers for Visual Tasks

paper : https://arxiv.org/abs/2105.02358

EAMLP will come soon

Jittor code will come soon

Pascal VOC test result link

Pascal VOC pretrained model link

You can download the pretrained model and then run python test.py to reproduce the pascal voc test result.

Other implementation:

Pytorch : https://github.com/xmu-xiaoma666/External-Attention-pytorch

TODO

release jittor semantic segmentation code and checkpoint.
release torch semantic segmentation code and checkpoint.
release point cloud related code and checkpoint.
merge segmentation module into mmsegmentation to reproduce the ADE20K and Cityscapes dataset results.
merge PyTorch-StudioGAN to reproduce the GAN results.

Acknowledgments

We would like to sincerely thank HamNet_seg, EMANet_seg, openseg, T2T-ViT, mmsegmentation and PyTorch-StudioGAN for their awesome released code.

Astract

Attention mechanisms, especially self-attention, play an increasingly important role in deep feature representation in visual tasks. Self-attention updates the feature at each position by computing a weighted sum of features using pair-wise affinities across all positions to capture long-range dependency within a single sample. However, self-attention has a quadratic complexity and ignores potential correlation between different samples. This paper proposes a novel attention mechanism which we call external attention, based on two external, small, learnable, and shared memories, which can be implemented easily by simply using two cascaded linear layers and two normalization layers; it conveniently replaces self-attention in existing popular architectures. External attention has linear complexity and implicitly considers the correlations between all samples. Extensive experiments on image classification, semantic segmentation, image generation, point cloud classification and point cloud segmentation tasks reveal that our method provides comparable or superior performance to the self-attention mechanism and some of its variants, with much lower computational and memory costs.

Jittor

Jittor is a high-performance deep learning framework which is easy to learn and use. It provides interfaces like Pytorch.

You can learn how to use Jittor in following links:

Jittor homepage: https://cg.cs.tsinghua.edu.cn/jittor/

Jittor github: https://github.com/Jittor/jittor

If you has any questions about Jittor, you can ask in Jittor developer QQ Group: 761222083

Citation

If it is helpful for your work, please cite this paper:

@misc{guo2021attention,
      title={Beyond Self-attention: External Attention using Two Linear Layers for Visual Tasks}, 
      author={Meng-Hao Guo and Zheng-Ning Liu and Tai-Jiang Mu and Shi-Min Hu},
      year={2021},
      eprint={2105.02358},
      archivePrefix={arXiv},
      primaryClass={cs.CV}
}

External Attention Network

Related tags

Overview

Beyond Self-attention: External Attention using Two Linear Layers for Visual Tasks

EAMLP will come soon

Jittor code will come soon

Pascal VOC test result link

Pascal VOC pretrained model link

Other implementation:

TODO

Acknowledgments

Astract

Jittor

Citation

Owner

MenghaoGuo

MagFace: A Universal Representation for Face Recognition and Quality Assessment

Computer vision - fun segmentation experience using classic and deep tools :)

HPRNet: Hierarchical Point Regression for Whole-Body Human Pose Estimation

Selene is a Python library and command line interface for training deep neural networks from biological sequence data such as genomes.

A Neural Net Training Interface on TensorFlow, with focus on speed + flexibility

Bayesian Meta-Learning Through Variational Gaussian Processes

[ICCV 2021 Oral] Deep Evidential Action Recognition

Protect against subdomain takeover

Official implementation of the paper "Light Field Networks: Neural Scene Representations with Single-Evaluation Rendering"

Deeprl - Standard DQN and dueling network for simple games

Unofficial TensorFlow implementation of Protein Interface Prediction using Graph Convolutional Networks.

The repository includes the code for training cell counting applications. (Keras + Tensorflow)

Neural Dynamic Policies for End-to-End Sensorimotor Learning

Anti-UAV base on PaddleDetection

[ICCV 2021] A Simple Baseline for Semi-supervised Semantic Segmentation with Strong Data Augmentation

Official code for paper "Optimization for Oriented Object Detection via Representation Invariance Loss".

This Deep Learning Model Predicts that from which disease you are suffering.

Animation of solving the traveling salesman problem to optimality using mixed-integer programming and iteratively eliminating sub tours

Official PyTorch implementation of DD3D: Is Pseudo-Lidar needed for Monocular 3D Object detection? (ICCV 2021), Dennis Park, Rares Ambrus, Vitor Guizilini, Jie Li, and Adrien Gaidon.

A naive ROS interface for visualDet3D.

External Attention Network

Related tags

Overview

Beyond Self-attention: External Attention using Two Linear Layers for Visual Tasks

EAMLP will come soon

Jittor code will come soon

Pascal VOC test result link

Pascal VOC pretrained model link

Other implementation:

TODO

Acknowledgments

Astract

Jittor

Citation

Owner

MenghaoGuo

MagFace: A Universal Representation for Face Recognition and Quality Assessment

Computer vision - fun segmentation experience using classic and deep tools :)

HPRNet: Hierarchical Point Regression for Whole-Body Human Pose Estimation

Selene is a Python library and command line interface for training deep neural networks from biological sequence data such as genomes.

A Neural Net Training Interface on TensorFlow, with focus on speed + flexibility

Bayesian Meta-Learning Through Variational Gaussian Processes

[ICCV 2021 Oral] Deep Evidential Action Recognition

Protect against subdomain takeover

Official implementation of the paper "Light Field Networks: Neural Scene Representations with Single-Evaluation Rendering"

Deeprl - Standard DQN and dueling network for simple games

Unofficial TensorFlow implementation of Protein Interface Prediction using Graph Convolutional Networks.

The repository includes the code for training cell counting applications. (Keras + Tensorflow)

Neural Dynamic Policies for End-to-End Sensorimotor Learning

Anti-UAV base on PaddleDetection

[ICCV 2021] A Simple Baseline for Semi-supervised Semantic Segmentation with Strong Data Augmentation

Official code for paper "Optimization for Oriented Object Detection via Representation Invariance Loss".

This Deep Learning Model Predicts that from which disease you are suffering.

Animation of solving the traveling salesman problem to optimality using mixed-integer programming and iteratively eliminating sub tours

Official PyTorch implementation of DD3D: Is Pseudo-Lidar needed for Monocular 3D Object detection? (ICCV 2021), Dennis Park*, Rares Ambrus*, Vitor Guizilini, Jie Li, and Adrien Gaidon.

A naive ROS interface for visualDet3D.

Official PyTorch implementation of DD3D: Is Pseudo-Lidar needed for Monocular 3D Object detection? (ICCV 2021), Dennis Park, Rares Ambrus, Vitor Guizilini, Jie Li, and Adrien Gaidon.