This repository provides the official implementation of 'Learning to ignore: rethinking attention in CNNs' accepted in BMVC 2021.

Last update: Jul 08, 2022

Overview

inverse_attention

This repository provides the official implementation of 'Learning to ignore: rethinking attention in CNNs' accepted in BMVC 2021.

Learning to ignore: rethinking attention in CNNs

Abstract:

Recently, there has been an increasing interest in applying attention mechanisms in Convolutional Neural Networks (CNNs) to solve computer vision tasks. Most of these methods learn to explicitly identify and highlight relevant parts of the scene and pass the attended image to further layers of the network. In this paper, we argue that such an approach might not be optimal. Arguably, explicitly learning which parts of the image are relevant is typically harder than learning which parts of the image are less relevant and, thus, should be ignored. In fact, in vision domain, there are many easy-to-identify patterns of irrelevant features. For example, image regions close to the borders are less likely to contain useful information for a classification task. Based on this idea, we propose to reformulate the attention mechanism in CNNs to learn to ignore instead of learning to attend. Specifically, we propose to explicitly learn irrelevant information in the scene and suppress it in the produced representation, keeping only important attributes. This implicit attention scheme can be incorporated into any existing attention mechanism. In this work, we validate this idea using two recent attention methods Squeeze and Excitation (SE) block and Convolutional Block Attention Module (CBAM). Experimental results on different datasets and model architectures show that learning to ignore, i.e., implicit attention, yields superior performance compared to the standard approaches.

Dependencies

The project was tested in Python 3 and Tensorflow 2. Run pip install -r requirements.txt to install dependent packages. Parts of the code are based on 'CBAM-keras'.

Running the code:

To test our approach on ImageNet, run main_imagenet.py. You need to: 1/ specify dataset_dir the TF-record directory of the dataset. 2/ choose the attention model to use, i.e., attention_module.

To test our approach on CIFAR10 or CIFAR100, run main_CIFAR.py. You need to: 1/ specify dataset and num_classes 2/ choose the attention model to use, i.e., attention_module.

Cite This Work

@article{laakom2021learning,
  title={Learning to ignore: rethinking attention in CNNs},
  author={Laakom, Firas and Chumachenko, Kateryna and Raitoharju, Jenni and Iosifidis, Alexandros and Gabbouj, Moncef},
  journal={arXiv preprint arXiv:2111.05684},
  year={2021}
}

This repository provides the official implementation of 'Learning to ignore: rethinking attention in CNNs' accepted in BMVC 2021.

Related tags

Overview

inverse_attention

Learning to ignore: rethinking attention in CNNs

Dependencies

Running the code:

Cite This Work

Owner

Firas Laakom

DeepGNN is a framework for training machine learning models on large scale graph data.

HPRNet: Hierarchical Point Regression for Whole-Body Human Pose Estimation

AI创造营：Metaverse启动机之重构现世，结合PaddlePaddle 和 Wechaty 创造自己的聊天机器人

TF2 implementation of knowledge distillation using the "function matching" hypothesis from the paper Knowledge distillation: A good teacher is patient and consistent by Beyer et al.

This repository implements and evaluates convolutional networks on the Möbius strip as toy model instantiations of Coordinate Independent Convolutional Networks.

Speech Recognition is an important feature in several applications used such as home automation, artificial intelligence

1st Solution For NeurIPS 2021 Competition on ML4CO Dual Task

This is a Pytorch implementation of paper: DropEdge: Towards Deep Graph Convolutional Networks on Node Classification

Disentangled Cycle Consistency for Highly-realistic Virtual Try-On, CVPR 2021

Tensorflow implementation of soft-attention mechanism for video caption generation.

A repo to show how to use custom dataset to train s2anet, and change backbone to resnext101

Implementation of Bagging and AdaBoost Algorithm

[ICML 2021] A fast algorithm for fitting robust decision trees.

Invasive Plant Species Identification

Exploring Versatile Prior for Human Motion via Motion Frequency Guidance (3DV2021)

Easy to use Audio Tagging in PyTorch

implementation of paper - You Only Learn One Representation: Unified Network for Multiple Tasks

FcaNet: Frequency Channel Attention Networks

To propose and implement a multi-class classification approach to disaster assessment from the given data set of post-earthquake satellite imagery.

This is the official implementation of 3D-CVF: Generating Joint Camera and LiDAR Features Using Cross-View Spatial Feature Fusion for 3D Object Detection, built on SECOND.

This repository provides the official implementation of 'Learning to ignore: rethinking attention in CNNs' accepted in BMVC 2021.

Related tags

Overview

inverse_attention

Learning to ignore: rethinking attention in CNNs

Dependencies

Running the code:

Cite This Work

Owner

Firas Laakom

DeepGNN is a framework for training machine learning models on large scale graph data.

HPRNet: Hierarchical Point Regression for Whole-Body Human Pose Estimation

AI创造营 ：Metaverse启动机之重构现世，结合PaddlePaddle 和 Wechaty 创造自己的聊天机器人

TF2 implementation of knowledge distillation using the "function matching" hypothesis from the paper Knowledge distillation: A good teacher is patient and consistent by Beyer et al.

This repository implements and evaluates convolutional networks on the Möbius strip as toy model instantiations of Coordinate Independent Convolutional Networks.

Speech Recognition is an important feature in several applications used such as home automation, artificial intelligence

1st Solution For NeurIPS 2021 Competition on ML4CO Dual Task

This is a Pytorch implementation of paper: DropEdge: Towards Deep Graph Convolutional Networks on Node Classification

Disentangled Cycle Consistency for Highly-realistic Virtual Try-On, CVPR 2021

Tensorflow implementation of soft-attention mechanism for video caption generation.

A repo to show how to use custom dataset to train s2anet, and change backbone to resnext101

Implementation of Bagging and AdaBoost Algorithm

[ICML 2021] A fast algorithm for fitting robust decision trees.

Invasive Plant Species Identification

Exploring Versatile Prior for Human Motion via Motion Frequency Guidance (3DV2021)

Easy to use Audio Tagging in PyTorch

implementation of paper - You Only Learn One Representation: Unified Network for Multiple Tasks

FcaNet: Frequency Channel Attention Networks

To propose and implement a multi-class classification approach to disaster assessment from the given data set of post-earthquake satellite imagery.

This is the official implementation of 3D-CVF: Generating Joint Camera and LiDAR Features Using Cross-View Spatial Feature Fusion for 3D Object Detection, built on SECOND.

AI创造营：Metaverse启动机之重构现世，结合PaddlePaddle 和 Wechaty 创造自己的聊天机器人