Fast Axiomatic Attribution for Neural Networks (NeurIPS*2021)

Last update: Nov 21, 2022

Overview

Fast Axiomatic Attribution for Neural Networks

This is the official repository accompanying the NeurIPS 2021 paper:

R. Hesse, S. Schaub-Meyer, and S. Roth. Fast axiomatic attribution for neural networks. NeurIPS, 2021, to appear.

Paper | Preprint (arXiv) | Project Page | Video

The repository contains:

Pre-trained $\mathcal{X}$ -DNN (X-DNN) variants of popular image classification models obtained by removing the bias term of each layer
Detailed information on how to easily compute axiomatic attributions in closed form for your own project
PyTorch code to reproduce the main experiments in the paper

Pretrained Models

Removing the bias from different image classification models has a surpringly minor impact on the predictive accuracy of the models while allowing to efficiently compute axiomatic attributions. Results of popular models with and without bias term (regular vs. X-) on the ImageNet validation split are:

Model	Top-5 Accuracy	Download
AlexNet	79.21	alexnet_model_best.pth.tar
X-AlexNet	78.54	xalexnet_model_best.pth.tar
VGG16	90.44	vgg16_model_best.pth.tar
X-VGG16	90.25	xvgg16_model_best.pth.tar
ResNet-50	92.56	fixup_resnet50_model_best.pth.tar
X-ResNet-50	91.12	xfixup_resnet50_model_best.pth.tar

Using X-Gradient in Your Own Project

In the following we illustrate how to efficiently compute axiomatic attributions for X-DNNs. For a detailed example please see demo.ipynb.

First, make sure that requires_grad of your input is set to True and run a forward pass:

inputs.requires_grad = True

# forward pass
outputs = model(inputs)

Next, you can compute X-Gradient via:

# compute attribution
target_outputs = torch.gather(outputs, 1, target.unsqueeze(-1))
gradients = torch.autograd.grad(torch.unbind(target_outputs), inputs, create_graph=True)[0] # set to false if attribution is only used for evaluation
xgradient_attributions = inputs * gradients

If the attribution is only used for evaluation you can set create_graph to False. If you want to use the attribution for training, e.g., for training with attribution priors, you can define attribution_prior() and update the weights of your model:

loss1 = criterion(outputs, target) # standard loss
loss2 = attribution_prior(xgradient_attributions) # attribution prior    

loss = loss1 + lambda * loss2 # set weighting factor for loss2

optimizer.zero_grad()
loss.backward()
optimizer.step()

Reproducing Experiments

The code and a README with detailed instructions on how to reproduce the results from experiments in Sec 4.1, Sec 4.2, and Sec 4.4. of our paper can be found in the imagenet folder. To reproduce the results from the experiment in Sec 4.3. please refer to the sparsity folder.

Prerequisites

Clone the repository: git clone https://github.com/visinf/fast-axiomatic-attribution.git
Set up environment
- add the required conda channels and create new environment:
- conda config --add channels pytorch
- conda config --add channels anaconda
- conda config --add channels pipy
- conda config --add channels conda-forge
- conda create --name fast-axiomatic-attribution --file requirements.txt
download ImageNet (ILSVRC2012)

Acknowledgments

We would like to thank the contributors of the following repositories for using parts of their publicly available code:

Citation

If you find our work helpful please consider citing

@inproceedings{Hesse:2021:FAA,
  title     = {Fast Axiomatic Attribution for Neural Networks},
  author    = {Hesse, Robin and Schaub-Meyer, Simone and Roth, Stefan},
  booktitle = {Advances in Neural Information Processing Systems (NeurIPS)},
  volume    = {34},
  year      = {2021}
}

Fast Axiomatic Attribution for Neural Networks (NeurIPS*2021)

Related tags

Overview

Fast Axiomatic Attribution for Neural Networks

Pretrained Models

Using X-Gradient in Your Own Project

Reproducing Experiments

Prerequisites

Acknowledgments

Citation

Owner

Visual Inference Lab @TU Darmstadt

Implementation of BI-RADS-BERT & The Advantages of Section Tokenization.

Deeply Supervised, Layer-wise Prediction-aware (DSLP) Transformer for Non-autoregressive Neural Machine Translation

MNIST, but with Bezier curves instead of pixels

QilingLab challenge writeup

🍷 Gracefully claim weekly free games and monthly content from Epic Store.

基于Pytorch实现优秀的自然图像分割框架！(包括FCN、U-Net和Deeplab)

Computer-Vision-Paper-Reviews - Computer Vision Paper Reviews with Key Summary along Papers & Codes

Python and Julia in harmony.

TransCD: Scene Change Detection via Transformer-based Architecture

Pytorch Performace Tuning, WandB, AMP, Multi-GPU, TensorRT, Triton

Motion Reconstruction Code and Data for Skills from Videos (SFV)

FinRL-Meta: A Universe for Data-Driven Financial Reinforcement Learning. 🔥

BOVText: A Large-Scale, Multidimensional Multilingual Dataset for Video Text Spotting

TensorFlow implementation of the paper "Hierarchical Attention Networks for Document Classification"

Curating a dataset for bioimage transfer learning

Yoga - Yoga asana classifier for python

GradAttack is a Python library for easy evaluation of privacy risks in public gradients in Federated Learning

Code for the submitted paper Surrogate-based cross-correlation for particle image velocimetry

Kroomsa: A search engine for the curious

Feedback is important: response-aware feedback mechanism for background based conversation

Fast Axiomatic Attribution for Neural Networks (NeurIPS*2021)

Related tags

Overview

Fast Axiomatic Attribution for Neural Networks

Pretrained Models

Using X-Gradient in Your Own Project

Reproducing Experiments

Prerequisites

Acknowledgments

Citation

Owner

Visual Inference Lab @TU Darmstadt

Implementation of BI-RADS-BERT & The Advantages of Section Tokenization.

Deeply Supervised, Layer-wise Prediction-aware (DSLP) Transformer for Non-autoregressive Neural Machine Translation

MNIST, but with Bezier curves instead of pixels

QilingLab challenge writeup

🍷 Gracefully claim weekly free games and monthly content from Epic Store.

基于Pytorch实现优秀的自然图像分割框架！(包括FCN、U-Net和Deeplab)

Computer-Vision-Paper-Reviews - Computer Vision Paper Reviews with Key Summary along Papers & Codes

Python and Julia in harmony.

TransCD: Scene Change Detection via Transformer-based Architecture

Pytorch Performace Tuning, WandB, AMP, Multi-GPU, TensorRT, Triton

Motion Reconstruction Code and Data for Skills from Videos (SFV)

FinRL­-Meta: A Universe for Data­-Driven Financial Reinforcement Learning. 🔥

BOVText: A Large-Scale, Multidimensional Multilingual Dataset for Video Text Spotting

TensorFlow implementation of the paper "Hierarchical Attention Networks for Document Classification"

Curating a dataset for bioimage transfer learning

Yoga - Yoga asana classifier for python

GradAttack is a Python library for easy evaluation of privacy risks in public gradients in Federated Learning

Code for the submitted paper Surrogate-based cross-correlation for particle image velocimetry

Kroomsa: A search engine for the curious

Feedback is important: response-aware feedback mechanism for background based conversation

FinRL-Meta: A Universe for Data-Driven Financial Reinforcement Learning. 🔥