Discretized Integrated Gradients for Explaining Language Models (EMNLP 2021)

Last update: Oct 27, 2022

Related tags

Overview

Discretized Integrated Gradients for Explaining Language Models (EMNLP 2021)

Overview of paths used in DIG and IG. w is the word being attributed. The gray region is the neighborhood of w. Green line depicts the straight-line path from w to w' used by IG and the green squares are the corresponding interpolation points. Left: In DIG-Greedy, we first monotonize each word in the neighborhood (red arrow). Then the word closest to its corresponding monotonic point is selected as the anchor (blue line to w_5 since the red arrow of w_5 has the shortest magnitude). Right: In DIG-MaxCount we first count the number of monotonic dimensions for each word in the neighborhood (shown in [.] above). Then, the word with the highest number of monotonic dimensions is selected as the anchor word (blue line to w_4), followed by changing the non-monotonic dimensions of w_4 (red line to c). Repeating this step gives the zigzag blue path. Finally, the red stars are the interpolated points used by our method. Please refer to the paper for more details.

Dependencies

Dependencies can be installed using requirements.txt.

Evaluating DIG:

Install all the requirements from requirements.txt.
Execute ./setup.sh for setting up the folder hierarchy for experiments.

Commands for reproducing the reported results on DistilBERT fine-tuned on SST2:

# Generate the KNN graph
python knn.py -dataset sst2 -nn distilbert

# DIG (strategy: Greedy)
python main.py -dataset sst2 -nn distilbert -strategy greedy

# DIG (strategy: MaxCount)
python main.py -dataset sst2 -nn distilbert -strategy maxcount

Similarly, commands can be changed for other settings.

Please contact Soumya for any clarifications or suggestions.

Discretized Integrated Gradients for Explaining Language Models (EMNLP 2021)

Related tags

Overview

Discretized Integrated Gradients for Explaining Language Models (EMNLP 2021)

Dependencies

Evaluating DIG:

Owner

INK Lab @ USC

This repo is duplication of jwyang/faster-rcnn.pytorch

For auto aligning, cropping, and scaling HR and LR images for training image based neural networks

Implementation of ICCV21 paper: PnP-DETR: Towards Efficient Visual Analysis with Transformers

Hierarchical Clustering: O(1)-Approximation for Well-Clustered Graphs

'Solving the sampling problem of the Sycamore quantum supremacy circuits

Fast Neural Style for Image Style Transform by Pytorch

Pytorch implementation of the paper Improving Text-to-Image Synthesis Using Contrastive Learning

The official implementation for ACL 2021 "Challenges in Information Seeking QA: Unanswerable Questions and Paragraph Retrieval".

Quantile Regression DQN a Minimal Working Example, Distributional Reinforcement Learning with Quantile Regression

Collaborative forensic timeline analysis

QSYM: A Practical Concolic Execution Engine Tailored for Hybrid Fuzzing

A module that used for encrypt code which includes RSA and AES

Performant, differentiable reinforcement learning

Turi Create simplifies the development of custom machine learning models.

This is the repository for The Machine Learning Workshops, published by AI DOJO

3D position tracking for soccer players with multi-camera videos

2nd solution of ICDAR 2021 Competition on Scientific Literature Parsing, Task B.

Dogs classification with Deep Metric Learning using some popular losses

Simple streamlit app to demonstrate HERE Tour Planning

A python library for implementing a recommender system