Code for the paper on layerwise detection of linguistic anomalies (ACL 2021)

Last update: Dec 07, 2022

Layerwise Anomaly

This repository contains the source code and data for our ACL 2021 paper: "How is BERT surprised? Layerwise detection of linguistic anomalies" by Bai Li, Zining Zhu, Guillaume Thomas, Yang Xu, and Frank Rudzicz.

Citation

If you use our work in your research, please cite:

Li, B., Zhu, Z., Thomas, G., Xu, Y., and Rudzicz, F. (2021) How is BERT surprised? Layerwise detection of linguistic anomalies. In Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics (ACL).

@inproceedings{li2021layerwise,
  author = "Li, Bai and Zhu, Zining and Thomas, Guillaume and Xu, Yang and Rudzicz, Frank",
  title = "How is BERT surprised? Layerwise detection of linguistic anomalies",
  booktitle = "Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics (ACL)",
  publisher = "Association for Computational Linguistics",
  year = "2021",
}

Dependencies

The project was developed with the following library versions. Running with other versions may crash or produce incorrect results.

Python 3.7.5
CUDA Version: 11.0
torch==1.7.1
transformers==4.5.1
numpy==1.19.0
pandas==0.25.3
scikit-learn==0.22
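
Since mismatched versions may crash or give incorrect results, it can help to compare the installed packages against the pins above before running anything. The following is a minimal sketch, not part of the repository: the names `PINNED`, `installed_version`, and `find_mismatches` are hypothetical, and it assumes `importlib.metadata` is available (Python 3.8+) or that the `importlib_metadata` backport is installed on Python 3.7.

```python
try:
    from importlib.metadata import PackageNotFoundError, version  # Python 3.8+
except ImportError:
    # Assumption: on Python 3.7 the importlib_metadata backport is installed.
    from importlib_metadata import PackageNotFoundError, version

# Library pins copied from the Dependencies list above.
PINNED = {
    "torch": "1.7.1",
    "transformers": "4.5.1",
    "numpy": "1.19.0",
    "pandas": "0.25.3",
    "scikit-learn": "0.22",
}


def installed_version(package):
    """Return the installed version string, or None if the package is missing."""
    try:
        return version(package)
    except PackageNotFoundError:
        return None


def find_mismatches(pinned, lookup=installed_version):
    """Map each mismatched package to a (wanted, found) tuple."""
    mismatches = {}
    for package, wanted in pinned.items():
        found = lookup(package)
        # Drop a local build suffix such as "+cu110" before comparing.
        normalized = found.split("+")[0] if found is not None else None
        if normalized != wanted:
            mismatches[package] = (wanted, found)
    return mismatches


if __name__ == "__main__":
    for package, (wanted, found) in find_mismatches(PINNED).items():
        print(f"{package}: expected {wanted}, found {found or 'not installed'}")
```

An empty result means every pinned library matches. The Python and CUDA versions are not checked by this sketch.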

Setup Instructions

1. Clone this repo: git clone https://github.com/SPOClab-ca/layerwise-anomaly
2. Download BNC Baby (4m word sample) from this link and extract into data/bnc/
3. Run BNC preprocessing script: python scripts/process_bnc.py --bnc_dir=data/bnc/download/Texts --to=data/bnc.pkl
4. Clone BLiMP repo: cd data && git clone https://github.com/alexwarstadt/blimp
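
After the BLiMP clone, a quick way to confirm the data is in place is to read the minimal pairs directly. The sketch below is illustrative and not taken from this repository: it assumes the BLiMP `data/` directory holds one `.jsonl` file per paradigm, with `sentence_good` and `sentence_bad` fields on each line, and the function names are hypothetical.

```python
import json
from pathlib import Path


def read_blimp_pairs(jsonl_path):
    """Read one BLiMP paradigm file into a list of (good, bad) sentence pairs."""
    pairs = []
    with open(jsonl_path, encoding="utf-8") as f:
        for line in f:
            line = line.strip()
            if not line:
                continue  # tolerate blank lines
            record = json.loads(line)
            pairs.append((record["sentence_good"], record["sentence_bad"]))
    return pairs


def read_all_blimp(blimp_dir):
    """Map each paradigm name (the file stem) to its list of sentence pairs."""
    return {
        path.stem: read_blimp_pairs(path)
        for path in sorted(Path(blimp_dir).glob("*.jsonl"))
    }
```

For example, `read_all_blimp("data/blimp/data/")` would return a dictionary keyed by paradigm name, using the same path passed as `--blimp_path` in the experiments.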

GMM experiments on BLiMP (Figure 2 and Appendix A)

PYTHONPATH=. time python scripts/blimp_anomaly.py \
  --bnc_path=data/bnc.pkl \
  --blimp_path=data/blimp/data/ \
  --out=blimp_result
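
To give a rough idea of density-based anomaly scoring, the sketch below fits a single full-covariance Gaussian (the one-component special case of a GMM) to a set of vectors and scores new vectors by their negative log-likelihood. It is a simplified illustration under stated assumptions, not the code in scripts/blimp_anomaly.py: the vectors are random stand-ins for model hidden states, and the names `fit_gaussian` and `surprisal` are hypothetical.

```python
import numpy as np


def fit_gaussian(vectors):
    """Estimate the mean and a lightly regularized covariance from row vectors."""
    mean = vectors.mean(axis=0)
    cov = np.cov(vectors, rowvar=False)
    cov = cov + 1e-6 * np.eye(vectors.shape[1])  # keep the matrix invertible
    return mean, cov


def surprisal(vectors, mean, cov):
    """Negative Gaussian log-likelihood of each row; higher means more anomalous."""
    dim = mean.shape[0]
    diff = vectors - mean
    # Solve cov @ x = diff^T instead of forming an explicit inverse.
    solved = np.linalg.solve(cov, diff.T).T
    mahalanobis_sq = np.sum(diff * solved, axis=1)
    _, logdet = np.linalg.slogdet(cov)
    log_prob = -0.5 * (mahalanobis_sq + logdet + dim * np.log(2 * np.pi))
    return -log_prob


# Synthetic stand-in for hidden states of in-domain training tokens.
rng = np.random.default_rng(0)
train = rng.normal(size=(500, 8))
mean, cov = fit_gaussian(train)

typical = np.zeros((1, 8))        # close to the training distribution
anomalous = np.full((1, 8), 6.0)  # far from the training distribution
typical_score = surprisal(typical, mean, cov)[0]
anomalous_score = surprisal(anomalous, mean, cov)[0]
```

In this toy setting the far-away vector receives the larger score. A layerwise analysis would presumably repeat the fit once per layer on that layer's hidden states.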

Frequency correlation (Figure 3 and Appendix B)

Run the notebooks/FreqSurprisal.ipynb notebook.

Surprisal gap experiments (Figure 4)

PYTHONPATH=. time python scripts/run_surprisal_gaps.py \
  --bnc_path=data/bnc.pkl \
  --out=surprisal_gaps

Accuracy scores (Table 2)

PYTHONPATH=. time python scripts/run_accuracy.py \
  --model_name=roberta-base \
  --anomaly_model=gmm
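
Because BLiMP consists of minimal pairs, one natural accuracy measure is the fraction of pairs in which the unacceptable sentence receives the higher anomaly score. The sketch below assumes that definition; it may not match scripts/run_accuracy.py exactly, and `pairwise_accuracy` is a hypothetical name.

```python
def pairwise_accuracy(good_scores, bad_scores):
    """Fraction of pairs where the bad sentence scores strictly higher.

    good_scores[i] and bad_scores[i] are anomaly scores for the acceptable
    and unacceptable sentence of the i-th minimal pair. Ties count as errors.
    """
    if len(good_scores) != len(bad_scores):
        raise ValueError("score lists must have the same length")
    if not good_scores:
        raise ValueError("at least one pair is required")
    correct = sum(1 for good, bad in zip(good_scores, bad_scores) if bad > good)
    return correct / len(good_scores)


# Toy scores: the second pair is ranked the wrong way round.
accuracy = pairwise_accuracy([1.0, 2.0, 3.0, 4.0], [2.0, 1.0, 5.0, 6.0])
```

With these toy scores, three of the four pairs are ordered correctly.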

Run unit tests

PYTHONPATH=. pytest tests
