PyTorch implementation of DCT fast weight RNNs

Last update: Dec 24, 2022

Overview

DCT based fast weights

This repository contains the official code for the paper: Training and Generating Neural Networks in Compressed Weight Space.

The main code includes:

DCT LSTM: LSTMs whose weights are encoded by discrete cosine transform (DCT).
DCT fast weight RNN: RNNs whose weights are encoded by DCT, and the DCT coefficients are parameterized by LSTMs.

The language modeling experiments reported in the paper were produced by porting code (with minor changes due to some clean-up) of this repository in a fork of this toolkit.

Requirements

torch_dct (can be installed via pip install torch_dct)
PyTorch with a version compatible with torch_dct.

Our experiments were conducted using PyTorch version 1.6.0 . More recent versions are apparently not compatible with torch_dct (at least at the time of writing this file). We recommend to run python custom_layer.py to check the compatibility.

References

If you make use of this toolkit for your experiments, please cite:

@inproceedings{irie2021training,
  title={Training and Generating Neural Networks in Compressed Weight Space},
  author={Kazuki Irie and J{\"u}rgen Schmidhuber},
  booktitle={Neural Compression: From Information Theory to Applications -- Workshop @ ICLR 2021},
  year={2021},
  address={Virtual only},
  month=may
}

PyTorch implementation of DCT fast weight RNNs

Related tags

Overview

DCT based fast weights

Requirements

References

Owner

Kazuki Irie

Beyond imagenet attack (accepted by ICLR 2022) towards crafting adversarial examples for black-box domains.

Colossal-AI: A Unified Deep Learning System for Large-Scale Parallel Training

Explainable Medical ImageSegmentation via GenerativeAdversarial Networks andLayer-wise Relevance Propagation

Attention-guided gan for synthesizing IR images

Subpopulation detection in high-dimensional single-cell data

Block-wisely Supervised Neural Architecture Search with Knowledge Distillation (CVPR 2020)

Grad2Task: Improved Few-shot Text Classification Using Gradients for Task Representation

Learning Visual Words for Weakly-Supervised Semantic Segmentation

Vector AI — A platform for building vector based applications. Encode, query and analyse data using vectors.

Official implementation of "SinIR: Efficient General Image Manipulation with Single Image Reconstruction" (ICML 2021)

Combine Tacotron2 and Hifi GAN to generate speech from text

Semi-Supervised Semantic Segmentation via Adaptive Equalization Learning, NeurIPS 2021 (Spotlight)

Pytorch implementation of Make-A-Scene: Scene-Based Text-to-Image Generation with Human Priors

noisy labels; missing labels; semi-supervised learning; entropy; uncertainty; robustness and generalisation.

Pytorch code for semantic segmentation using ERFNet

This codebase is the official implementation of Test-Time Classifier Adjustment Module for Model-Agnostic Domain Generalization (NeurIPS2021, Spotlight)

Official code for: A Probabilistic Hard Attention Model For Sequentially Observed Scenes

Animal Sound Classification (Cats Vrs Dogs Audio Sentiment Classification)

PIXIE: Collaborative Regression of Expressive Bodies

ViSD4SA, a Vietnamese Span Detection for Aspect-based sentiment analysis dataset