Official NumPy Implementation of Deep Networks from the Principle of Rate Reduction (2021)

Overview

Deep Networks from the Principle of Rate Reduction

This repository is the official NumPy implementation of the paper Deep Networks from the Principle of Rate Reduction (2021) by Kwan Ho Ryan Chan* (UC Berkeley), Yaodong Yu* (UC Berkeley), Chong You* (UC Berkeley), Haozhi Qi (UC Berkeley), John Wright (Columbia), and Yi Ma (UC Berkeley). For PyTorch version of ReduNet, please visit https://github.com/ryanchankh/redunet.

What is ReduNet?

ReduNet is a deep neural network constructed naturally by deriving the gradients of the Maximal Coding Rate Reduction (MCR2) [1] objective. Every layer of this network can be interpreted based on its mathematical operations, and the network as a whole is trained in a purely feed-forward manner. In addition, by imposing shift-invariance properties on our network, the convolutional operator can be derived using only the data and the MCR2 objective function, making our network design principled and interpretable.


Figure: Weights and operations for one layer of ReduNet

[1] Yu, Yaodong, Kwan Ho Ryan Chan, Chong You, Chaobing Song, and Yi Ma. "Learning diverse and discriminative representations via the principle of maximal coding rate reduction." Advances in Neural Information Processing Systems 33 (2020).
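
For concreteness, below is a minimal NumPy sketch of one such layer, assuming the expansion/compression formulation of [1]: the operator E encourages all features jointly to span a large volume, each operator C_j compresses the features of class j toward a low-dimensional subspace, and their weighted difference gives the gradient step applied to the features. The function name, arguments, and defaults are illustrative (chosen to match the --eta and --eps flags used below); this is not the repository's exact implementation.

import numpy as np

def redunet_layer(Z, Pi, eta=0.5, eps=0.1):
    """Illustrative ReduNet layer: one gradient-ascent step on the MCR^2
    objective, followed by projection of each feature back onto the unit sphere.
    Z  : (d, m) feature matrix, columns are unit-norm samples
    Pi : list of (m, m) diagonal class-membership matrices
    """
    d, m = Z.shape
    I = np.eye(d)

    # Expansion operator E: pushes all features to span a large volume
    alpha = d / (m * eps ** 2)
    E = alpha * np.linalg.inv(I + alpha * Z @ Z.T)
    grad = E @ Z

    # Compression operators C_j: compress each class toward its own subspace
    for Pi_j in Pi:
        m_j = np.trace(Pi_j)
        alpha_j = d / (m_j * eps ** 2)
        C_j = alpha_j * np.linalg.inv(I + alpha_j * Z @ Pi_j @ Z.T)
        grad -= (m_j / m) * C_j @ Z @ Pi_j

    # Gradient step with step size eta, then re-normalize the features
    Z_new = Z + eta * grad
    return Z_new / np.linalg.norm(Z_new, axis=0, keepdims=True)

Stacking many such layers (the --layers flag below) yields the full feed-forward network; E and the C_j play the role of the layer "weights" shown in the figure above.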

Requirements

This codebase is written for Python 3. To install the necessary Python packages, run conda create --name redunet_official --file requirements.txt.

File Structure

Training

To train a model, run one of the training files, which are named after their datasets. For the commands that reproduce our experimental results, see the Experiments section below. All the training files are listed below:

  • gaussian2d.py: mixture of Gaussians in 2-dimensional real space
  • gaussian3d.py: mixture of Gaussians in 3-dimensional real space
  • iris.py: Iris dataset from UCI Machine Learning Repository (link)
  • mice.py: Mice Protein Expression Data Set (link)
  • mnist1d.py: MNIST dataset; each image is lifted to a multi-channel polar form and the model is trained to be rotationally invariant (see the polar-lifting sketch after this list)
  • mnist2d.py: MNIST dataset; each image is kept single-channel and the model is trained to be translationally invariant
  • sinusoid.py: mixture of sinusoidal waves, single and multichannel data
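
As a hedged illustration of the polar lifting mentioned for mnist1d.py, the sketch below resamples an image along concentric circles so that a rotation of the image becomes a cyclic shift along the angle axis, which is the invariance ReduNet can then exploit. The defaults plausibly correspond to the --channels and --time flags used below, but that mapping, the function name, and the sampling scheme are assumptions, not the exact code in mnist1d.py.

import numpy as np

def to_polar(image, n_radii=15, n_angles=200):
    """Hypothetical polar lifting: sample a square image at n_radii concentric
    circles and n_angles angles. Rotating the image then corresponds to a
    cyclic shift along the angle axis.
    image : (H, W) array
    returns : (n_radii, n_angles) multi-channel 1D signal
    """
    H, W = image.shape
    cy, cx = (H - 1) / 2.0, (W - 1) / 2.0
    radii = np.linspace(1.0, min(cy, cx) - 1.0, n_radii)
    angles = np.linspace(0.0, 2.0 * np.pi, n_angles, endpoint=False)

    # Nearest-neighbour sampling at each (radius, angle) grid point
    ys = np.rint(cy + radii[:, None] * np.sin(angles[None, :])).astype(int)
    xs = np.rint(cx + radii[:, None] * np.cos(angles[None, :])).astype(int)
    return image[ys, xs]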

Evaluation and Plotting

Evaluation and plotting are performed within each training file. The relevant functions are located in evaluate.py and plot.py.
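
The exact routines in evaluate.py are not described here, but MCR2-style features are commonly evaluated with a nearest-subspace classifier; the following is a hypothetical sketch of such a routine (function and argument names are illustrative, not necessarily those in evaluate.py).

import numpy as np

def nearest_subspace_accuracy(Z_train, y_train, Z_test, y_test, n_comp=10):
    """Hypothetical nearest-subspace evaluation: fit a principal subspace per
    class from training features and assign each test feature to the class
    whose subspace gives the smallest projection residual.
    Z_* : (d, m) feature matrices, columns are samples
    y_* : (m,) integer class labels
    """
    classes = np.unique(y_train)
    bases = []
    for c in classes:
        Zc = Z_train[:, y_train == c]                     # features of class c
        U, _, _ = np.linalg.svd(Zc, full_matrices=False)  # principal directions
        bases.append(U[:, :n_comp])

    # Residual of projecting each test feature onto each class subspace
    residuals = np.stack(
        [np.linalg.norm(Z_test - U @ (U.T @ Z_test), axis=0) for U in bases]
    )
    pred = classes[np.argmin(residuals, axis=0)]
    return float(np.mean(pred == y_test))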

Experiments

Run the following commands to train, test, evaluate and plot figures for different settings:

Main Paper

Gaussian 2D: Figure 2(a) - (c)

$ python3 gaussian2d.py --data 1 --noise 0.1 --samples 500 --layers 2000 --eta 0.5 --eps 0.1

Gaussian 3D: Figure 2(d) - (f)

$ python3 gaussian3d.py --data 1 --noise 0.1 --samples 500 --layers 2000 --eta 0.5 --eps 0.1

Rotational-Invariant MNIST: Figure 3(a) - (d)

$ python3 mnist1d.py --samples 10 --channels 15 --outchannels 20 --time 200 --classes 0 1 2 3 4 5 6 7 8 9 --layers 40 --eta 0.5 --eps 0.1  --ksize 5

Translational-Invariant MNIST: Figure 3(e) - (h)

$ python3 mnist2d.py --classes 0 1 2 3 4 5 6 7 8 9 --samples 10 --layers 25 --outchannels 75 --ksize 9 --eps 0.1 --eta 0.5

Appendix

For Iris and Mice Protein:

$ python3 iris.py --layers 4000 --eta 0.1 --eps 0.1
$ python3 mice.py --layers 4000 --eta 0.1 --eps 0.1

For 1D signals (Sinusoids):

$ python3 sinusoid.py --time 150 --samples 400 --channels 7 --layers 2000 --eps 0.1 --eta 0.1 --data 7 --kernel 3

For 1D signals (Rotational Invariant MNIST):

$ python3 mnist1d.py --classes 0 1 --samples 2000 --time 200 --channels 5 --layers 3500 --eta 0.5 --eps 0.1

For 2D translational invariant MNIST data:

$ python3 mnist2d.py --classes 0 1 --samples 500 --layers 2000 --eta 0.5 --eps 0.1

Reference

For technical details and full experimental results, please check the paper. Please consider citing our work if you find it helpful to your own:

@article{chan2020deep,
  title={Deep networks from the principle of rate reduction},
  author={Chan, Kwan Ho Ryan and Yu, Yaodong and You, Chong and Qi, Haozhi and Wright, John and Ma, Yi},
  journal={arXiv preprint arXiv:2010.14765},
  year={2020}
}

License and Contributing

  • This README is formatted based on paperswithcode.
  • Feel free to post issues via GitHub.

Contact

Please contact [email protected] and [email protected] if you have any questions about the code.
